Glossary
Continuous Reinforcement
A schedule of reinforcement where the desired behavior is reinforced every single time it occurs. This leads to rapid learning but also rapid extinction if reinforcement stops.
Example:
A vending machine that always dispenses a drink when money is inserted operates on a continuous reinforcement schedule.
Extinction
The gradual weakening and eventual disappearance of a conditioned response when reinforcement is no longer provided. In operant conditioning, this occurs when a learned behavior is no longer rewarded.
Example:
If a child stops getting candy for throwing a tantrum, they will eventually stop throwing tantrums, illustrating extinction of the behavior.
Fixed-Interval (FI)
A partial reinforcement schedule that reinforces the first response after a fixed amount of time has passed. Response rates tend to increase as the time for reinforcement approaches.
Example:
Students studying intensely right before a weekly quiz, knowing the quiz will occur at a set time, are responding to a fixed-interval (FI) schedule.
Fixed-Ratio (FR)
A partial reinforcement schedule that reinforces a response only after a specified number of responses. The number of responses required for reinforcement is constant.
Example:
A coffee shop offering a free drink after every 10 purchases uses a fixed-ratio (FR) schedule.
Negative Punishment
Removing a desirable stimulus after a behavior to decrease the likelihood of that behavior recurring. It involves taking away something pleasant.
Example:
When a teenager loses their phone privileges for breaking curfew, it's an example of negative punishment, as a desirable item is removed.
Negative Reinforcement
Removing an aversive stimulus after a behavior to increase the likelihood of that behavior recurring. It involves taking away something unpleasant.
Example:
Putting on your seatbelt to stop the annoying beeping sound in your car is an example of negative reinforcement, as it removes an unpleasant stimulus.
Operant Conditioning
A type of learning where the likelihood of a behavior is increased or decreased by the consequences that follow it. It focuses on voluntary behaviors, unlike classical conditioning which deals with automatic responses.
Example:
A student learns to study more often for tests because good grades (a positive consequence) follow their studying, demonstrating operant conditioning.
Partial (Intermittent) Reinforcement
A schedule of reinforcement where the desired behavior is reinforced only some of the time. This leads to slower learning but greater resistance to extinction compared to continuous reinforcement.
Example:
A fisherman who doesn't catch a fish every time they cast their line is experiencing partial (intermittent) reinforcement.
Positive Punishment
Adding an aversive stimulus after a behavior to decrease the likelihood of that behavior recurring. It involves presenting something unpleasant.
Example:
Receiving a parking ticket for parking illegally is an example of positive punishment, as an undesirable consequence (the ticket) is added.
Positive Reinforcement
Adding a desirable stimulus after a behavior to increase the likelihood of that behavior recurring. It involves presenting something pleasant.
Example:
When a dog sits on command and immediately receives a treat, the treat is an example of positive reinforcement.
Punishment
Any consequence that weakens a behavior, making it less likely to occur again in the future. Its goal is to decrease the frequency of an undesirable behavior.
Example:
A parent scolding a child for hitting their sibling is a form of punishment, aiming to reduce the hitting behavior.
Reinforcement
Any consequence that strengthens a behavior, making it more likely to occur again in the future. It is a core principle in operant conditioning.
Example:
Giving a child praise for sharing their toys acts as reinforcement, increasing the chances they will share again.
Shaping
An operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior. It's used to teach complex behaviors by rewarding small steps.
Example:
Training a cat to use a toilet by first rewarding it for sitting on the toilet seat, then for eliminating in the toilet, is an example of shaping.
Variable-Interval (VI)
A partial reinforcement schedule that reinforces the first response after an unpredictable amount of time has passed. This schedule produces slow, steady rates of responding.
Example:
Checking your email periodically for a reply, not knowing exactly when one will arrive, is an example of a variable-interval (VI) schedule.
Variable-Ratio (VR)
A partial reinforcement schedule that reinforces a response after an unpredictable number of responses. This schedule produces high rates of responding and is highly resistant to extinction.
Example:
Playing a slot machine, where the payout comes after an unpredictable number of plays, is a classic example of a variable-ratio (VR) schedule.