zuai-logo

Glossary

C

Continuous Reinforcement

Criticality: 2

A schedule of reinforcement where the desired behavior is reinforced every single time it occurs. This leads to rapid learning but also rapid extinction if reinforcement stops.

Example:

A vending machine that always dispenses a drink when money is inserted operates on a continuous reinforcement schedule.

E

Extinction

Criticality: 2

The gradual weakening and eventual disappearance of a conditioned response when reinforcement is no longer provided. In operant conditioning, this occurs when a learned behavior is no longer rewarded.

Example:

If a child stops getting candy for throwing a tantrum, they will eventually stop throwing tantrums, illustrating extinction of the behavior.

F

Fixed-Interval (FI)

Criticality: 3

A partial reinforcement schedule that reinforces the first response after a fixed amount of time has passed. Response rates tend to increase as the time for reinforcement approaches.

Example:

Students studying intensely right before a weekly quiz, knowing the quiz will occur at a set time, are responding to a fixed-interval (FI) schedule.

Fixed-Ratio (FR)

Criticality: 3

A partial reinforcement schedule that reinforces a response only after a specified number of responses. The number of responses required for reinforcement is constant.

Example:

A coffee shop offering a free drink after every 10 purchases uses a fixed-ratio (FR) schedule.

N

Negative Punishment

Criticality: 3

Removing a desirable stimulus after a behavior to decrease the likelihood of that behavior recurring. It involves taking away something pleasant.

Example:

When a teenager loses their phone privileges for breaking curfew, it's an example of negative punishment, as a desirable item is removed.

Negative Reinforcement

Criticality: 3

Removing an aversive stimulus after a behavior to increase the likelihood of that behavior recurring. It involves taking away something unpleasant.

Example:

Putting on your seatbelt to stop the annoying beeping sound in your car is an example of negative reinforcement, as it removes an unpleasant stimulus.

O

Operant Conditioning

Criticality: 3

A type of learning where the likelihood of a behavior is increased or decreased by the consequences that follow it. It focuses on voluntary behaviors, unlike classical conditioning which deals with automatic responses.

Example:

A student learns to study more often for tests because good grades (a positive consequence) follow their studying, demonstrating operant conditioning.

P

Partial (Intermittent) Reinforcement

Criticality: 3

A schedule of reinforcement where the desired behavior is reinforced only some of the time. This leads to slower learning but greater resistance to extinction compared to continuous reinforcement.

Example:

A fisherman who doesn't catch a fish every time they cast their line is experiencing partial (intermittent) reinforcement.

Positive Punishment

Criticality: 3

Adding an aversive stimulus after a behavior to decrease the likelihood of that behavior recurring. It involves presenting something unpleasant.

Example:

Receiving a parking ticket for parking illegally is an example of positive punishment, as an undesirable consequence (the ticket) is added.

Positive Reinforcement

Criticality: 3

Adding a desirable stimulus after a behavior to increase the likelihood of that behavior recurring. It involves presenting something pleasant.

Example:

When a dog sits on command and immediately receives a treat, the treat is an example of positive reinforcement.

Punishment

Criticality: 3

Any consequence that weakens a behavior, making it less likely to occur again in the future. Its goal is to decrease the frequency of an undesirable behavior.

Example:

A parent scolding a child for hitting their sibling is a form of punishment, aiming to reduce the hitting behavior.

R

Reinforcement

Criticality: 3

Any consequence that strengthens a behavior, making it more likely to occur again in the future. It is a core principle in operant conditioning.

Example:

Giving a child praise for sharing their toys acts as reinforcement, increasing the chances they will share again.

S

Shaping

Criticality: 2

An operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior. It's used to teach complex behaviors by rewarding small steps.

Example:

Training a cat to use a toilet by first rewarding it for sitting on the toilet seat, then for eliminating in the toilet, is an example of shaping.

V

Variable-Interval (VI)

Criticality: 3

A partial reinforcement schedule that reinforces the first response after an unpredictable amount of time has passed. This schedule produces slow, steady rates of responding.

Example:

Checking your email periodically for a reply, not knowing exactly when one will arrive, is an example of a variable-interval (VI) schedule.

Variable-Ratio (VR)

Criticality: 3

A partial reinforcement schedule that reinforces a response after an unpredictable number of responses. This schedule produces high rates of responding and is highly resistant to extinction.

Example:

Playing a slot machine, where the payout comes after an unpredictable number of plays, is a classic example of a variable-ratio (VR) schedule.