[Intro. to Psy Course Note] Ch 6
Conditioning and Learning
Some types of learning
- Learning
- Associative learning- 2 types of associative learning: classical conditioning / operant conditioning
- Reinforcement: any event that increases the probability that a response will recur
- Antecedents: events that precede a response. more important in classical conditioning
- Consequences: effects that follow a response. more important in operant conditioning
 
- Cognitive learning
Classical conditioning
- Discovered by Ivan Pavlov, the Russian physiologist who studied digestion by presenting dogs with meat powder and measuring salivation
- Neutral stimulus 中性刺激: stimulus that does not evoke a response
- Conditioned stimulus 條件刺激: stimulus that evokes a response because it has been repeatedly paired with an unconditioned stimulus
- Unconditioned stimulus 無條件刺激: stimulus innately capable of eliciting a response
- Unconditioned response: innate reflex response elicited by an unconditioned stimulus
- Conditioned response: learned response elicited by a conditioned stimulus
- Unlike Pavlov, many psychologists think that classical conditioning involves cognition because it is related to information that might aid survival- Informational view: organisms look for associations among events
- Expectancy: expectation about how events are interconnected
 
- principles of classical conditioning- Acquisition: training period in conditioning when a response is strengthened
- Higher-order conditioning 高層次制約: a conditioned stimulus is used to reinforce further learning; the CS is used as though it were a UCS
 
- Extinction: weakening of a conditioned response through removal of reinforcement
- Spontaneous recovery: reappearance of a learned response following apparent extinction
- Generalization and discrimination- Stimulus generalization 刺激類化
- Stimulus discrimination 刺激區辨
 
- Classical conditioning in humans- Phobia
- Conditioned emotional response (CER) 制約情緒反應
- Vicarious classical conditioning 替代古典制約: learning to respond emotionally to a stimulus by observing another’s emotional reactions
 
- Overcoming phobias- Systematic desensitization: decreasing fear or anxiety by exposing phobic people gradually to feared stimuli while they stay calm and relaxed
 
Operant conditioning (Instrumental Learning)
- Learning is based on the consequences of responding; we associate responses with their consequences
- Law of effect (Edward Thorndike): the probability of a response is altered by the effect it has, so responses that lead to desired effects are repeated; those that lead to undesired effects are not
- Operant reinforcer: any event that follows a response and increases its likelihood of recurring
- Acquiring an operant response- Conditioning chamber (Skinner box): 關動物做實驗
- Animals actively emit behavior; consequences influence how frequently behavior occurs
- Response-contingent reinforcement 反應依仗強化: reinforcement given only after a desired response occurs
 
- timing of reinforcement- Operant reinforcement most effective when given immediately after a correct response
- Response chaining: a linked series of actions that leads to reinforcement
- Superstitious behaviors: behaviors that are repeated because they appear to produce reinforcement (i.e., the timing is right), even though they are not necessary (動物可能發展出奇怪的儀式化反應;它可能輕敲自己的頭,轉圈子,以後腳站立,或是做一系列當餵食機開啟時,他正在做的行為。這些儀式化的行為,我們把它們歸類為迷信行為,因為動物看起來似乎相信它所做的行為使得食物丸出現,因為在這種情境裡獎賞與動物的行為無關,我們將它稱為非連鎖增強作用)
 
- Shaping: 海洋公園訓練海狗那樣
- Operant extinction- When learned responses are NOT reinforced, they gradually fade away
- Negative attention seeking: using misbehavior to gain attention
- 消除不良行為的方法 -> 消弱,讓個體失去增強物,以降低不良行為發生頻率
 
- Reinforcement contingencies 強化依仗性- Reinforcer 強化物
- Reinforcement 強化 (動作本身): increases probability of that behavior
- 正強化: behavior is followed by delivery of appetitve stimulus
- 負強化: behavior is followed by removal of aversive stimulusk
 
- Punishment- Punisher 跟上面一樣概念
- Punishment -> decreases probability of that behavior
- 正懲罰: behavior is followed by delivery of aversive stimulus
- 負懲罰: behavior is followed by removal of appetitive stimulus
- Downside of punishment- Aversive stimulus 嫌惡刺激: 不舒服的 stimulus
- Escape learning: learning to make a response to end an aversive stimulus
- Avoidance learning
 
 
- Stimulus control- Operant stimulus generalization 刺激類化
- Operant stimulus discrimination 刺激區辨
 
- Operant reinforcers- Primary reinforcer 初級強化物: unlearned and natural; satisfies physiological needs (e.g., food, water, sex)- Intracranial stimulation (ICS) 顱內刺激: natural primary reinforcer; involves direct activation of brain’s “pleasure centers”
 
- Secondary reinforcer 次級強化物: learned reinforcer (e.g., money, grades, approval, praise); gains reinforcing properties by associating with a primary reinforcer
- Token reinforcer 代幣強化物: tangible secondary reinforcer
- Social reinforcer 社會強化物: attention and approval provided by other people
 
- Primary reinforcer 初級強化物: unlearned and natural; satisfies physiological needs (e.g., food, water, sex)
- Reinforcement concepts- Schedules of reinforcement
- Continuous reinforcement
- Partial reinforcement- Fixed ratio schedule 定比強化: a set number of correct responses must be made to obtain a reinforcer
- Variable ratio schedule 不定比強化: varied number of correct responses must be made to get a reinforcer
- Fixed interval schedule 定時強化
- Variable interval schedule 不定時強化
 
- Partial reinforcement effect 部分強化效應: responses acquired with partial reinforcement are more resistant to extinction
 
Cognitive learning
- Higher-level learning involving thinking, knowing, understanding, and anticipating
- Latent learning 潛伏學習: occurs without obvious reinforcement and is not demonstrated (or is hidden) until reinforcement is provided
- Cognitive map 認知地圖: internal images or other mental representations of an area (maze, city, etc.) that underlie an ability to choose alternate paths to the same goal
- Feedback
- Learning aids- Programmed instruction: learning format that presents information in small amounts, gives immediate practice, and provides continuous feedback
- Computer-assisted instruction (CAI): learning aided by computer-presented information and exercises
- Serious games: CAI presented in a game format
- Educational simulations: imaginary situations or “microworlds” that simulate real-world problems
 
- Discovery learning- Rote learning 機械式學習
- Base on insight & understanding
 
- Observational learning- Occurs by watching and imitating actions of another person or by noting consequences of a person’s actions
- Model 模範
 
- Self-management skills
- Premack principle- Any high-frequency response can be used to reinforce a low-frequency response (e.g., no TV watching until you finish your homework)
- Self-recording: self-management based on keeping records of response frequencies
 
- How to break bad habits- Alternate behavior: try to get the same reinforcement with a new response
- Break up response chains: scramble the chain of events that leads to an undesired response
- Reduce cues and antecedents: try to avoid, narrow down, or remove stimuli that elicit the bad habit
- Use covert sensitization and reward- Covert sensitization 隱藏式敏感化: the use of aversive imagery to reduce undesired behavior
- Covert reinforcement 隱藏式強化: the use of positive imagery to reinforce desired behavior