The Everyday Trainer Podcast

Learning Theory For Real Life Dogs

Meghan Dougherty

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 38:47

You can love your dog and still feel baffled by their choices. Why do they blow off a recall they “know,” pull like a freight train, or lose their mind at another dog the second the stakes go up? We’re going back to the foundation that makes all of it make sense: learning theory, the scientific framework behind how behavior is built and changed.

I’m Meg, and I walk through the two big pillars we use in dog training: classical conditioning and operant conditioning. We talk about how associations shape emotions, why marker words work when they’re properly conditioned, and how tiny timing errors can blur the picture and create a dog that feels uncertain. If you’ve ever wondered why treats work beautifully at home but fall apart around squirrels, strangers, or other dogs, you’ll finally have language for what’s happening.

Then we get practical with the four quadrants of operant conditioning, including what “positive” and “negative” really mean. We also tackle the balanced training versus purely positive reinforcement debate without the online drama, including where a full toolkit can matter for genetics, prey drive, reactivity, and long histories of accidental reinforcement. Finally, we talk e-collar training at a true working level, the two phases I use for conditioning, and how reading dog body language like a green yellow red traffic light tells you when learning is actually possible.

If you want more of this kind of training science made usable, subscribe, share the episode with a dog friend, and leave a review so more owners can find it. What topic should we unpack next?

Visit us on the website here to see what we've got going on and how you can join our pack of good dogs and owners.


Welcome And Why It Matters

SPEAKER_00

Hello, hello, and welcome back to the Everyday Trainer Podcast. My name is Meg, and I am a dog trainer. Okay, so today's episode is one that I've been wanting to do for a really long time. We are talking about learning theory. And I know that sounds like the most boring thing I could possibly say to open a podcast episode, but I genuinely think this is the most important thing I teach. And once you understand it, everything about how you train your dog, how you think about your dog's behavior, how you talk about your methodology just clicks. Like everything makes sense. I cover this in week one of the virtual shadow program because I believe that you cannot be a confident trainer. You cannot defend your methods, you cannot troubleshoot a problem in real time if you don't understand the science behind why you're doing what and like why it works. And honestly, a lot of dog owners are the same. You don't have to want to be a trainer to benefit from understanding learning theory. So if you have a dog, this episode is going to change how you see them. You guys know the drill, grab your tasty drink and meet me back here. I want to start this episode by saying something that I think trips people up before we even begin getting into learning theory. Um, learning theory is not a training philosophy. It's not balance training versus positive reinforcement. It's not controversial, it's not somebody's opinion. This is a scientific framework that describes how behavior is acquired and changed. That's it. And it applies to every animal, including us, including your dog, including the most stubborn cattle dog on the planet. So once you understand that, the controversy around different training methods goes away because you realize it's not really a debate about values, it's a debate about which tools and quadrants people are willing to use. And we will absolutely get into that today. Okay, let's start from the beginning. So, what is learning theory and why does it matter? Learning theory is the scientific study of how behavior is acquired, maintained, and changed through experience. I have lots of notes this episode, just so you guys know. So there are two main types that we use in dog training: there's classical conditioning and operate conditioning, and we're gonna cover both of them today. So let me start with why this matters for you as a dog owner, because I think this is the place that makes people actually care. Every single thing you do with your dog is training, whether you intend it to be or not. Every time you let your dog jump on you and you give them attention, you have trained something. Every time your dog pulls to whatever tree they want to sniff and you let them get there, you have trained something. Every time you call your dog and they don't come and you just let it go, you've trained something. You are always training. The question is just whether you're doing it intentionally or accidentally. So understanding how learning works doesn't just make you a better dog trainer, dog owner. It makes you understand your dog better. And when you understand your dog, you stop thinking they're stubborn or dominant or spiteful. You're thinking, okay, what behavior have I accidentally reinforced? What am I communicating to my dog in this moment? What does this dog actually understand versus what am I assuming they understand? That shift in thinking changes everything because now instead of being frustrated at your dog, you're more curious about them. And curious, I think, is a much better headspace to train from than frustrated. Okay, let's start with classical conditioning. So, classical conditioning is learning through association. Most people have heard of this, even if they don't know the name. Think about Pavlov. I've mentioned this in previous episodes. He was the scientist who noticed that his dog started salivating, not just when they got the food, but when they heard the bell that came before the food, which is very important. It came before the food. So because the bell was associated or preceded food, it had been paired so many times. The bell alone then triggered the same response that food would. That is classical conditioning. You pair a neutral stimulus with something that already has meaning, and eventually the neutral stimulus takes on that meaning all by itself. We use this constantly in dog training. The most obvious place you see it is in marker words. So when I first teach a dog what yes means, I'm pairing the sound of my voice, yes, with a treat. I follow it up with a reward of some sort over and over until yes alone makes the dog excited. The word then becomes meaningful through association. So that is classical conditioning. It's also what we're doing when we introduce uh like scary things slowly and pair them with good things. So, like your dog is nervous around strangers, we pair strangers with a high value reward. Over time, the dog starts to feel differently about strangers. It's the same principle. Operant conditioning is different. Operant conditioning is learning through consequences. The dog does something, something happens as a result, and that outcome changes the likelihood of the dog doing that thing again. So, this is the foundation of all the training we're gonna talk about today, and it breaks down into four quadrants that I will walk you through one by one. It took me, like to be completely honest, it took me probably far too long to actually understand the quadrants. And I have my little notes in front of me just so that I don't mess anything up, but I am sharing all of this information with you, knowing that I don't necessarily talk to my clients about this in sessions, and I talk to my virtual shadow program students a lot about this. Is that like I don't necessarily use the fancy terminology, but I do think that it's very important for us to understand the science and why we're doing what we're doing. So before I get into the four quadrants, I want to clear something up. In learning theory, positive and negative don't mean good and bad. Positive means you're adding something, negative means you're removing something. And that distinction matters enormously and is what trips most people up when they first learn this. I this, I really struggled with this, I'm not gonna lie. And I also like went to school for this and I still struggle with it a little bit. So just hold that in your head as we go through the four quadrants. Okay, four quadrants. Each one is defined by two things: whether you're adding or removing something, and whether the result increases or decreases a behavior. So let's go through all four. Quadrant one is positive reinforcement or R. You add something, the dog likes, the behavior increases. For example, dog sits, you give them a treat, then the dog sits more. That is positive reinforcement. This is what most people think of when they think of dog training, and it makes sense because it's the most intuitive. You do the thing that I want, good stuff happens, you do it more, clean, clear, simple. Positive reinforcement works incredibly well. We use it constantly. I use food, play, praise, access to things that the dog wants. The important thing to understand, though, is that the dog decides what is reinforcing, not you. If your dog doesn't actually want the treat you're offering, it is not a reinforcer. It's just a piece of food. This is why some dogs seem unresponsive to food-based training, and why some trainers don't like to use food in training. Either the food isn't valuable enough or the environment is competing too hard. So, like the squirrel, the other dog, the smell on the ground, whatever it is, is more interesting than what you're offering. And in that moment, food is not doing its job as a reinforcer. I raise my dogs from puppies working for their meals. So when I say yes and offer food, they come sprinting back to me because food means something. That's a trained response built over time. It doesn't just happen automatically. All right, quadrant two, negative punishment. P minus, if you kind of see it in the diagram, you remove something the dog likes and the behavior decreases. So we most often see this with like dog jumps on you, you turn away and cross your arms and take away your attention. The dog stops jumping because the thing that they wanted, which is your attention, went away. This is actually used really uh like frequently in purely positive training, removing attention, removing access to something the dog wants. It's not harsh, it's just clear communication. You do that thing, the good stuff goes away. Okay, quadrant three positive punishment P plus. You add something the dog doesn't like, the behavior decreases. So dog pulls on leash. I can give the dog a pop on a prong collar, I can tap on the E collar, the dog stops pulling, pressure turns off, all that jazz. This is the quadrant that gets the most controversial. And I understand why, because the word punishment makes people uncomfortable. But remember what positive means here. It means you're adding something, not good or bad, just adding. When we use leash pressure, a prong collar, or any collar at appropriate levels with the right timing, we're using positive punishment. And the key word in that sentence are appropriate and right timing, because a tool applied poorly at the wrong level with bad timing is a problem. The quadrant itself is not the problem. The application is. Quadrant four, negative reinforcement, R-Yove something the dog doesn't like, the behavior increases. So dog is in a sit with leash pressure, right? I can guide the dog back into a sit with leash pressure. Dog sits, pressure releases, the dog learns that staying in the position makes the pressure go away. So they do it more. This is the foundation of the first phase of e-color conditioning, which we're definitely going to get into. And I've talked about this many, many times in other episodes, but it's essentially the sensation goes on, the dog figures out how to turn it off by doing what you asked, the behavior increases because it feels better than not doing it. Not because they're scared, it's because they've solved the puzzle. Okay, we're gonna bring it all together. So a purely positive reinforcement trainer works almost exclusively in R and P minus. They add good things and remove good things, they avoid adding anything uncomfortable. A balanced trainer uses all four quadrants based on what the dog in front of them needs in that moment. That is not a physical sorry, I cannot talk. That is not a philosophical position. That is just a complete toolkit. Think about it this way: if you only own a hammer, every problem looks like a nail. That works great for nails, not so great for screws or other tools. A full toolkit means you can actually help more dogs, including the dogs that have spent two years learning that ignoring you is a viable option, including the dog that is genetically driven, that food simply cannot compete in that moment. Dogs that purely positive methods genuinely couldn't help. And look, I want to say this clearly: I am not anti-positive reinforcement. We use positive reinforcement constantly. It is a huge part of how I train. I use a ton of food in our training. You can talk to my clients, I'm like, grab a handful of food, and they're like, is this enough? And I'm like, no, more. But it is just one of four quadrants, and pretending the other three don't exist doesn't really serve the dogs that we're working with. So let's get into classical conditioning and marker words. We'll start with marker words because this is where classical conditioning shows up most in our day-to-day training. And it's the thing I am most picky with in my students. A marker word is a precise sound that tells the dog exactly which behavior earned the reward. It works because of classical conditioning. So you pair the word with something the dog loves over and over until the word itself has meaning. The reason we use a word instead of just delivering food is because of the timing. By the time you pull a treat out of your pocket, the moment has already passed. The dog doesn't know what behavior you're you're rewarding, but the word can be delivered in the exact second the right behavior happens. And that precision is everything in dog training. So here's my system, four words, that's it. We keep it simple. Yes is my terminal marker. It means that was it. Come, you're done with the behavior. Come to me, get your reward. So when I say yes, the dog releases out of whatever position they're in and comes to get their food and they shoot out of the command. Good is my continuation marker. It means that's right, keep doing what you're doing. Your reward is coming to you in position. Good is an implied stay. When I say good, the dog holds what they're doing because the reward is coming to them where they are. No means don't do that. A correction is coming. And uh-uh is that's not what I asked. Try again, but there's no correction. Four words, very simple communication. Now, here is the mistake I see most every single student make when they first submit their videos in the virtual shadow program. And it's the same mistake I see dog owners make everywhere. They say yes and rewarding position, or they say good and let the dog wander off. And it seems small, but it's not small because now the dog doesn't know whether yes means stay or come. They don't know whether good means to hold or release. You've blurred the picture, and a very blurry picture creates a dog that's uncertain. And an uncertain dog is slower, they're less confident and harder to train. And I see it in my students' videos all the time when we're inconsistent with what we expect with each word. The dog is kind of confused and like doesn't really know what they need to do. So precision in your marker words is not about being like rigid or robotic. It's just about being clear. And clarity creates a dog that actually wants to work with you because they understand how to get rewarded, how to get the good stuff. I learned my marker word foundations from trainers who work I really respect. And over the years, I've refined the system based on what I see actually working with the dogs and owners I work with. This like four-word system, I did not invent, um, but it's what I've landed on because it's simple enough for any owner to use, clear, and it's super clear that the dog can always understand what's being communicated. One more thing on marker words. The conditioning matters. You can't just say yes and throw food once and expect the dog to have a strong marker. You have to build the association over time. Like I'm talking hundreds of repetitions. And the stronger the marker, the more useful it becomes. And a really strong yes can eventually be used to disrupt reactivity, redirect a dog away from a trigger, and build speed and drive in your training. But that only works if the marker is conditioned really well. So put in the time early. All right, let's get into balanced versus R and what the actual difference is. I want to talk about this without the drama because honestly, the drama is mostly online and it's not what I see in real training sessions. I have been training dogs for over seven years. I have trained thousands of dogs and owners. I have clients who come to me from force-free trainers who couldn't get results, and I have clients who come to me from trainers who were too heavy-handed. I have genuinely seen it all. And here is what I actually think. Purely positive reinforcement training is effective. I'm not here to tell you that it doesn't work for many dogs and many behaviors. It works incredibly well. If you have a food-motivated, easy-going dog who has been raised well from puppyhood and you're working in low distraction environments, positive reinforcement only can get you very far. So the question isn't whether it works. The question is whether it covers every dog and every situation you are going to encounter as a trainer or as an owner. And in my honest experience, for a lot of dogs and a lot of real world situations, it doesn't. Here's where a balanced approach fills in those gaps. The dog knows the behavior, but chooses not to do it when something more interesting is happening. So, like you call your dog, they look at you, they look at the squirrel, they choose the squirrel. They know the recall. They just decided the squirrel was a better deal. And a treat is not going to compete with that in that moment. So then what do you do? The dog who has been accidentally reinforced for two years for pulling on leash, every single walk, they pull and they get to go where they want to go. That's two years of positive reinforcement for pulling. And now you're trying to undo it with treats. And it can be done, but it's slow. And in the meantime, you're getting dragged down the street every morning. The dog with serious prey drive or reactivity who genuinely cannot hear you when they're over threshold and they're popping off at another dog, their nervous system is flooded. Food is not on their radar. So you need a way to communicate that can get through to them when they are in that state of mind. And these are all real situations that every dog owner encounters constantly. And a full toolkit gives you ways to address them that don't exist in a purely positive approach. Now, here's the thing I want to say about genetics because I don't think it gets talked about enough. So much of a dog's behavior comes down to genetics, like so much more than people want to admit. The dog that blows through a recall because the prey drive is overwhelming, that dog's genetics are pushing really hard in a particular direction. That is not a training failure. That's just what that dog is. I have had bored and trained dogs where I've done everything right and there are still certain things that the dog is always going to want to do because of what they were bred for. And our job as trainers and as owners is to recognize what our dog actually is, not what we wish they were, not what the breeder told us they'd be, what they actually are. So get real with yourself about your dog and then use all the tools available to you to help them be the best version of themselves in the life you guys have built together. Let's talk about the e-collar. So I want to spend a few minutes specifically on e-collar because I know it's the tool with the most controversy around it. And I think the controversy comes almost entirely from misinformation and not from people who have actually used one correctly. So let me explain what it is. An e-collar is an electronic collar. That's where the e comes from. It delivers a low-level electrical stimulation to the dog's neck. And when I say low level, I genuinely mean it. Obviously, not all e-collars are created equal. And obviously, we can use it at high levels, but at the working level, which is the lowest level the dog actually feels the sensation. It is just a tap. It is the same sensation as a TENS unit, which is what physical therapists use on humans for muscle recovery. It's like a vibration. We're not actually shocking the dog. So here is what I do every single time. I have a client who's skeptical and I have this conversation on basically every intake call. I put a collar at working level and I let them feel it on their own wrist every time. And every single time they go, That's it? That's what you're using. Yeah, that's it. So the idea that e-collars are torture devices comes from people who have never actually felt one or from stories about people using them incorrectly at very high levels, which is a real problem, but it's a problem of misuse, not a problem with the tool. And like I said, we can definitely use the e-collar as a correction. And, you know, I had a conversation with my client about this today. We're kind of in the stage of reactivity training where I'm like, let your dog mess up, let your dog go out and commit to being reactive, and we can correct that. And it's going to be really uncomfortable. But guess what? They're not going to do it again because that is a true punishment, right? So the difference between a punishment and just like a disruptor is a punishment stops the behavior from repeating again. Whereas like a disruptor just disrupts it in that moment. But if the dog were to be in the same situation, they might pop off at the dog again. Okay. Anyways, so here's how I use the e-collar in two phases. Phase one is escape. This is where I teach the dog what the sensation is and how to turn it off. The collar goes on at a working level just before or at the same time as my cue. So the cue tells them how to turn the sensation off. So I'm walking, the dog gets out in front of me, I tap on the collar, I change direction, dog follows me, sensation stops. Over time, the dog learns when I feel that tap, I pay attention and I move with her. That turns it off. That is literally the whole game. And we can pair this with whatever we want. So I can say place, tap, tap, tap, tap, dog goes on to place, tapping stops. I can say sit, tap, tap, tap, tap, dog goes into a sit, tapping stops. I get out of that phase as fast as possible. The second the dog shows me they understand, meaning they start moving with urgency before I've finished the cue. I am moving on. Because here's the thing about staying in phase one for too long: the collar sensation becomes paired with every obedience command. And then the dog starts to avoid the command because it's always associated with the collar going on. And I see this all the time with dogs who have been trained by somebody else before they come to me. The dog hears place and shuts down because their whole experience of place is the collar turning on. That means somebody spent way too long in phase one. So phase two is avoidance. This is what I send dogs home in. Tell the dogs what you want, give them a moment. If they don't follow through, the collar goes on and stays on until they do. Tap, tap, tap until they move into position. Once they're there, it stops. They learn that doing the thing makes it stop. Doing the thing becomes the better option. For pet owners, I keep it really simple. Tell your dog what you want, give them a chance. If they don't take it, that's when the collar comes on. We are not layering the collar over commands constantly. We're using it as accountability after the dog has already been given the opportunity to choose. So that's the difference is I like the dog is given the opportunity to avoid the STEM entirely if they follow through with what I've asked of them. And one more thing: your working level is going to change based on the environment, based on the dog's arousal state, based on what's happening around them. A dog that works at 12 in your backyard might need a 40 around the dog park. That's just physiology. Their arousal level is higher, their threshold for feeling the sensation is higher. So you adjust. You always adjust to the dog in front of you. Don't get hung up on the levels, just go off of how the dog is responding. Okay, last section before we wrap up. And this one is very important because all of this theory only works if you are actually paying attention to the dog in front of you. Dogs are communicating constantly. Their body language is a full language, and most people miss most of it. Not because they're bad owners, just because they haven't learned to see it yet. Once you start seeing it, you can't unsee it and it changes every single training session. I think about dog body language like a traffic light: green, yellow, red. And your job in any training session is to know what color your dog is in. Green zone is where you want to be. The dog is relaxed and ready to learn, loose body, soft eyes, relaxed mouth, maybe a little wiggly. This dog is open. This dog is available for training. This is when you make the most progress in your training sessions. Yellow zone is stress. This is like the lip licking when they're not thirsty, yawning, looking away, sniffing the ground, the whale eyes, where you can see like the whites of their eyes, a tucked tail, ears back. These are like the dog is telling you they're uncomfortable. They're telling you to slow down, back up, and kind of give them more space. So in the yellow zone, you can still train, but you need to adjust and lower your criteria. Increase your reward rate. When I see the dog starting to get, I tell my owners like they'll get a little flat. We'll do some like treat chases where I do like get it, right? Or for my dogs, if they're getting a little flat in training, I get them barking at me. Um, this could be adding distance to whatever is stressing them. Just don't push them too far to the point where the dog is like completely shutting down. So the red zone is completely shut down. They're stiff, their eyes are just like hard, high, stiff tail, hackles up. Um, they're fixating on something, they're not able to look away. This dog is not available for learning, and the training stops. You're not gonna get the response that you want. And if you try to force it, you're probably going to make things worse. So, your job when the dog is in this like red zone is to help this dog get back to green. So, distance, decompression time, food traces chases, like whatever you need to do to bring that dog back to you. And here's the practical application of this. Before you ever start a training session, check out your dog. Watch them. What zone is that dog in right now? What happened before you got here? Is like if you're a dog trainer, is the owner stressed? Is there a trigger nearby? Is the dog already amped up from something else? Were they barking in the car on the way over to the session? You plan for the session. It should be based on what the dog is actually giving you in that moment, not the plan that you had in your head on the way over to the session. And that's one thing that I think a lot of my owners really struggle with is there's you can have a plan, but we can't always follow the plan to the T because we have to be able to look at the dog in front of us. And I had a uh follow-up session with a board and train today, and that's kind of what I was talking about. And we did three lessons. The first one, I covered punishment, basically. I was like, all right, this is when we would punish certain behaviors. We're keeping it super simple. Day two, we went over an active training session. What does that look like? How can I get the dog excited to train even when there's other dogs around, even when there's triggers around? I can, you know, open the window of training, say, are you ready? Do our treat chases, throw in some obedience, right? And the dog doesn't care about what is going on. And then our final day, we went over how to redirect her dog, right? So we covered like, hey, if you're able to redirect your dog and let's say, you know, yes, and reward them before they have a chance to go be reactive, do that. You know, if you've missed that and the dog is going and like fully lunging, has committed, then hey, I want you to punish that behavior, but we're not gonna linger there. I want you to get the dog back on you, get them hyped up, bring them back into that green zone. But you have to be able to read the dog in front of you and constantly either like you know, get them back up if they're a little bit shut down or like stressed out. How can I get that dog up? Tree chases, like I told her, like, push your dog a little bit. He likes that. Like you push him, hype him up, and wow, good job. He's like, oh my gosh, okay, okay. So being able to have our tools available to us, either redirecting or rewarding or punishing, and reading the dog in front of us to decide what we use in that moment. So I will adjust a whole session based on what I see in the first two minutes, because a dog in the yellow zone needs something completely different from a dog in a green zone. And running your planned session on a yellow zone dog without adjusting is one of the most common mistakes I see new trainers make. The trainers who get consistent results read the dog in front of them. They're not just following a perfect protocol, they're in a conversation with the real living animal in front of them. Okay, let's tie all of this together. Let's bring it all home. Learning theory is the science of how behavior is acquired and changed. It is not a philosophy, it is not up for debate. Classical conditioning is how we build associations and it's the foundation of our marker words. Operant conditioning happens in four quadrants, and a complete trainer uses all four based on what the dog actually needs. Positive reinforcement is powerful, it is one of the best tools we have, but it is one of four. And pretending the others don't exist doesn't help our dogs. The e-collar is not a torture device. At working level, it is a tap, it is a sensation used correctly in two phases. It gives us one of the clearest forms of communication we can have with a dog. And body language is your real-time feedback system. Learn to read it, it tells you everything about whether your training is landing or not. So this is week one of the virtual shadow program because I believe if you understand the why behind the work, everything else makes sense. You stop second-guessing your tools, you stop apologizing for your methods that work, you just show up and train the dog. So if this is the kind of content you want more of and you're thinking about becoming a dog trainer yourself, the virtual shout-out program is where we go 10 times deeper on all of this. Four weeks, daily videos, live sessions. There's a community of people who are in the exact same boat as you. And you guys can visit my website if this is something that you're interested in. And if you're here just as a dog owner who wanted to understand your dog a little better, I hope this episode was useful. I know it was a little short, but it was pretty dense, so I didn't want to drag on for too long. And I hope it made you a little more curious and a little less frustrated next time your dog does something that makes no sense. So, as always, thank you guys so so much for being here. And I'll see you next week.