Posit Academy Overview
Transcript
This transcript was generated automatically and may contain errors.
Hi, everybody. Happy Tuesday. Hope you're having a great day. Welcome to today's Posit Enterprise Community Meetup.
Over the past few months, we've featured a number of stories from amazing pharmaceutical leaders at Roche, AstraZeneca, GSK, Eli Lilly, Pfizer, Janssen, all sharing their perspectives on the adoption of open source. One of the common questions that I've seen across many of these meetups, webinars and data science hangouts is how teams are training the next group of users and questions about best practices for those learning.
I'm so happy to have Garrett Grolemund, our Director of Learning at Posit, joining us for today's presentation. Garrett is an award-winning R instructor, the author of Hands-On Programming with R, and co-author of the bestseller R for Data Science.
I also wanted to say thank you to Devin Pastoor, one of our principal solutions engineers who works with a lot of our pharma customers at Posit. He's helping out in the background and will also join us for the Q&A. You may see Devin answering a few questions in the YouTube chat as well.
So with that, you can use YouTube for questions, but you can also ask anonymously through the Slido link that I'll pull up on the screen here as well. And we'll copy that over into the chat too. But with that, thank you all so much for deciding to spend some time with us. And I will pull Garrett up on stage here with me and turn it over to you.
Well, hello, everybody. As Rachel said, my name is Garrett. I am the director of learning at Posit, but I've been working with Posit, formerly known as RStudio, for about a decade, mostly teaching people how to use R, to write code, to do data science, and the other things that you do with R. And now more and more Python too.
Many of the clients that I've worked with were people who were trying to migrate from SAS to R, or from Excel to R, or you might just say from nothing to R. So they're trying to train teams to learn how to code, do reproducible work, and do data science with code. What I discovered is you can teach adults of any age to do this very effectively. But whether or not the training ends up being a success depends on how you do it. And that's going to be the theme of the talk today.
Why workshops and online courses fall short
Now for a lot of those 10 years I spent training people, I was teaching workshops. I'm going to try to convince you today that workshops, standard training of all varieties, really is not an effective way to retrain or retool a workforce to use code. It's also true with online courses. But I'm going to show you how you have options beyond these things to do a very good job of it.
When I joined RStudio 10 years ago, it was a very different world. A lot of people didn't know about R, or if they did, they had impressions of R that were not very complimentary. So we had to spread the word about R, and we decided to do that by teaching people, at workshops, to use R and the tidyverse, which we were creating. We also experimented with other partners to do video courses, online courses, MOOCs. If there's a way to teach data science, I have done it.
What I quickly noticed teaching data science was that teachers out there really aren't that good. The workshops you see at conferences are not so great. And even some of the online courses, I really questioned how effective they would be. And after thinking about this, I realized the people who become experts at coding or data science did that by devoting years of their life to becoming that expert. But they didn't have a chance to devote those years to being a good teacher, or even find out what made teaching work.
Serendipitously, my background is in psychology. I have a PhD in statistics, but that came from doing psychological research; I worked my way over to writing code that supports statistics, and now what we call data science. And my background in psychology alerted me to the fact that there are many things you can do to make training more successful. I thought, well, there are these well-established theories, like cognitive load theory, multimedia learning theory, and the rest. They all apply to learning. And if we apply these to the data science training environment, we should be able to make a much better workshop.
If there are any trainers in the audience, I strongly recommend learning about these things, and there's a book that makes the research really easy to learn about. All the most important educational research is covered in this book, How Learning Happens by Paul Kirschner and Carl Hendrick. So that's something you might consider if you're developing your own training programs.
Anyways, I took this and put it into my workshops and my training programs, and it worked amazingly. People loved the workshops I was creating. I started to win awards like crazy. The American Statistical Association, for example: I'd come to teach at their JSM conferences, they'd poll all the students, their statisticians, and the workshop that was rated highest among many, many workshops would win an award, the Excellence in Continuing Education Award. Well, I won it three years in a row, and that was unprecedented.
But I thought, wow, this is great. I've cracked the code. I can teach a really effective workshop. These students are telling me in every workshop that learning R is going to be life-changing for them because they could see how it would be so much easier to do the things they're doing and they'll actually be able to do some more.
The forgetting problem
For example, the best we can tell based on research is that students only remember about 56%, so about half, of the content that you tell them in a lecture. And that's if you test them immediately after the lecture. And we expect that this drops off from there as time goes by.
Not only is this disturbing, the students being tested here are undergraduates at college, who we might think of as professional learners. If they can only manage to remember half of what they learned in a lecture, then the distracted adult who goes right back to all their other work responsibilities probably retains even less.
Other studies show the same. Here's one that looks at procedural instruction. So this is more like teaching someone how to do a procedure, whether that's using a machine or fitting a certain model, something that you do, a behavior. And they find similar numbers: immediately after the training, only 60% of the students can reproduce the procedure. Six months later, that drops to 40%. One year later, that drops to 30%, and so on.
And most trainers I discovered weren't aware of this at all. But you can look at the students being trained and kind of get the sense that this is exactly what is happening.
I went back to my workshops, and I continued to teach, and I continued to get rave reviews. And I realized what all the knobs were to make student satisfaction really high in a workshop. But I also realized they don't really relate to the stickiness of what the student's learning. And anecdotally, I started to check back in on some of my students, and I realized, even though they love the workshop, and they could do lots of great things in the workshop, they weren't doing those things a year later or six months later. To me, that seems like a training failure.
And I'm not going to speak for everyone, but I almost feel like it's a dirty secret of the training industry that you don't get terrific outcomes as a trainer. I've heard people talk about marketing and say, you know, we know half of what we do works, we just don't know which half. Well, it seems like that might be the case for training as well. But we're dealing with a field where people aren't trained to be trainers. So I think there's lots of upside to do much, much better. And that's what we set out to do at RStudio, which is now Posit.
And the way we fix this is by looking into why this happens. So some of you may have seen this graph before. It's called the forgetting curve, or the Ebbinghaus curve, after the gentleman who first postulated it. It summarizes a body of research that goes all the way back to the 1800s and has been verified in many different fields. It's just a fact of human life.
The way to look at the graph is these blue lines are someone's ability to do something, be that to do a task, to remember vocabulary, to take a test. It's just their ability to do that thing that they learned. And the first blue line shows how that ability changes over time.
What happens consistently is your ability to do something drops off after you first learn it. And especially if you never use it again, your brain basically cannibalizes the resources that devoted that ability and uses them to do other more important things. And eventually you can't do that thing anymore.
But you don't have to settle for that outcome. If you wait until that thing starts to get difficult for you, and then you practice it and do it again, you're essentially relearning it through your practice. And you do it again, and you jump up to the next blue line. So your ability started to fall off, but you practice, and now you can do it again pretty well. And then it starts to fall off again. But it doesn't fall off as fast as it did the first time if you had never practiced it. And if you repeat this process and you practice again, you'll be on the next blue line, and then the next blue line.
Whenever you don't use something, your brain does start to lose its ability to do it. But the more you practice the thing, the more your brain decides to conserve those neural networks and retain that ability. Now if you've practiced something enough times, you've learned to ride the bike and it stays with you forever.
You could sort of see why this doesn't play out in a workshop that you take for half a day or two days or what have you, and then you never really see it again. These practice sessions work because your brain is building new neural circuitry in between. That means you have to go to sleep, nighttime sleep, in between these sessions to really build up a robust neural network.
How the brain builds skills
All right, so we could see in the data that's how that works, but we also know from research and study why it happens. This is your brain. It's a collection of neurons that are joined together, and your abilities and the information you know and the memories you have are all encoded into your brain as networks between these neurons.
The network is set up so when one neuron in the network fires, the other neurons fire as well, and they trigger whatever behavior needs to come together to make this experience that your brain is recreating for you. We can imagine having two networks that do two separate things in our brain. One of these we use, every time we use it, those neurons fire together, and then your brain wires them together a little more strongly, because this is how the brain works. You probably heard neurons that fire together wire together. Every time you use it, that network gets stronger and stronger, whereas the one you don't use gets weaker and weaker.
Your brain has important things to do. It's not going to maintain a network that doesn't do anything useful until eventually over time you'll have a very, very strong network at the thing you were doing, so strong that doing it no longer takes conscious thought. It's completely automated. When one of the neurons in the network fires due to some cue or trigger, all the rest just automatically cascade, and you juggle the ball, or fit the model, or interpret the data. Whereas that other neural network that we never use, it doesn't even exist anymore. It's gone.
This is the biological phenomenon that explains the forgetting curves. Like I said, this wiring together occurs while you're sleeping, so this is something that happens with lots of repetition over a long period of time, weeks, let's say.
If we go back to our model and think about, do we want our colleague to be able to do this new skill, pick a skill where the answer is yes, and then consider, what is going to determine whether or not he or she could do that six months from now? You'll discover it's very little to do with the initial instruction we give the colleague and everything to do with the amount of practice they've had shortly after that instruction.
How do we train to take advantage of this? How do we train to make people who can use R as well as SAS six months from now?
Something else to consider as well: just watching a lecture or a video on your computer about doing something probably builds a neural network, but it's a neural network related to sitting still and watching a video unfold. It's actually doing the thing that builds the neural network you'll later use to do the thing. We have to be conscious of that in training too, and unfortunately a lot of training doesn't really focus on the student actually doing the things. It's more, hey, watch this video.
Well, the revelation we had at RStudio is that data science is a skill. If you want to learn to do good data science, with code or otherwise, you have to practice it and learn it as a skill. A great analogy would be playing the piano or some other musical instrument. Those are skills as well.
There are things you need to learn like how to read notation, rhythm, music, but you also need to sit down and practice the piano if you want to play a song on the piano. You wouldn't expect to do this well by watching an online course or videos online. You wouldn't expect to learn to play the piano well either by attending a short workshop that lasts for a day or two days. I mean, it might help, it might get you enthusiastic to do the practice you need afterwards, but at the end of the day, if you want to play the piano, you do need to practice over time. And usually with some sort of feedback from an expert who's mentoring you as you play the piano. And the same is true for data science. We need to use practice over time to learn to do it in a new way.
Well, what we find, or what you would find too if you researched the training community, is that students don't complete online courses. They don't stay very engaged with them even when they do complete them. And being someone who's designed many online courses, I can tell you it's very hard to design an online course that multiple students can come to and be challenged by without a teacher there overseeing the process. You sort of have to design for the lowest common denominator, and that's not enough to help the rest of the class learn.
When we look at really well-supported massive online courses, like the Johns Hopkins MOOCs on Coursera, we see that it's typical for about 10% of students to finish. I've also had the privilege to look at the data for some online data science training providers, because we were partnering with them, and what we saw was something very similar to a gym. Lots of people pay for a subscription to the gym, but at the end of the day they never show up. They're just sort of paying.
So lots of people think education works this way. There's an educator. They have the information that the student needs. There's a student. We just need to transfer that information over, and then the student's now an expert. But that's not how education works at all. Information is cheap and easy, and you could get it from the library or YouTube or wherever you want it, but if you want to learn to use that information, you have to do something after you acquire it, and this is the hardest part of learning something.
You have to practice it in a loop over and over again, and it's easy to start practicing things the wrong way, and as a student, you might not even know you're doing it, and then you're just building neural networks that encode mistakes. It's not going to work out well. You need to practice and get feedback, maybe some coaching. Practice itself is hard work, and all of that work requires you to be motivated. It's hard for busy adults to stay motivated at one more form of busyness, but this blue loop here is what matters if we want our colleagues to become excellent R users or Python users or data scientists. Most training environments ignore the blue loop. It happens after the training's over.
Most training environments ignore the blue loop. It happens after the training's over. So what we decided to do is focus on that blue loop to make it easy to do the hard work of learning, to do it as efficiently as possible so you know it's paying off, and to give you the coaching that keeps that practice loop where it should be so you're practicing the right things instead of spinning off into mistakes.
How Posit Academy works
How do humans actually learn skills? It might seem like the training industry is botched, and that if you want to help your people transition over to R, you'll have a hard time of it, but that's not actually the case. We learn skills every day in modalities that are very familiar and that lots of people have confidence in.
So we used the piano as a metaphor earlier. If you wanted to learn to play the piano, you would find a piano coach or music teacher, they'd give you some one-on-one feedback, and it'd be a process. But every apprenticeship also works this way, every internship, every residency. If the thing you're trying to learn happens to be a physical skill, we call it coaching. These things work all the time. We need an experience like this for using R, for doing data science.
So how should your analysts learn data science or code or R or Python? We've put together an experience that looks like this, and this is my answer. This is how it should look. If you train with Posit Academy, the first thing you have is a mentor. A mentor is someone who knows how to do data science with R, and they're going to help you. They're going to coach you to learn it.
We have two types of mentor at Academy. One is the type that we provide. We provide experts to help people learn, and the other type is a mentor that the company who engages us provides. They act as an apprentice in the course experience, an apprentice mentor. But they're another resource for students, and the cool thing about apprentice mentors from the company is after the experience is over, that apprentice is still at the company, still acting as a mentor, still forming the node of a support network within the company, so you build up a community of practice.
The next thing you have in addition to your mentor is group mates. We've designed a cohort-based learning experience where no one learns in isolation. They learn in small groups of five to seven students who are all trying to learn the same thing as you are. So as you go through the course, not only do you have an expert who has your back, who's coaching you, you also have fellow travelers who you could discuss things with. You could work through problems together, and you may not even realize it, but you can hold each other accountable and motivated as you go through the process.
So those are the people involved. Next we give you a project to do. So an Academy course is not a course. It's an apprenticeship. You're going to do something very real and very close to what you do at your job because that's ultimately what you want to learn how to do. We want you to see the value of everything you're learning, and the best way to do that is to make it similar to your work, which is very valuable to you.
So an example project would be comparing different treatments, maybe with asthma, maybe with some sort of disease. If you came to us from finance, we'd give you a finance project and whatnot, but we give you something that is close to what you're going to do, and you're going to do it under the guidance of the mentor and with the support of your group mates.
Each project is scoped to cover what we think or we have found over the past decade is essential to learning R. So that begins with importing data, exploring the data with plots and visualizations, wrangling your data, analyzing it. If you're familiar with R, I'm talking about dplyr, tidyr, joins, that sort of thing, but also fitting models to your data. And then how do you summarize your work to communicate with someone else, and how do you do that reproducibly so the next time you have a similar project, you're already way ahead and you start with your previous work versus making it up from scratch. We focus on R Markdown and now increasingly Quarto, which is the next version of R Markdown.
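The workflow described here, import, visualize, wrangle, model, and report, can be sketched in a few lines of tidyverse code. This is a minimal illustration rather than actual Academy material; the file name and column names are hypothetical placeholders.

```r
# A minimal sketch of the import -> explore -> wrangle -> model workflow
# (illustrative only; "trial_data.csv", `treatment`, and `outcome`
# are hypothetical names, not real Academy data).
library(tidyverse)

trial <- read_csv("trial_data.csv")           # import the data

ggplot(trial, aes(treatment, outcome)) +      # explore with a plot
  geom_boxplot()

trial |>
  group_by(treatment) |>                      # wrangle with dplyr
  summarise(mean_outcome = mean(outcome, na.rm = TRUE))

fit <- lm(outcome ~ treatment, data = trial)  # fit a model
summary(fit)                                  # summarize for a report
```

In an Academy project this would live in an R Markdown or Quarto document, so the whole analysis reruns reproducibly the next time a similar project comes along.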
So you're going to do all of that to complete your project and do that real life thing that you will presumably now be able to do at your job and create a lot of value with. But how do we make that happen? We break the project into 10 progressive milestones. Each week you work on one of these milestones. When you get to the 10th one, you finish the project, you have your portfolio piece, which is a report that shows the conclusion of all your data science work.
But each of those milestones builds up to that in sequence. So first you learn the basics of working with R: how to troubleshoot, what to do if you get stuck and need help, or what to do if you hear of a new package or a new function and want to teach yourself how to use it. We teach you how to teach yourself. That's a big part of working with R. Then you learn visualization, then data wrangling and joins, and so on.
Think of the project as the skeleton or the backbone of the course. Most courses have a syllabus, but we have a project, and it functions a little bit like a syllabus. You're going to need to do all these real-life things to finish the project. We divide into our milestones, and then we prepare tutorials that teach you what you need to do to do that milestone.
So if you have to fit a model in R and you're just learning R, you don't know how to do that. We're going to assign you tutorials, from the library of tutorials we've written, that will show you how to fit models in R. These tutorials are designed to teach through exercises. We've written grading code, a type of artificial intelligence that will look at the code you submit, and if it's wrong, it'll give you advice on how to fix that code. It doesn't focus on unit tests or making sure your result is right; it looks at what you're writing and tells you how to change that, because that's what you need to learn.
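To make the idea concrete, here is a toy version of code-aware feedback. This is emphatically not Posit's actual grading engine, just an illustration of the principle: inspect how the answer was written, not only what it returns.

```r
# Toy illustration of code-aware feedback (NOT Posit's grading engine):
# comment on how the submission was written, not just its result.
check_submission <- function(code_text) {
  if (grepl("subset\\(", code_text)) {
    return("You used base subset(); this lesson practices dplyr::filter().")
  }
  if (!grepl("filter\\(", code_text)) {
    return("Hint: use dplyr::filter() to keep only the rows you need.")
  }
  "Looks good!"
}

check_submission("subset(trial, dose > 10)")
# returns the message suggesting dplyr::filter()
```

A real system would parse the submitted code properly rather than pattern-match on strings, but the design goal is the same: feedback on the code itself, so the student practices the right habits.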
And then at the end of that week, after you've learned your tutorials, done your milestone, we have you get together with your group and present what you've done. We ask you for each milestone to recreate something, a step in your project. If you've studied the tutorials, you should be able to do that recreation. But then we ask you to teach yourself something new to create a personalized or extended milestone that you could present to your groupmates. This creates a lot of cross-pollination when the groupmates get together, they learn from each other, but it also helps you practice teaching yourself new parts of an ever-expanding free and open-source language like R.
The group sessions, which we hold over Zoom or Microsoft Teams, look something like this. You share your screen, and you're working with the RStudio IDE, a real tool many data scientists use, within Posit Workbench. You also end up working with Posit Connect, so you learn how to use the real tools that you can start using the day after Academy ends. In fact, most students start using them at work to create value well before Academy ends.
Your groupmates give you the accountability. You have to show up with something to show them. You also meet with them during the week one time for a co-working session to work through trouble together. But they're more than just the people who make sure you did your assignment. After the course is over, they're your future support network. They're the beginnings of a community of practice that you've established by learning R inside your pharmaceutical company.
So a typical week: do some tutorials, complete and personalize a milestone, present to the group. In between, you have a co-working session with your groupmates to help each other out. It's optional, but we strongly encourage people to go, and most people do. And then you have a communication channel like Slack or Microsoft Teams where you can discuss the project and answer each other's questions. Your mentors are there too to answer questions, and they also attend each of the live sessions. Repeat this until you finish your project, and that's what the Academy experience looks like.
Results and student outcomes
So does this work? Can this teach someone who doesn't know R to use R and do real data science with it? The answer is yes. We've taught about 600 students so far using this method, and we have about a 95% completion rate. This is for a course that lasts 10 weeks, normally 12 on the calendar, because we put in some breaks for holidays and such based on how the year works out, and the students stick with it.
People are really excited to be students in this sort of experience. They're really excited to actually apprentice on something, learn it well, and have a mentor there to help them through any question that comes up, any challenge that they have, or anything they want to explore that we didn't put in the course, but they know they're interested in.
So our students like it, but our customers are actually companies. We're trying to help companies create capacity to do work that didn't exist previously in their company. And it works there, too. This is probably the most on-the-nose quote from a previous customer. James Wade shared this at a previous RStudio Enterprise Meetup.
And he said, they were very interested in whether this was working, so they went back and they surveyed the students they put through Academy six months after they left Academy. And they found that 16 out of the 17 students were still writing code at least once per month or more frequently, using what they learned in Academy. And that 17th person, by the way, was promoted to a manager, so they no longer had to write code, but they were convincing their direct reports to start using R and write code based on what they did at Academy.
Now, if you're not a trainer, I don't know how that number strikes you, but based on what you saw at the beginning of my talk, 16 out of 17 is unheard of in the training community. It's not unheard of among our customers, though. This is par for the course, but James went on record about it.
We have taught pharmaceutical companies. There's a quote on our website, posit.co/academy, where AstraZeneca talks about how great their Academy experience was. People notice the difference and they sign back up for more cohorts of more students.
Anyways, thank you for listening to me. You may have noticed I'm not a pharmacist. I'm an educator, and I geeked out on education, but I hope after hearing this, you get a sense of what you need to do to successfully transition your colleagues to R or to Python from wherever they're coming from. These principles work in every circumstance, and if you avoid them, you're going to have fleeting gains at best.
Demo and Q&A
This is the student landing page or an example student landing page for the Academy experience. Now, remember, so much of the experience is about that social motivation you get from your group and your mentor and talking to them outside of this platform here. But when it comes time to do your learning in a tutorial, this is where you go. We have a syllabus on the side that's divided into different milestones. You can see first we're just learning R, then we're importing and visualizing our data, adding variables. These things are progressive. They build on each other until eventually you're getting models and writing your own reports and publishing them.
Within each week, we have the lessons that you take and then the milestone that you do. You go into the milestone file. We provision the IDE with the different files that you would need or that you might find at your job if you were doing this type of task. And we have an R Markdown document that guides you through the task.
This is the first milestone. It's kind of like a hello world in some ways, but we give you some data, we want you to open it up, look at it, and recreate something. In this case, we give them an image of a plot and we say, there's a function that does this. It's the hist function. We did not tell you about it this week, but we did tell you how to read help pages and learn about functions. So can you learn about this function and make this plot? And then we give them an extension: okay, let's push this a little further. Maybe you could change the color, maybe you could scour that help page, learn about your options, and try to apply them.
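The milestone he describes might look like this in code. The data below are simulated stand-ins, since the real milestone ships with its own data set.

```r
# A sketch of the first milestone, with simulated stand-in data
# (`gfr` is a hypothetical column name; the real milestone provides data).
mdrd <- data.frame(gfr = rnorm(200, mean = 40, sd = 10))

# Recreate the target plot with the function named in the milestone
hist(mdrd$gfr)

# Extension: read ?hist and apply options such as color and labels
hist(mdrd$gfr, col = "steelblue", main = "Baseline GFR", xlab = "GFR")
```

The point of the exercise is the process, reading the help page and figuring out the options yourself, rather than the plot itself.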
I see Ben had asked, thanks, Ben, for the question over on YouTube. At what point in the process would the ability to creatively solve problems be expected? Curious where the complexity of practice should be introduced.
We don't discourage creativity for solving problems at any point. As the milestones get more complicated, they sort of become more divergent. So in the second week already, you'll be making plots and each project is guided by a motivating question, which might be as simple as, you know, is there a difference between these two treatments? Or it might be, you know, what factors predict this thing?
And normally within those questions, there's multiple relationships you might search for or discover, especially if it's what factors cause this, who knows? So we encourage the students to be curious, explore as many relationships and data, maybe even make their own new variables or features to explore and visualize with different types of plots.
So I would say as soon as the second week, there's lots of room for experimentation, exploration, and reporting new things back. Also, that extension piece allows people to learn a lot of different things. For example, I had a student who thought, I like the plot I made, but this would look really interesting animated, because there's something happening over time. R's free, it's all free and very accessible online. So he went out and taught himself the gganimate package and brought that back. He was able to do that because we talked about how to learn new packages.
So when a new customer comes and talks to Academy, we try to find the project that fits best with them. One of them is the Modification of Diet in Renal Disease study. Sometimes we have to be clever about getting around HIPAA and regulations, so this is data that's actually been published in the scientific literature, so we're not going to get in trouble for using it.
And it is a clinical trial that looked at four different treatment groups on renal disease, so kidney disease, and the treatment groups varied the level of blood pressure medication the patients took and also the amount of protein in their diet. So we had four different groups and ultimately we run an ANOVA to determine whether or not there's a difference between the groups.
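The analysis he describes, comparing four treatment groups with an ANOVA, can be sketched like this. The data below are simulated, not the MDRD trial data, and the variable names are hypothetical.

```r
# A hedged sketch of a one-way ANOVA across four treatment groups.
# The data are simulated, NOT the MDRD trial data; `decline` is a
# hypothetical outcome (e.g., rate of decline in kidney function).
set.seed(1)
trial <- data.frame(
  group   = rep(c("A", "B", "C", "D"), each = 50),
  decline = rnorm(200, mean = rep(c(3.0, 2.5, 2.8, 2.2), each = 50))
)

fit <- aov(decline ~ group, data = trial)
summary(fit)  # F test: do the group means differ?
```

In the project, the students build up to this model over the earlier milestones, so by the time they run the ANOVA they already know the data well.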
These are the 10 milestones in the project. So first they're going to create a histogram of the measure that we're interested in, how much your kidneys can filter, the measure of kidney health. We progressively move them up from base R. So first we just want to make sure they can use R functions. Then we start teaching packages like ggplot2, the graphics package in R. They group and summarize by patient to reveal things that you wouldn't find in the data as it is but can derive from it. We join in a second data set that they can work with. We tidy and pivot tables.
We start reporting our results in a way that's useful not just for turning in assignments, but also for delivering reports or decisions to the people your team works for. We work with different types of data, such as strings, dates, and times. Whatever sort of data you would work with at work, that's what you'll work with on your project, but we do teach you how to work with each type of data in the tutorials.
Here are some of the artifacts students make over the course of 10 weeks on this project. This slide is sort of an executive summary that I could share with my colleagues; what the students actually see is what I was showing in Posit Academy.
This is the actual MDRD project; MDRD stands for Modification of Diet in Renal Disease. You see we're talking about glomerular filtration rate. In week 6, they get Milestone 6: run the code below to make the table. This is not the code you'd write to make the table; it's just code that displays the table we made. So now they have to figure out how to take their data set, which they're very familiar with, and fit a model that returns this table here. Then they can check to see if they did it right, if it looks the same. And then they can extend that, changing maybe the variables in the model or whatnot.
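The shape of that milestone, sketched with simulated data rather than the actual MDRD code: the student fits a model and compares its coefficient table against the target table shown in the milestone.

```r
set.seed(42)

# Hypothetical stand-in for the MDRD-style data set
mdrd_like <- data.frame(
  group = factor(rep(c("A", "B", "C", "D"), each = 30)),
  gfr   = rnorm(120, mean = 38, sd = 7)
)

# Fit a linear model of GFR on treatment group ...
fit <- lm(gfr ~ group, data = mdrd_like)

# ... then print the coefficient table to compare, term by term,
# against the table the milestone displays
coef(summary(fit))
```

Extending the milestone is then a matter of swapping or adding variables on the right-hand side of the formula and refitting.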
I see a lot of questions coming through from both Slido and YouTube. One was: can you share more details on how Academy is structured for customers in life sciences? And are there any measures of student engagement and overall success in life sciences?
So, first of all, Academy is not structured specifically for life sciences. It's structured for people who need to learn R or Python to do what they do. We've become experts in teaching R and Python, and what we've found is that it's not very different whether you need to use R to do this or to do that. It's not even that different if you're coming from SAS or Excel. If you're coming from SAS, we can use the knowledge you have about SAS as a springboard to connect to R. But at the end of the day, we're not going to give you a translation guide that shows you how to ask where the bathroom is in R. We want to teach you to speak fluently in R, and that's what we achieve in 10 weeks.
So our structure looks the same for life sciences as for other industries. You need to know the mental models of the R language: it's divided into functions and objects, and this is what those mean. You need to know where to get your help from; whether you're going to use clinical trials data or preclinical trials data, the R language operates the same way. You need to know the tools that execute your code. You need to know procedures for backing out of something that didn't work and finding the right path forward.
Then there's a lot with data that is also remarkably similar across different fields. We make sure you know how to get your data into R where you can use it. If you have really big data, we'll teach you how to bring R to the data in the database using the dbplyr package. Once you have your data, you need to know how to join tables together; R works best when you put everything in one table before you analyze it, and we teach you that, which is field independent. Tidying your data: R works best if your data is in rectangular format, or tidy data format, and we teach you that too. How you make a plot is the same.
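A sketch of the dbplyr idea, bringing R to the data rather than the data to R. It uses an in-memory SQLite table as a stand-in for a real database, and assumes the dplyr, dbplyr, and RSQLite packages are installed.

```r
library(dplyr)
library(dbplyr)

# An in-memory SQLite table standing in for a real database table
labs_db <- memdb_frame(patient = c(1, 1, 2, 2),
                       value   = c(40, 38, 55, 52))

# Ordinary dplyr verbs are translated to SQL and run in the database ...
query <- labs_db |>
  group_by(patient) |>
  summarize(mean_value = mean(value, na.rm = TRUE))

show_query(query)   # inspect the SQL that dbplyr generated
res <- collect(query)  # ... and only the summarized result is pulled into R
res
```

The same group-and-summarize code a student learns on a data frame works unchanged here, which is why the big-data step is a small addition rather than a new skill.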
There are so many commonalities across the industries that really only about a week and a half is different from task to task at this level, and that's what the projects capture. Each project teaches the scope of all those things; it's the scope of what you'd find inside the book R for Data Science, by the way. But it takes what I call the payoff step, the model that's valuable to you with the type of data you use, and the tweaks that are idiosyncratic to that type of data, and it matches those to your industry.
So how do we structure things for life sciences? We made projects that look at data that life scientists or life science industries use, and that ask and answer the questions that life scientists ask of that data. We don't train anyone to be a life scientist; we train them to be a life scientist who uses R, but we assume we're meeting them as life scientists already. Now we just show them how to use R to do the things that they're interested in.
I think part of the flavor that I can add, which Garrett really hit home throughout the entire webinar, is that groups come to Academy with very different needs, even in very specific areas like clinical programming. Some groups have a large set of SAS programmers that they're trying to convert to R. Other groups are trying to do novel analytics that they've never done before, so they're coming from a clean slate.
But ultimately, a lot of the patterns across companies across groups are still consistent. And so part of the beauty of Academy is bridging a lot of the pedagogical techniques, but recognizing the fact that the more domain specific they become, the more they're going to immediately resonate.
And so with respect to the analyzing-clinical-trial-data elements, this is absolutely a call to say that we have been engaging with companies interested in doing that, looking to develop a program that fits the type of work they're trying to do. But it is a conversation to have; it's not a routine where everyone goes through the same motions and hopefully comes out with the same outcome. Instead, it really is about asking: what is your group trying to do?
And so that's why we really look forward to continuing to extend those teaching engagements, and we're really looking for people to broach what they're interested in learning. The answer in most cases is not going to be "no, we won't teach that"; it's a conversation about how you're trying to empower the group.
And that's one of my observations from the last decade or so of interacting with groups: even if they crack open a book or some courses, when they start to work on their own projects, where there's no code snippet to copy and paste and work off of, it's easy to get frozen as soon as something doesn't work, and you don't really know where to go. With groups that go through Academy, we see that they're enabled and empowered to troubleshoot, to diagnose, and to move off of the path that was covered, even within the given context.
There are kind of two elements to it. When I think of the conversations around data pipelines, we think of the orchestration components: I have some raw data, it gets processed a bit, and you have some derivative data that you're going to tabulate, visualize, and potentially run other modeling activities against. Academy won't necessarily cover how to deploy that data pipeline with whatever orchestration system you might use, whether that's RStudio Connect, now Posit Connect, or some other pipeline. But inside the pipeline, the analysis activities, the managing and munging of the data, the visualization, the tabulation, those are the core concepts that are covered. So you should be set up to think: I have raw data, I have insight objectives on the other end, what are the pieces that need to stack together to get there? Those are well within Academy's wheelhouse.
I did also mention in the beginning, when I was introducing the presentation that we've recently had a number of great meetups and data science hangouts featuring different pharmaceutical leaders, and a lot of the questions from those events have been targeted to learning and how to teach groups of new users who might be moving from SAS to R. My colleague Isabella recently put together a great blog post that kind of highlights each of those different leaders' perspectives on moving to open source in the pharma space.
I appreciate the messages of support that have shown up in the chat. I've seen all those, and thank you very much, everyone. Thank you all so much. Have a great rest of the day.
