Computing and recommending company-wide employee training pair decisions at scale... posit conf 2024

Transcript#

This transcript was generated automatically and may contain errors.

Hey everyone, um, yes, I do have a four line mouthful title, but I have a shorter title as well. So today I will be talking to you about a collaboration that I did with my colleague Jordan here and a couple other people at Regeneron.

So I'll be talking about a collaboration that I did, um, at Regeneron in order to compute and recommend community-wide employee training pair decisions at scale via an AI matching and an administrative workflow platform that we developed completely in-house at Regeneron. But as you heard, the alternative title is YouCanDrStrange2 and I will get to why that's the alternative title in a couple of slides.

But first here's an overview of what we're going to be talking about today. So we had a worthwhile challenge, um, and we'll be talking about exactly what that was, but then we addressed it with a scalable solution using a flexible framework for impactful data science.

The challenge: growing mentoring cohorts

So have you ever wondered how impactful companies stay ahead of the game? Well, at Regeneron, upholding and ongoing responsibilities to help improve the lives of others is one of our top priorities. And so we do this by continuously focusing on evolving our people, our process, and our technologies. And so there's a bunch of different ways of investing in employees. But one of the ways that you can do it is by providing growth opportunities. Another is by supporting internal groups that facilitate networking. Another is creating DE&I, so diversity, equity, and inclusion initiatives. And another is enabling the work-life balance, um, that is ever elusive for a lot of us in our careers.

But one of the things that I really want to point out is the career mentoring and coaching or institutional knowledge transfer on a pairwise basis. There are a lot of people who are further advanced in their careers that can help out people who are not as advanced yet, but are hoping to be able to be at that level eventually.

So in terms of the mentoring, we actually had a worthwhile challenge. So our mentoring cohorts have grown substantially every year. So if you think about it in terms of, for example, in the year 2022, if you think about all the mentor and mentee applications that we had, if you multiply those numbers together, it's almost 4,000 choices, possibilities of pairs that could occur. And in 2023, that number grew to 35,000 and this year grew to about 84,000.

So in 2022, the process was done manually, and it's a really grueling, excruciating process to look at all of the different possible pairs and be a matchmaker. So that took half a month for those 3,864 choices. But at that rate of 2022, it would have taken four and a half months last year and almost 11 months this year. And that's probably best case scenario because we're all flesh and blood here. So I mean, it's likely that there would have been a slowdown exponentially increasing the amount of time that it actually would have taken.

But it's too long, right? So trying to do this process across all of the applications and the possible pairings would just be impossible. And the random pairings of just saying, oh, let's stick this person with that person, that would have resulted in a lot of disappointment.

So what we ended up with was a problem. There were too many choices and not enough resources. If you try to do the pairing right, you would be irritated because it would take too long. If you try to just stick people together, ultimately as the matchmaker, you'd be embarrassed because everyone in the company would be angry at you because they're like, why did you do this to me?

So we needed a way to easily consider more people's requests and scalably. So where 20 times the number of possibilities did not equal 20 times the amount of time. So how could we, and here's the Dr. Strange part, and it wrote itself, quickly review the 84,000 possibilities to find everyone's best possible match in far less than a year. How could we, Dr. Strange?

Building MAGNETRON AI

So what did we do? We built a scalable solution. We supercharged Regeneron's resources with AI, making it such that the testable options were far under the amount of resource that we had. And so what we did was we built Magnetron AI at Regeneron, and we've submitted a patent on this. So Magnetron is an acronym that stands for Mentoring and Growing Next Generation Talent at Regeneron Online, Magnetron.

So what this resulted in was the flattening of the amount of time that it took to increase the number of applications and still yield pairs that everyone was happy with. So we dramatically reduced it 10x, and it enabled for cohort sizes to increase without substantially increasing the matching time. It's already been used for four additional cohorts other than the two shown here, and there's one actually happening right now. So it could be used over and over and over again.

So we dramatically reduced it 10x, and it enabled for cohort sizes to increase without substantially increasing the matching time.

And the best part is that it's domain agnostic. So it's usable for the multiverse of possibilities that you could possibly have for matching people for institutional knowledge transfer.

And yeah, that's really the summary. We had a worthwhile challenge. We hit it with an AI approach. And starting at the end and working backwards is what this team that I was able to work together with over the past two years used to build Magnetron AI. Thanks.

Q&A

All right, so I think we have time for one question. And it has to do with you covered a lot. Where do you think people should learn more, look to learn more about this type of work? About the matching and the tool? I'm assuming the tools that you mentioned on the tool process.

Yeah, so you mean the earlier ones or the, well, so you could look at the YouTube, youtube.com slash at sign data driven decision making to learn more about the earlier stage. There's a paper from my PhD where I talked about transitive prioritization, the guilt by association rank. So that's a potentially good place to start.

All right. So we'll say YouTube is the place to go. All right. So that's the end of this session. Thank you, everyone. Let's give Regis another round of applause. Thanks.

Computing and recommending company-wide employee training pair decisions at scale... posit conf 2024

Transcript#

The challenge: growing mentoring cohorts

Building MAGNETRON AI

A flexible data science framework

Applying the framework at Regeneron

How the decision recommendation engine works

Democratizing the tool with R and Python

Q&A

Featured software#

dbplyr

DT

plumber

reticulate

Shiny