
Jamie Warner @ Plymouth Rock Assurance | Data Science Hangout

video
Feb 23, 2024
57:33


Transcript

This transcript was generated automatically and may contain errors.

Welcome back to the Data Science Hangout. I'm Rachel, I lead Customer Marketing at Posit. So excited to have all of you joining us today. If this is your first time joining us, the Hangout is our open space to hear what's going on in the world of data across different industries, chat about data science leadership, and connect with others facing similar things as you.

So we get together every Thursday at the same time, same place. But I always like to say this: if it is your first time joining us, it really is so nice to meet you. And we'd love to have you say hi in the chat so that we can all welcome you in here as well. We're all dedicated to keeping this a friendly and welcoming space for everybody. And we love hearing from you no matter your years of experience, titles, industry, or languages that you work in.

It's also totally okay to just listen in here if you want. And you can be a part of the party that happens in the Zoom chat. You'll see there's lots of resources and comments that will get shared there. But there's also three ways that you can jump in and ask questions or provide your own perspective on certain topics. So you can raise your hand on Zoom, and I'll call on you to jump in. You can put questions in the Zoom chat. And then feel free to put a little star next to it if it's something that you want me to read out loud instead. And then lastly, we do have a Slido link, which Hannah or Tyler will share in the chat in just a second here, where you can ask questions anonymously too.

So thank you to Tyler and Hannah who are helping as my co-hosts in the background and will be helping me collect some of the questions. One other note, if you're watching this as a recording on YouTube at some point in the future and want to join us live, the link to add it to your calendar is going to be in the details below on YouTube. And as always, no rule that anybody has to stay on the whole time or talk. Come and go as it fits your own schedule.

But with all that, I am so excited to be joined by my co-host today, Jamie Warner, Managing Director at Plymouth Rock Assurance. Jamie and I are both usually in the Boston area, but we're both traveling. Jamie, I'd love to have you kick us off by just introducing yourself, sharing a little bit about your role, and also something you like to do outside of work.

Sure. So thank you for that intro, Rachel. So as she mentioned, my name's Jamie Warner. I work currently at Plymouth Rock Assurance, which if you're not in the New England area, you might not have heard of us, but we are an insurer that covers home and auto. And I've been in the insurance space for a while after being in the tech space initially. And I really love it because there's really amazing data science to be done in highly regulated industries. We have a lot of problems to solve. We have a lot of data and we have a lot of excitement around how to get to them. And Plymouth embodies that. We're a great test and learn environment.

I'm very lucky here to get to do a couple of things. One of the smaller things that we're doing is our cloud migration. But in addition to that, I also get to own the translatability and the implementation of our modeling. So how we take those really large-scale, unconstrained, best data science models and make them something that regulators will approve and that will work for us in terms of our pricing, our underwriting, things like that. Something that I do outside of work, I really like to scuba dive. That's actually why I'm down here. We're taking a three-day vacation with my partner so he can scuba dive for his birthday.

Data science in heavily regulated industries

Awesome. Well, have a great trip. I'm glad you're able to join us today too. So I get to kick off some of the questions. But I know something we talked about when we were sharing your hangout is about revolutionizing the way heavily regulated industries understand and adopt data science. And I'm wondering if you could just share a little bit about what that looks like for you today.

Sure. So I think something that data scientists sometimes stay away from is some of these places that, I don't know, are maybe viewed as a little less cool. I think insurance is cool, but I know maybe it's not the hottest topic that everyone on the phone was excited to talk about today. But something that we forget is that insurers have been collecting data for hundreds of years. And actually, if you think about actuaries, if you're not familiar with them, they do a lot of the background work in insurance and the historic data work; they're kind of like the original statisticians and data scientists. They used to do that work by hand, even folks that I work with today. And the industry has really evolved and we have tons and tons of data.

But a lot of times we can't do something like a tech company where we just throw out the old data and say, we're just going to use all the new data coming in because we need that history. Because claims and accidents and life, depending on what area of insurance you're in, you really do need what we call a long tail or like a lot of historic data to predict what's going to happen in the future. And so we have to figure out really creative ways to get that data, to use it, to put it in a format that's usable, and then figure out how we use that to best price or distinguish between different types of folks. And the other piece is that we do have really strict regulations around the way that we do that. So the type of data we can use. And so we have to be really creative in how we pick our data, we pick our techniques, and then how we make the story, which is kind of the most critical piece. I'm sure everyone's heard that tagline, it doesn't matter how good your model is if you can't tell the story of why. And that piece is really critical here. So it's kind of a fun space to be doing that in. And it's a space where there's a lot of opportunity, because historically, it hasn't been done as much.

I'm sure everyone's heard that tagline, it doesn't matter how good your model is if you can't tell the story of why. And that piece is really critical here.

Career transitions and research roles

I see a few questions starting to come in here. And one was from Arsenis, and I see a star there. So I'll read it for you. But the question is, I started out as an economist doing research and moved into being a data scientist, and now yearn to get back into research analysis. How would you recommend approaching the job market to make that change?

Sure. So it would be helpful to know what you mean by research analysis, because I actually think of all of our roles as being research roles. That's something that's, I guess, exciting to me about the type of work that we do is you don't have to be in academia to be constantly doing test and learn research in this space.

Good morning, all. So yeah, Jamie, so my situation was, I was actually an economist for the federal government. I worked at the US Bureau of Labor Statistics, ended up really kind of falling in love with R and data science in general, started building out a lot of tools for them to use and all that, some of which are still in use, which is kind of a little doff of the cap to me, I guess. I'm pretty proud of that. But, you know, I ended up going in a direction that was more tech, that was more like data science, building tools and doing all that kind of stuff and found myself getting further and further away from the research and analysis stuff that I was doing. I was publishing papers before doing research on macroeconomic trends and some other things. Also, consumer trends and things of that nature. And I kind of missed that research, research-y type of feel where I'm asking questions. You know, I always wanted to use data to ask and answer questions rather than to build tools, which is also a really important thing. But it's not, kind of, you know, my, it's not really my bag.

Yeah, and that's helpful context. So, one of the things I would say is to ask yourself, number one, if your job has to fill all of that. I personally also love the research component. And one of the things about insurance and finance is we're not always using the most relevant techniques or the newest techniques in our work. And so, I have to stay abreast of that somehow. We didn't talk about this in my intro, but I actually have a side gig, and that's teaching at Northeastern. I teach data science courses there at the graduate level, and I also do research there. I have active research grants that I work on with them as a side thing. And that's what keeps me up to date. It kind of tickles that research fancy, because especially if you're working in a corporate environment, you're usually not going to be publishing papers about the work you're doing, because your company wants to keep that secret. That's your whole advantage.

So, for me at least, one of the things that I did to kind of tickle that was to do work as an adjunct through a university or to help out on certain grants. And I won't be the lead author or anything like that, but I'll be a side author. I'll get to do some of the analysis. I'll get to get in on some of those conversations. And I'll get to see the techniques that I could be applying in our space right as they're coming out. So, for me, that's one way to do that. I know that might be a little bit higher barrier to entry, but as I think more deeply about your question, I would also consider what roles within a company look like, because there are roles that are very, very technical, and then there are data science roles that are more about explaining things. And you might find that you enjoy the roles that shift over toward the product side, where you're really explaining and looking at the impact of the data and models, not just, you know, building them.

Digital transformation and change management

Yeah, I see, Alan, you had a question in the chat. Want to jump in here next?

Yeah. Hi, everybody. Hi, Jamie. You mentioned really, really briefly in your intro that you're involved with digital transformation. And I'd love to hear more about how your team and your team's needs fit into that bigger effort. I'm curious how you fit into that effort and how you find your needs to like fit well or influence the broader thing. My experience is sometimes there's some tension and it's challenging to figure out how to balance all of those needs of a team versus a whole enterprise. And so just really curious about your experience there.

Yeah, and I saw another question in the chat about my background and training. Something that I learned going into those types of transformation projects, and I did a similar one previously when I was at Lincoln, is that you go from being a data scientist or a data person to being what I would call half psychologist, half salesperson, where it's more about negotiating the change management and the emotions around the change than it is about the actual data. And I would say one of the things that is really helpful around that is that frequently IT is trying to make a change that's an important change, and they need business leaders to help them drive it and find the use cases.

And frequently they have thought of some use cases, but maybe they aren't spot on just the same way as data scientists. Sometimes we'll think of a model and then the business will be like, that doesn't make any sense for this reason. The same thing is happening with IT. So I think the general experience tends to be that IT is trying to push this transformation or this move to cloud or this new technology, and they don't really know how it applies. And so I really love to grab onto that and be their biggest advocate. And then they become my biggest advocate. And that's really kind of how we drive the digital change forward.

I would say, yes, there's a huge distinction between what a team needs and what the business needs. I tend to like to set the standards, because then other people will just adopt them. It's easier to adopt someone else's standard than it is to make your own. So I love the ability to go first and make a best practice and then have other people just adopt it, rather than having to argue that an existing practice isn't the best one and try to switch it over. So I love to be a front runner.

Yeah. And I think it's really about making other people feel good at the end of the day, like rewarding IT, showing them business value, and then they're able to go to their managers and show that. And then they're super loyal to you when you do that.

To add on to that answer as well, I'm curious, when you were interviewing with organizations, what kinds of questions did you ask to figure out if you could be a driver in that organization?

That's a great question, especially in insurance. Some of it is just knowing the industry and asking different people that work there how they approach their work. I think some of the questions you could ask are about their technology suite. How much they work with their IT partners is usually a really big indicator. And asking, for example, when I interviewed with Plymouth, I asked, hey, could I talk to the head of IT? Can I have a conversation with them, or specifically with someone that works in this space? And I really was able to get their take. They weren't part of my interview panel. So asking sometimes for those additional resources is really helpful to get the full vision. And if there's a lot of disconnect between the business and IT, sometimes that's a sign in and of itself.

Domain expertise and business knowledge

Well, thank you, everyone. Good morning, Jamie. I'm really intrigued with the fact that you're working now in a highly regulated industry. My question deals with how did you obtain the domain expertise to complement your expertise as a data scientist? How long did that take? And when did you feel like you were an expert in the domain?

No, no, it's a great question. And I would say I have a little bit of my tech background to thank for that. So I started my career at Forrester Research, which is a market research firm. Going into that type of work, we would always pay a lot of attention to having to upskill really quickly on the needs of our customers. And so when I moved into insurance, I was building models initially for underwriters, the people that pick the initial price for the policy, and I had no knowledge of what underwriting was. So I actually did go get certified as an underwriter through a process called the CPCU, the Chartered Property Casualty Underwriter certification. And that was really beneficial, not just because it gave me a basic knowledge of what underwriters do; it was also helpful because I could go to their conferences, and you get no better feedback than when you are at a conference with underwriters and you announce that you are a data scientist building the models that they have to deal with every day. And it's not the kind of feedback you get formally in the business environment. It's the kind of feedback where I hate this and this and this, and you get a lot of information.

And I also understood all of the data in insurance. Something I love about it is my background is in survey data, and all data in insurance is survey data: a claims adjuster put it in, you put it in. Think about the last time you answered one of those questionnaires; you kind of fudged it a little bit, maybe. And so a lot of what we're doing when we clean the data requires us to understand the business expertise. I also do a lot of trying to mix across the business. So I love job shadowing; shadowing a claims adjuster, shadowing various folks across the business, will really help. And then the final piece is, I think, as a data scientist, you're used to being kind of like the smartest person in the room who knows everything. When you go into a business environment like this, it's important to lean into being the dumbest person in the room. And understand that, especially with historic processes, and this was a big one for me, when I first got in there, I was like, well, this is a stupid way to do it. Why do they do it that way? And as I started to ask questions, it came out that maybe there was a regulatory piece that was why they were doing it that way.

Or for example, when I was at Lincoln, we had one thing where we were sending out these really expensive mailings. And I was like, why don't we just email this? It's so much faster, and we actually know if they get it back. And it was a regulatory constraint; there was a regulatory requirement around a physical piece of paper. And so that means that you need to be able to partner across the business to get over that constraint and get the adoption you need. So really asking those questions and coming in with curiosity, rather than kind of an I-know-better attitude, I think helps a lot. I will never know as much as our product folks. So really working with them too.

In your university hat, your adjunct hat, do you mentor your students to take similar approaches? I really commend your insight in obtaining that certification you mentioned. And things like that, are you advising your students to do similar things in their careers?

Yeah, so from my perspective, at the end of the day, the data is the most important thing, right? You can use any model, but if you don't understand your data, there are probably weird biases and distributions that you never understood. And you understand that by understanding the subject matter, right? So by subject matter expertise. So I'm a huge proponent of that, and I really like it when students dig in there. And I think there were some questions earlier in the chat about getting a job in different places. When I post a data scientist role, I immediately get maybe 500 LinkedIn messages and 1,000 applications for that role. I cannot differentiate between all those people. But what I can differentiate is if a product analyst that I know comes up to me and says, hey, I connected with so-and-so and they, you know, are doing really interesting stuff. So a way to get great connections is to help out folks in those spaces, partner with them, and get to know them.

Code sharing, tools, and open source

But the question is, can you discuss how you and your team share code and models with the rest of your organization? I'm especially interested in whether or not your organization uses R with other tools like Python and how that affects everyone's workflows.

Yeah, it's a great question. We use anything and everything. I'm a big the-tool-fits-the-job kind of person. The biggest thing around that is I am, I would say, a jerk about code headers and code documentation. We typically use Git for a lot of the sharing because that's really easy; if people aren't familiar, it's just kind of like a SharePoint for code. And it tracks version control, which is a really big thing for me, to be able to have one version of the truth.

But with code sharing, and I'm not sure if your question is just about code sharing or about sharing the output and information, we tend to use a lot of comments. Because if you're a great Python coder and you want to look at someone's R code, if it has great comments, it doesn't matter that you don't know R as well; you can read it. Same thing with SAS; we have a lot of legacy SAS code. And we're definitely transitioning code between different languages, especially as we move to the cloud. I mentioned earlier, we're doing a big cloud transformation, and that means we have to upskill in new languages, like from Python to PySpark. So there's a lot of different code happening. And to me, it all comes down to the documentation in the code.
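As a rough illustration of the kind of header and comments described here, below is a minimal Python sketch; the project fields, file names, and columns are hypothetical, not Plymouth Rock's actual template.

```python
# ------------------------------------------------------------------
# Project : Auto pricing data prep (hypothetical example)
# Author  : J. Smith              Created : 2024-02-01
# Purpose : Join prior-term claims to policy features and write the
#           modeling table used downstream.
# Inputs  : claims.parquet, policies.parquet
# Output  : modeling_table.parquet
# Notes   : Mirrors the logic of a legacy SAS prep job; see Git
#           history for earlier versions.
# ------------------------------------------------------------------
import pandas as pd

claims = pd.read_parquet("claims.parquet")        # one row per claim
policies = pd.read_parquet("policies.parquet")    # one row per policy term

# Left join keeps every policy term, including those with no claims.
modeling_table = policies.merge(claims, on="policy_id", how="left")
modeling_table.to_parquet("modeling_table.parquet")
```

With a header like this, a reader who mainly works in another language can still follow the intent, which is the point being made about comments carrying the code across languages.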

I know Bill actually had a question earlier that touches on what you just said, since you brought up SAS. Bill asked, what role have Excel and SAS played, or what role do they play currently in your company? And is there resistance to open source tools?

Yeah, I would say that the resistance comes from a couple of places, right? So Excel is what actuaries use. Actuaries love Excel. But they're also really smart, really technical people with statistical savvy, so you can upskill them if you give them the space to be upskilled. And I would say with SAS, and especially all these legacy languages, the first piece is, are the people you're working with able to understand the new code you're writing? And is there a fear component? A lot of times when there's a lot of resistance, it's because: I've been doing this job for 30 years, I know exactly how to do it in SAS, SAS has great customer support, and you're expecting me to download this tool that I don't even know how to download. And where does the notebook live? I don't understand it. So a lot of that, to me, is really proactively being aggressive with training to make people feel comfortable.

Because if you just give it to people and expect them to go back to their desks, like ideally, data scientists have a lot of curiosity and they would just go and break things. But people get nervous, especially if you're known for being really good in your space; suddenly not knowing a new code platform can be really scary. I would say also, you know, some of the stuff that SAS has innate in it is really helpful for something like an insurer, because the packages aren't open source, which means they have to be tested really extensively before they can get pushed out. And that's a really big risk when you move to open source code. Going back to that documentation piece, we have to have really good documentation, like YAML files and things like that, around what environment we were using at the time we built this code, because when we go back to refresh it, we want to still be able to run it. That's one of the risks of open source, so it does require a lot more analysis of the packages. Your SAS code, if you go back and run it 20 years later, it will still run, I guarantee you, versus your Python code, you run it 20 years later, it's not going to work; all your packages are going to have dependency issues, and same thing with R.
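One lightweight way to capture that environment record, sketched here in Python purely as an illustration; the output file name is made up, and many teams would instead use renv, conda, or a requirements file.

```python
# Snapshot the runtime environment alongside a model artifact so the exact
# package versions can be reviewed or rebuilt when the model is refreshed.
import json
import sys
from importlib.metadata import distributions

snapshot = {
    "python": sys.version,
    "packages": sorted(
        f"{dist.metadata['Name']}=={dist.version}" for dist in distributions()
    ),
}

with open("model_environment.json", "w") as f:
    json.dump(snapshot, f, indent=2)
```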

So really being aware of that. And picking some of the packages that are really more comfortable to use. But really, yeah, I'd say it's that training education piece. And there is part of it is just there's a lot of legacy code that runs and nobody quite knows who built it. And nobody quite knows why it runs, but it does. So breaking that apart takes a lot more effort. And it doesn't show a measurable outcome, necessarily. Like, if you fix it all and put it in a new technology, the code still just runs. So you do have to kind of balance those things.

Cloud migration and data governance

And the question was around what has been most challenging about using cloud, or maybe I'll add about the migration to the cloud for an insurance company.

Yeah, I think it was about the insurance data. All that kind of stuff is regulated and related to personal data, you know, very sensitive data. So I was wondering how you guys deal with that data, or what the challenge is for you. If you build a model from beginning to end, everything in the cloud, how do you deal with that and make sure you don't get, you know, an issue with the regulation or other stuff?

Yeah. So insurance, and most of the regulated industries, are special in that we use a lot of PII, which is personally identifiable information. And sometimes we have health information, we have Social Security numbers, we have all sorts of things like that. There's a couple of pieces to that equation. The first is making sure you're working with IT and you have really good data governance, because you can mask a lot of those fields you don't need; you can create IDs to replace things like Social Security numbers, and then have them crosswalk back later if you need them. So there are things like that you don't actually need to have in the cloud environment that everyone has access to. And again, I always say it goes back to the data: if you have good data standards around your governance, you can take out a lot of the information that you don't need.
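A minimal sketch of that surrogate-ID-plus-crosswalk pattern, assuming a pandas workflow and a keyed hash; the column names, key handling, and storage details are illustrative, not the actual governance setup described in the hangout.

```python
# Replace a sensitive identifier with a surrogate ID before the data moves to
# the shared cloud environment; the crosswalk stays in a restricted location.
import hashlib
import hmac

import pandas as pd

SECRET_KEY = b"kept-in-a-restricted-secrets-manager"  # illustrative only


def surrogate_id(value: str) -> str:
    """Deterministic keyed hash, so the same SSN always maps to the same ID."""
    return hmac.new(SECRET_KEY, value.encode(), hashlib.sha256).hexdigest()[:16]


policies = pd.DataFrame({"ssn": ["123-45-6789"], "premium": [1200]})
policies["member_id"] = policies["ssn"].map(surrogate_id)

# The crosswalk sits behind tighter permissions; the shared table drops the raw field.
crosswalk = policies[["ssn", "member_id"]]
shared = policies.drop(columns=["ssn"])
```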

Now you still typically do have a lot of information that is private. So we do have a lot of different compliance pieces, but one of the things I like to remind people of is that just because you have it on local doesn't mean it's secure. And I think people tend to forget, like, yeah, it's on John's machine over there and Sam's machine over there, and it's getting sent by email; you know, that isn't necessarily more secure than putting it in an AWS or Google Cloud or Azure environment. In those environments, we can really restrict security and add permissions and know who's touched data when. And so a lot of those things are how I make the case around it. But it is definitely a big step for people. And I think comparing it back to the current state and really making people aware of what the current state is does help.

To actually follow on to that too, I know there's something that came up in a hangout with Biogen on the data governance or data stewardship piece. How do you, I guess, come up with best practices for how people handle work on their desktop or the ways that they're handling data?

Yeah, it's a really good question. I was lucky enough to initially grow up in a very strict data culture in my first role at Liberty Mutual, where we had really high standards. And so for me, a lot of it has been repeating those standards over and over and over again. But I also go back to, actually, in the insurance industry, one of the things that's really stirring things up right now is that the National Association of Insurance Commissioners just released their letter on AI, how they're thinking about AI, how they're thinking about data. And we really always want to be following a lot of those standards. And I think that goes back a lot to knowing what's in your data, knowing what your different fields are, and knowing how much you actually need them. Even if you have a data dictionary, do you understand which fields are sensitive and which fields aren't? Do you have basic protocols to not email your data?

And just a little bit of awareness, because I think, especially coming out of school, and I hire a lot of folks that are PhDs, they're used to data that's already cleaned or anonymized, or can be emailed around between different groups, or can be uploaded to a Google share or whatnot. And our data really doesn't act like that. We have to be a lot more strict. So a lot of it is creating that protocol around, this is the process, this is how we follow it. If you're doing work on your machine, are you saving it back? Are you, you know, doing a pull and a push and making sure you're saving it back every day? That is a big piece of it, because we do a lot of testing locally before we run something up. So that's really pushes and pulls. And I've definitely caught people on that before. I think once you make the mistake of not doing it, and you lose something, you never make that mistake again.

Thank you. I know there was just an anonymous question that came in while you're answering that too, which was, how do you keep track of which fields are sensitive? So PII or PHI, how do you ensure those fields are not shared outside of having documentation in place?

I think that definitely depends on the scale of your organization. If you're a larger organization, you might have a tool like Snowflake or some other sort of software where it's really easy to tag things and label them. With a lot of insurance data, a lot of it's historic files that have been stored somewhere, or some of it's uploaded at specific times. So I think it's really about documentation and then masking a lot of things as you push them out. You do it at the first step rather than at every model step, and then have people come back and request fields if they need them. So instead of generally having one modeling data set with everything in it that everyone can work from, the set everyone works from has already been transformed, and the sensitive pieces are kept hidden. Then if people need those fields, they can come to you, rather than leaving the fields in by default.
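A rough sketch of that "mask at the first step" idea, assuming field sensitivity is recorded in a simple data dictionary; the tags, columns, and file names here are made up for illustration.

```python
import pandas as pd

# Hypothetical data dictionary recording which fields are sensitive by default.
FIELD_TAGS = {
    "policy_id": "public",
    "zip_code": "internal",
    "date_of_birth": "sensitive",
    "ssn": "sensitive",
}

raw = pd.read_parquet("raw_policies.parquet")

# The default modeling set drops everything tagged sensitive up front;
# analysts request those fields separately if a use case requires them.
default_columns = [
    col for col, tag in FIELD_TAGS.items() if tag != "sensitive" and col in raw.columns
]
raw[default_columns].to_parquet("modeling_set.parquet")
```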

So sensitive fields we tend to remove. It's also really important in highly regulated industries to know there are some fields you can never have, even if a vendor tries to send them to you, and those include all the protected class information. So just being really aware of that. And if there's an issue, letting people know that we're in an environment where they can come and tell you they've made a mistake, because the worst thing is to pull that data over and then keep having it rather than flag it. That culture of, hey, I can raise my hand and let you know I accidentally had this data open, is really important.

But Jeb said, I'm also in a heavily regulated industry and the pace of tech change is wildly different at different layers of the business. One side is bleeding edge and pushing the edge to new places. The other might still need to keep an AS400 running because that's the one that's approved. Do you need to navigate this kind of dynamic in insurance and how do you do so?

Yeah, it's absolutely a dynamic that I'm navigating. And I like to be the first mover and then prove that we've got a business case, that we can save money. If I'm in the slower group, the first thing I do is look at the company goals for the year, or the organizational goals for the three-year period, whatever they are. And then I try to think about whether there are any projects that we're working on that would make a material impact on those goals. Usually it's a cost savings or a revenue generation item. And then I use that project to try to justify why I need new tech. That's the only way I've ever seen it work: really focusing on that piece of, this tech is going to drive this ROI, and it's going to cost us money, but it's also going to save us this much money, and it ties to your objectives up there. That's really how we make noise on that.

And then when your part of the company is doing that and seeing success, you're able to say, hey, why haven't they adopted it? But I tend to think of it as a rising tide lifts all boats, right? So if you're moving up and you're getting new technology, being the people that upskill and train and help the other departments is great. If they don't want to come along for the ride though, they don't have to, and you don't need to drag them. I used to try to do that earlier in my career, and dragging them can really drive you down and slow your progress. So make it accessible to them, but don't put in too much time if they're really resistant and they're not really tied to your business objectives.

Hiring, onboarding, and workforce analytics

Yeah, so I noticed, Jamie, that you taught a class around human resources. I think I was looking at your LinkedIn page, and I'm curious if you've ever done any modeling or work around capacity planning, either the demand side or the supply side.

Yeah, and funny short story on that. I actually do teach for the HR program at Northeastern because when they realized that HR professionals would need an analytics course, they also realized that none of the professors wanted to teach a math course. So I moved over there to teach that course, and that's how I ended up, and I actually do a lot of my research is in that space and the adoption of tools in that space. So when we think about the operational piece though, that's a big piece of modeling that I've done historically, and that's across like things like claims and call centers, and also groups like underwriting and staff planning, especially around sales.

Well, one thing that we're starting to look at in my office is around resource capacity planning. So the idea that managers sort of submit something early on that says this person's going to be spending this many hours on a project and then trying to reconcile that with what that resource actually does submit and sort of plan ahead to try to line up those expectations about where people are going to be, which projects people are going to be spending their time on.

Yeah, that's definitely a fun one, and one thing I would caution is to make sure that your metrics have kind of two levels. What I mean by that is everyone who works in any environment is great at optimizing the benefit to them. In a call center, for example, if you optimize the number of calls taken, people will actually just start hanging up on customers to get more calls taken, and if you optimize on a quality metric, people will stay on the phone as long as possible and rack up as much quality as possible. So having kind of a marriage of the two: if I think about your time tracking, I might also think about a project success metric or something else that you can track to really balance that, so that you're not penalizing people that are spending a little extra time but making a lot more impact versus other folks.
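As a toy illustration of marrying a volume metric to a quality metric so neither can be gamed alone; the cap, weights, and metric names below are invented, not anything described in the hangout.

```python
# Combine a volume metric and a quality metric so neither can be gamed alone.
# A geometric mean collapses toward zero if either component is very low.
def balanced_score(calls_per_hour: float, quality: float, max_calls: float = 12.0) -> float:
    volume = min(calls_per_hour / max_calls, 1.0)   # normalize volume to 0-1
    return (volume * quality) ** 0.5

print(balanced_score(calls_per_hour=11, quality=0.55))  # fast but low quality -> ~0.71
print(balanced_score(calls_per_hour=7, quality=0.95))   # slower, high quality -> ~0.74
```

In this toy setup the slower agent with much higher quality scores better than the fast agent with weak quality, which is the balancing effect being described.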

Also, you said that you, you know, have done work on that and that you've published some papers. I don't know if it was directly on that, but where do you publish your work?

That's a great question. Some of my work I publish in insurance journals. I'm working on two grants right now at Northeastern, and we're going to publish probably in a number of journals, and then we'll probably also do some releases. One of those grants is with the Koch Foundation and one is with the Walmart Foundation, so they'll both probably also do a variety of publications in partnership with various consulting groups or businesses as those evolve.

Okay, Rick, I see you asked a question with the star, so I'll read it, but Jamie, is the digital transformation initiative company-wide or is the effort restricted to the data science function? Also, to what degree are future-ish data science considerations considered in this roadmap?

Sure, so the initiative is technically company-wide, but I am front-running my area in it. The company-wide initiative looks like it will probably take another few months to even get an idea of what tool suite they're going to go with. They're doing a lot of analysis, they're doing a lot of thinking, so I offered to be a test guinea pig and said, what if we just front-run you guys? We'll go in there, we'll test and learn, we'll use everything. So it is technically a global initiative coming, but within our initiative, we'll be alone for probably the first couple of years.

In terms of the broader piece, I think one of the things that's really powerful for getting leadership on board with the level of spend you have to use to migrate into something like a cloud environment is that new-age stuff, right? They love the buzz, the AI, the gen AI, the future of whatever. And if you think about something like, for example, home insurance, today we might use a lot of features that are point in time, like the square footage of your house. But if you think about the devices you have in your home today, in the future we might want to be able to take actions based on real-time data, and we can't do that with the infrastructure we have today. So a lot of it is around justifying that we need to start building now if we want to be able to do that in five or six years, and that's the unfortunate truth. Even if we wanted to do it in a decade, we'd probably have to start building now. That case resonates a little bit more with the long-term-planning executive suite than the case about today.

Someone had sent me a direct question around what you look for when hiring, so somebody just getting out of school, but I also wanted to add into this because I saw that you were hiring a data engineer recently, and so I thought it would also be good to chat a little bit about that and just understand what are you looking for in that role?

Sure, so first kind of in general hiring, the number one thing I look for is curiosity. I think that you can learn, if you already know a programming language, you can learn any other language if you're curious, but I cannot necessarily teach you to be the kind of person that sees a weird pattern in the data and wants to know more. I think that that's kind of like you're either that kind of person or you're like, oh man, there's a weird pattern. I wish it would go away, and so that's really what I look for when I hire, especially for data scientists, is the drive to understand the why of what they're seeing and that test and learn kind of interest.

Sure, so first kind of in general hiring, the number one thing I look for is curiosity. I think that you can learn, if you already know a programming language, you can learn any other language if you're curious, but I cannot necessarily teach you to be the kind of person that sees a weird pattern in the data and wants to know more.

When I think about the data engineer role that we're hiring for, I'm really excited about it, because we have historically never had any data engineer roles. And this is a really common thing at lots of companies, because they hire a bunch of data scientists and they're like, aren't you data scientists? Why can't you ingest and prep the data as well as build the models? I like to think of data scientists as the people that could play a data engineer on TV. They know enough to be dangerous and also to make really badly unoptimized pipelines. It's a totally different skill set, especially as we move into some of these cloud technologies. There's so much really cool, interesting optimization to be done, and you need to understand what to ask for to do that, especially with your IT partners. You need to understand what the capabilities are, because you won't always have that control; in Excel or in SAS, you would have almost total control of everything that was happening. In an environment like AWS, you really have to rely on your infrastructure people, and they don't necessarily know what's best for things like a data pipeline. So that is really what that role is about: rethinking the way we ingest our data and process it, especially so that we can do it in a way that's automated. There are so many good new tools now to look at things like flagging what you're missing, identifying imputation opportunities, and then actually doing it and keeping a great record of it in the system.
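A small sketch of what flagging missing values, imputing them, and keeping a record could look like in pandas; the columns, file names, and fill rules are hypothetical.

```python
import pandas as pd

df = pd.read_parquet("policy_features.parquet")

# Profile missingness first so the gaps are visible before any modeling.
missing_report = df.isna().mean().sort_values(ascending=False)
print(missing_report.head())

# Impute a couple of fields and log exactly what was done, so the pipeline
# keeps an auditable record of every imputation.
imputation_log = []
for column, strategy in [("annual_mileage", "median"), ("roof_age", "median")]:
    n_missing = int(df[column].isna().sum())
    if n_missing:
        fill_value = df[column].median()
        df[column] = df[column].fillna(fill_value)
        imputation_log.append(
            {"column": column, "strategy": strategy,
             "fill_value": float(fill_value), "rows_filled": n_missing}
        )

pd.DataFrame(imputation_log).to_csv("imputation_log.csv", index=False)
```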

Sure, thanks. Just following up on the hiring questions, I was wondering what are the things that you do to onboard your new data scientists? I work in insurance as well, and I know when people come in, they're sort of a little shocked at the amount of data that we've got and all of the different topics that everything encompasses, compared to a lot of different industries. What does that look like for you?

Absolutely. It's a great question. The number one thing, and I go back to this, is that having really good historic documentation and holding my team to a certain standard makes onboarding very easy, because we have the documentation. Now, for example, I moved to Plymouth about six months ago, so I don't have that legacy of me being aggressive on documentation historically. So, a couple of things. Going back to one of the earlier questions, I really think business knowledge is important. One of the first things I have data science folks do is shadow a bunch of the key business users that create the data that they're using. For example, if they're working in an agency space, I have them go watch the agents input whatever they're inputting. Same thing with a claims adjuster or an underwriter. I'll have them do a shadow in the first couple of weeks, understand that person's role, understand the intricacies of the data. So then when they get in the data, they recognize, oh, here's what I'm looking at, what I'm trying to see.

I'll always get them a buddy on the product side or someone somewhere else in the business. Those people usually love to hear more about data science, and the new hire gets the added benefit of learning more and having someone they can go to really quickly for business-side questions. And then just documentation, documentation, documentation. I usually like to give them a tiny project that uses the data we like to use, but in a way where they'll find some issues and errors that I already know about, so I can see how they treat them and get a feel for them too, right? When I hire someone, all I know about them is what they've told me. I don't actually know their quality, what their gaps are, what they're good at. So, giving a little project where I roughly know where the outcome should fall can really help me identify, okay, here's where you're good, here are the growth areas I need from you.

I really love the point about shadowing, and I see Libby had commented about literally begging to shadow some users, but being told no. Did you have to go through anything internally to set that up, or how does that work?

Yeah, it definitely is. I know when I first started in insurance, the first time I asked to shadow an underwriter, my manager looked at me like I was insane. They were like, no, we don't let the data people near the underwriters. And I think that's really a conversation the data science leaders that you have should be having with their peers. I think a lot of times the reason we don't get adoption is because people don't think that we understand the work that they do. And showing that curiosity helps; I have almost never had a peer across the business turn me down when I say, I really want my people to understand this better because I want them to be able to make the best outcome for you, and I feel like we don't understand what you do effectively. They're always going to agree that we don't understand what they do. So, that's a much better way to do it.

But again, if you can't get that from inside your company, I would totally recommend going to the professional networks. There are networks of claims adjusters; there's the underwriting network that I'm a part of. They have all sorts of different meetups and trainings and things like that. And you can go and meet, you know, underwriters from some other company if the ones from your company won't talk to you. That does take a little bit of a proactive push and a little bit of research. But that piece, if you can't get it internally, I would definitely push to get it externally. And if your manager is not pushing for it, I would consider whether they have the right vision. Sometimes data science managers can be very, very technical and not as good at this particular area. And that's when it's really helpful to have a buddy on the product team, where maybe their manager can go have that conversation.

Data de-identification and unstructured data

Yeah, I was just curious; I'm doing a little market research. I've got a lot of background in data de-identification, particularly with PHI, and I'm wondering what types of things you'd like to see, particularly with LLMs. There's a lot of background in working with structured data, but de-identification when it comes to unstructured data, if you follow a lot of the buzz about vector databases and things like that, is kind of the holy grail. Everyone's looking for some type of universal, solve-everything de-identification solution.

Yeah. So, I think it's helpful to tell a story. If you guys remember, Obama had a chief data scientist, DJ Patil. And when he left his role, he initially decided he was going to tackle the problem of data in insurance. He was going to create some sort of system to handle all the data that we have, really streamline it, get it all in a single source, and really improve outcomes that way. And then he gave a talk maybe a year later and he said, you know, we got there and we realized all the data is in PDFs, and maybe this isn't what we want to tackle. And you've similarly seen Amazon recently make a foray into insurance; they shut it down a couple of months later. They just shut down the UK operation. Google, same thing. The data problem in these highly regulated industries is enormous, and it's very, very challenging. You're thinking of like the fourth stage, and the first stage hasn't even happened. The aggregation, effectively aggregating and combining data across different sources, hasn't even happened yet. There are a couple of startups starting to do this in the health space, where they aggregate your health data across all your various technologies, like your watch and whatever. But if you think about insurance, we collect data from the motor vehicle records. We collect data from your health, you know, different types of doctors and providers. We collect data from all these places where the data is actually really bad, and we don't have good system linkages; think about trying to get your hospital records and what that data looks like. And so to me, the first problem to be solved is not even masking it, it's getting it and putting it in a format that's actually usable and consistent from all these different places.

And we've seen some providers do that in very specific spaces, like prescription data. There's a provider that has worked to like aggregate all of that. Or like motor vehicle records, there's a provider that has worked to aggregate, but those pieces to me have to be answered. And when you answer those pieces well, then when you start doing these giant kind of other types of models and structures, you actually have the foundation to be able to tag, flag, whatever, versus trying to start from like step four.

I was just going to give some background. I've just started a new gig, but for the last several years I've been at a healthcare company where that is what we were doing: taking claims, EHR, practice management data, pharmacy data, et cetera, and building a common data model. There are tools like OMOP and others, you know, that are used on the healthcare side; I can't necessarily speak for all of insurance. But the next dilemma then is, now you've got this common model. The holy grail you're trying to get to is a large data set that can be used to build predictive modeling and get towards precision medicine. Instead of having all the patients in one hospital, you could have every breast cancer patient in America in a data set, and you could get really, really fine-tuned analysis, and that's the holy grail. But to get there, once you get this big data repository, you find out, okay, we can handle the structured data, we can do de-identification on that. Now comes the unstructured: all the doctor's notes, all the other lab results, the PDFs. As you said, everything's in a PDF and they all have proprietary mechanisms to try to decode them. And there's work being done there, but then once you get to that, it's like, okay, how do we de-identify this and how do we do it reliably?

Yeah. And there are some really cool startups doing work in the space of pulling data off PDFs and even doctors' terrible handwriting, really cool LLM and gen AI work on that, some of them local to Boston. So that is really exciting to me, because five years ago, nobody was doing that.

Data engineering cross-training and career paths

Yeah, I guess it's more kind of a general question of, have you ever heard of a data scientist cross-training as a data engineer?

Yeah. Oh my gosh. A hundred percent. I have people going back and forth in both directions. I think there should be cross-training, period. Back when I went back to school to get my master's, a bunch of the courses I took at that point were in database infrastructure, which is less relevant now, but being able to understand the database infrastructure really helps with implementation. And if you can ask the right questions and use the same language as IT, that's super helpful. Those skills are so useful back and forth. And I could definitely see cross-training, especially because for the last decade or so, the data scientist title has been viewed as sexier than the data engineering title for no particular reason. They're both critical to the process. And so I think people would be very welcoming to bring you the other direction. We usually have the opposite problem, where you hire a data engineer and they only want to be a data scientist. So yeah, I would welcome someone going the other way.

So in terms of getting that training or knowledge, would you recommend just talking to the subject matter experts in your company or are there any independent resources that you know of to get started?

Yeah. I mean, I would start with kind of informational. The way I always start whenever anyone thinks about any development across job areas is some informational interviews with people doing that work, asking them what tools they use day-to-day, what is their job really like day-to-day. And I would do it across a few different companies. I would ask the data engineers at your company, but also really think about where it is you want to land. Because even if I think about data science, people ask me, how do I become a data scientist? I get that question all the time. And my response is, well, is it an insurance? Is it in tech? Is it in... Because in some of those spaces, you're going to want a PhD and a specialization. In some of those spaces, you're going to want... You can come right out of undergrad and do XYZ and you'll be working with certain types of techniques. So I would say data engineer means so many different things that I would really start to hone in first on what's your industry space and what does data engineer mean to you. And then where are those types of roles and how do I start working with that peer group? And then there's really great volunteer organizations. An example would be Code Across America, where you can come in and work on little projects. And there'll be other data engineers in the group where you can learn from them informally.

Cloud tools and closing advice

Jamie, there was an anonymous question I missed earlier, which was about migration to the cloud. It just asked if you could share a little bit about the tools that you're using for that.

So we happen to be an AWS infrastructure company. No pros or cons there. I don't think I can necessarily share the proprietary tool set we use inside that. But I would say if you do decide to use AWS versus Google Cloud versus Azure, just be aware that AWS is like Legos. There's a bunch of pieces everywhere and you have to put them together the way you want them, and if you don't know how to do that, it can be really hard. So I would say we have lots of different Legos, and then you can also hook in other types of tools. If you go with something more like Azure, a lot of it's more pre-built, and it's a little bit more predefined in Google Cloud. But yeah, we're using AWS as part of our tech stack, and then we have a bunch of other tools stacked in there.

Well, thank you. As we get to the end here, I would love to ask you if you could share a piece of career advice that kind of sticks out to you in your mind, or maybe it's something you received or that you've given to someone on your team.

Sure. So my biggest piece of career advice would be to lean in to being the dumbest person in the room rather than focusing on being the smartest person in the room. And really just keep asking why. Because a lot of times a legacy process or a model or whatever, if someone can't explain it to you, that means that there's probably issues that you can find in it that you can fix.