The actuary to data scientist pipeline | Bill Wilkins | Data Science Hangout
Transcript
This transcript was generated automatically and may contain errors.
Hey there, welcome to the Posit Data Science Hangout. I'm Libby Heron, and this is a recording of our weekly community call that happens every Thursday at 12 p.m. U.S. Eastern Time. If you're not joining us live, you're missing out on the amazing chat that goes on. So find the link in the description where you can add our call to your calendar and come hang out with the most supportive, friendly, and funny data community you'll ever experience.
All right, I am so happy to introduce our featured leader today. It is Bill Wilkins, SVP Advanced Analytics with Practical Applications at Safety National Casualty Corporation. Bill, welcome. Thank you for joining us. And I would love it if you could introduce yourself. Tell us a little bit about what you do and what you like to do outside of work for fun.
All right. Well, thank you. And thank you for having me on. It's a lot of fun. I do know a lot of Safety folks try to participate in the community, so, you know, I do believe it's a good forum for folks to join. My role at Safety is a little nebulous. I've been with the company 17 years, but I've been in the insurance industry for 40. I started out as an actuary, but even before then, you know, I had some programming background. I got trained as an underwriter, trained in regional operations. So I've had a very broad experience within the insurance industry.
But as my training is as an actuary, you know, I've had some very lucky opportunities over the last 40 years to work with some very talented people. And, you know, actuaries are essentially the data scientists within the insurance community. They were the original ones. Now folks are adding more, because actuaries have a very specific view of the world. And luckily I don't hold most of that. That's why I'm where I'm at today. You know, I believe in the actuarial training, but I also believe in the broad brush that the data science community can have.
So my role now: I went from doing pricing, reserving, and data science work to helping with predictive analytics. And I'll give you an example of what we do at Safety. But, you know, now my job is really just to think out of the box, come up with data, come up with connections, come up with technology, whatever we can, to figure out what the next question is that we need to answer. And it's a fun role, frankly. You know, I'm a problem solver by nature.
So, you know, you'll also see the shark in the background. That's about a 22-foot-long tiger shark. I wasn't in a cage; I was with my daughter. We're both divers. I'm a master diver, she's a dive master, so she gets paid for taking people diving. One of the things we like to do is dive. We love the ocean. She's going to save the world; she's a marine biology major and is going to help bring back the coral so we all can live. But, you know, I've always been a calculated risk taker, whether it's trying to fly off the roof of my parents' house, or rappelling down a building for charity, jumping out of airplanes, diving. I like to push the envelope. I like to see if there's a problem that we can solve.
The needle in the haystack problem
And, you know, I said I'd tell you about Safety's problem. They have what I call the needle-in-the-haystack problem. There are about three to four million workers' comp claims a year. Safety National's forte is handling the really large ones, a million dollars or above. Well, it sometimes takes 15, 20, even 30 years before a claim will even get close to a million dollars. So what our team does, to help us do better as an organization, is try to find those at 12 or 18 months instead of waiting 10 to 15 years. Because our belief is that the sooner we can get into the process, the better the outcome for everybody involved. You know, we want people to get back to work, we want them to get the best medical care.
But, you know, out of those three or four million, there might be 1,500 claims above a million dollars in that same year. So we're looking for a really small number of claims out of a really big population. And it's a great exercise. It turned a couple of my data scientists on their heads at first, trying to figure out how to accurately measure, because of the high variability within each client. We've moved on now that they have that one solved, so we're working on other things to help the business. And it ranges across a whole lot of things.
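Bill's needle-in-the-haystack numbers make a nice illustration of why this kind of problem trips up standard evaluation. A minimal sketch in Python, with all figures hypothetical and roughly following the ones he cites; nothing below is Safety National's actual model or data:

```python
import numpy as np

# Illustrative numbers based on the figures mentioned above:
# roughly 3.5 million workers' comp claims a year, about 1,500
# of which eventually exceed $1 million.
n_claims = 3_500_000
n_large = 1_500
base_rate = n_large / n_claims  # about 0.04% positive class

# With a base rate this small, raw accuracy is useless: a model that
# predicts "not large" for every single claim is still 99.96% "accurate".
naive_accuracy = 1 - base_rate

# A more operational yardstick is precision-at-k: if the claims team can
# only review the k highest-scored claims, what fraction are true hits?
def precision_at_k(scores, labels, k):
    """Fraction of true positives among the k highest-scored claims."""
    top_k = np.argsort(scores)[::-1][:k]
    return float(labels[top_k].mean())
```

Precision-at-k mirrors the real constraint here, which is adjuster time rather than overall accuracy; that is one way the high-variability measurement problem Bill mentions shows up in practice.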
What actuaries do
I think we'll probably have a few more questions about what some of these terms mean. But I would love it if we could get some context around what an actuary does. Because you started in actuarial work, and you did that for a really long time. What exactly does it mean to work in pricing?
Pricing is what most people pay. You know, when you look at your automobile premiums or your homeowners premiums, actuaries have typically been the people who come up with those numbers. There's a whole set of techniques, there are exams, and there are multiple types of actuaries. There's what they call the property casualty, or general insurance, actuary, who handles business like automobile, homeowners, and workers' compensation insurance, if you've ever been injured on the job. Then there are life actuaries, who handle essentially life insurance. There are health actuaries, who handle the health insurance products that most of your employers have.
They have a spectrum of things. I've taken enough exams that I happen to be in all those societies currently, for the United States. And there are international actuaries. You know, they have insurance around the world, and it all works differently. And our job, through the education process, is to know how to keep all that going and functioning. The most important actuary out there is called a reserving actuary, or the appointed actuary. Their job is to make sure the insurance company stays around so it can pay all the bills. So if you want to look into being one, I don't really advise it. It has been a great career for me. I love the education process. But the exam process is very difficult.
Yeah, I see Kylie in the chat saying she's heard the tests are really hard. I've heard that too. People study for them for a really long time. I think we probably have some actuaries or former actuaries in our core group of Data Science Hangout crew as well. And we have a question coming up in Slido. Reminder, everybody, we're asking questions in Slido today.
And it has some asterisks next to it, so I'm going to ask it. Mita asks: actuaries have existed for a long time, so how do you think the evolving data science technology has been affecting risk and pricing strategies within insurance? That's actually a very good question.
Unfortunately, the answer is: not at all. It's really funny. The technology has been so powerful, but the actuarial profession is still trying to figure out how it should adapt. I was actually in a conversation about this just two weeks ago, trying to think about where we should be going. Because the most important thing to an actuary is the data. You know, I've been doing this a long time, and the data isn't a whole lot different than what we had before. It's the computing power. The math has always been there. We just haven't been able to utilize a lot of the math. And so what the technology is doing is enabling us to speed things up.
But insurance is highly regulated. And so pricing is under a very strict set of conditions that, even with good data science, you can't get around. And one thing in particular that you're going to hear a lot about (I've been hearing about it for the last several years, but it's going to get even bigger) is the unintentional bias in the data. You can have as many techniques as you want, but if the data is somehow collected wrong, or it's not a full sample of what the world really looks like, there are a lot of issues. And it's really becoming a big issue, not just on the ethics side but on the legal side.
Actuary to data scientist pipeline
I think there's a follow-up question in the chat that I saw, which was: do you think the pathway to being an actuary can be through data science? So, like, risk analyst, years of experience, becoming an actuary after that. Do you generally see data science to actuary, or do you see actuary to data scientist, which is what I see?
What I'm seeing is actuary into data scientist. Now, like I said, it's learning the techniques. I myself have taken data science courses after I got all my actuarial designations, because learning all that material just makes me a better actuary. The problem with the actuarial track is there's a lot of specialized knowledge. You really become a SME. That's the distinction between a data scientist and an actuary: actuaries have very specialized knowledge relative to the general data science course. There is going to be a melding. That, again, was part of the discussion a couple of weeks ago: there are so many like attributes, so where can we really cross them over?
I think the people I have worked with in the past have actually been incorporated into the data science teams, or just had their roles change. So they started as a pricing analyst or an actuary inside an insurance company, and over time the title of their role sort of just shifted. They're doing the same thing, but now they're called a data scientist.
Advanced analytics and pricing models
I have another question in Slido really quickly around pricing. Bonnie has some asterisks here, so I'll ask this: what type of advanced analytics models is your team using behind the pricing model? Do insurers actually run experiments, like A/B testing, around pricing?
This has to be a very careful answer, okay? Because actuaries are constrained to a very specific set of things, as I said. So yes, there are models, and there is testing. However, we have to have a very specific outcome that is aligned to exposure to loss, so we have to be very careful about what we're running and how we do it. There are companies out there doing some unique modeling relative to true behavior in terms of buying potential, but an actuary cannot do that. It has to be strictly exposure to loss. And each group does different things, because automobile is different depending on the type of company you're with. If you're with a State Farm or an Allstate, you have a very specific type of clientele. But if you're with, say, a First Acceptance, which does what's called nonstandard, or The General (I don't know if you've ever seen The General television commercials), they're typically what they call a nonstandard carrier. You go to them when you're having trouble getting insurance from State Farm or Allstate. And so they're different, because the exposure is different, and you have to align the techniques with the variability in there.
Yeah. And just from being in the insurance industry myself for a while and being an insurance agent, I can tell you that for bigger companies, they probably have different sections of their business where they insure different risk populations. So as a peek behind the scenes, you might be insured by the same insurance company as your friend down the street, but you might be insured under completely different risk pools and have a different policy behind the scenes, kind of from the same company.
What people misunderstand about actuaries
I have some more questions coming up in Slido, Bill, and I'd love to ask them. I have a question from Sanjay, and Sanjay, I do not see an asterisk. Would you like to unmute and ask your question yourself, if you're available? I'll give you the opportunity to do that.
Yeah, so hi, this is Sanjay. (Hey, Sanjay.) I've heard a lot about actuaries. What do you think is the thing people misunderstand about actuaries? Like, what's the thing we usually don't understand, as data scientists looking at it from the outside?
Well, what most people don't understand about actuaries is that they're mostly on the spectrum. If you look at the group that I started with, the profile has shifted over time. When I started back in the 80s, we really were a big group of nerds, like you'd see on The Big Bang Theory. But as time has gone on, and we've been able to expand education and make things more readily available, the pool of people within the actuarial group has changed. When I first started, it wasn't just people with math backgrounds or statistics backgrounds. Because we were looking for people, we took music majors, we took behavioral scientists, and we've been able to grow that pool of talent. So other than the love of taking really hard exams, you don't have to be as specialized as you used to be to be an actuary. It's a much broader group than people give it credit for. The old joke was, you can tell he's an outgoing actuary because he's looking at your shoes instead of his in the conversation. We're well past that. I think the group is more akin to the broad swath of data scientists than to the whole insurance-nerd thing of 30 years ago.
Incorporating catastrophic events into models
Awesome. Thanks, Bill. Thanks, Sanjay, for that great question. I have an anonymous one that came up in Slido that I think is really interesting. It's: how do actuaries incorporate once-in-a-lifetime, once-in-a-generation, or catastrophic events like COVID or the global financial crisis into their models? What we would call system shocks.
There's a type of actuary that I was at one point, called an enterprise risk management actuary. I used to be the chief risk officer for Safety National. And oddly enough, we think about exactly those things. Back in 2012, I wrote a paper on a pandemic, how it would hit, and I built a model to figure out how a pandemic would work and how much money we could potentially be losing. So we think about those once-in-a-lifetime events quite a lot. And I've built terrorism models, I've built riot models.
The nice thing is we're free to think about the stuff that could affect us, whether it's flood or fire. I think actually the biggest thing about the insurance industry now is that people have the wrong expectations of insurance. You know, the fires that happened in Los Angeles, the floods that happened in North Carolina. These horrible events were preventable to a certain extent, but there was a conflict between increasing the tax base and doing the right things to stop catastrophes. And so we need to shift the conversation, because it's really easy to come up with bad scenarios; there are whole groups of modelers who do just that. And frankly, there are no more once-in-a-lifetime events. Things are going to speed up because of the effects of humanity on the earth. So a lot of things are going to start speeding up and transitioning, both in physical risk and in the liability that comes with it.
And frankly, there are no more once-in-a-lifetime events. Things are going to speed up because of the effects of humanity on the earth.
Building community across a global organization
Rachel, I actually really like your question about community. I'm very invested in community, too. Are you available to unmute and ask? Yeah, absolutely. I'm just always curious, when I see huge companies, how people get to know what other people across the company are doing. I hadn't realized before meeting you, Bill, that Safety National is a subsidiary of Tokio Marine Holdings. So I was just curious, do your data scientists and actuaries have a shared community across subsidiaries, or how do you collaborate with each other?
That's actually a really good question. I think Tokio Marine is in 40 or 50 countries around the world, and I've worked for other multinational carriers. It really does come down to the people, whether they're actuaries or data scientists. We have a community of roughly 200 data scientists that meets every couple of months, virtually. It started very organically. I knew this guy, he knew this person, she knew that person. We just said, hey, let's hang out and start talking and see if there's stuff we have in common. It started out with maybe 10 people, and then it was "hey, I'd like to join," and "I'd like to join." All of a sudden, it just starts to mushroom. It takes some time, and you have to figure out how to find the conversations that people want. But that type of organic thing is like your data community here. It just starts out, and it'll grow once people find the right conversations and how they want to talk to each other.
We've just had our second in-person global meeting for Tokio Marine. It really was focused on how we can help others. We got together, and it was really funny, because there's no data science department within Tokio Marine as a whole. It's just us. We get together, we have the conversations, and we're picking problems to help the company solve. We do have some executive support, but it's really about getting to know people. That's how it's been my whole career. I worked for a company that was actually 16 different companies. We just started getting together and talking. You build friendships, and it takes time. I've known a lot of these people for a decade, so it just takes time. Once it's up there, it's such a beautiful thing. I know it'll exist when I leave. It'll keep going.
Career advice and passion projects
I really like the focus on just trying to help people or help people solve problems. That's a huge reason why I'm in communities. I see people who have problems who need help solving them, and I'm like, we could all do this better together if we got together.
Yeah, it's a little bit along the lines of the previous one. My question, I'll just read it: I just saw the insurance company, or insurance industry, and I was wondering, given the flexibility of your role, how much influence over the type of problems slash work do you have? Do you have to solely focus on finding high-cost claimants, or do you have the ability to look at passion projects or try to answer different types of questions? And how do you reduce the bias in that?
It's an interesting question. I get to work on things that make sense or will make sense. I'm not necessarily solving things for today, but it could be for tomorrow. I do have some very passionate things because I really believe in what Safety National does, which is wanting to help people. That's what we go for. When I used to run the data analytics group, we didn't do just the projects that helped our claims department identify the claims. We actually put together products for our clients. We handle very large employers, so Apple, Starbucks, State of Texas. My team was putting together products to help them lower their costs and help their employees.
As a really good example, finger sticks are the number one issue in a lot of hospital systems. They waste a lot of time and a lot of energy, with people getting hurt because of that. My team was actually isolating that and working with our risk management group to help prevent it, saving people injury, time, and potential exposure to biochemicals. I get to do stuff like that. I don't want to say it's random, but you never know when the next really good idea is going to come. I get to work on the next good idea and then turn it over to somebody much smarter than I am to try to make it work really well. That's the joy of my job. That's what I think insurance should be doing.
The bias question is a little bit harder. There are the technical items, but what I find the hardest part about bias is the definition of bias. I spoke on this last year. There is a technical definition of bias. However, that's not how people use it today. Everything is biased. So is it good bias or bad bias? Because there's good bias, which means you're very predictive. Bad bias means you're being predictive for all the wrong reasons. Depending on what you're going for, you want to minimize what I call the bad bias. There are a couple of different standard techniques to do that, but the hardest part is really understanding the data and getting back to that data. Is it incomplete? Was it captured a specific way? That's a much longer conversation than we can have here.
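The "good bias versus bad bias" distinction Bill draws can be made concrete with a simple per-group error check. This is a hedged sketch, not one of the standard techniques he alludes to; the group labels and data are invented:

```python
import numpy as np

# A minimal sketch of separating "good" from "bad" bias: a model should
# be predictive overall (good bias) without its errors concentrating in
# one group for reasons unrelated to risk (bad bias).

def error_rate_by_group(y_true, y_pred, groups):
    """Misclassification rate computed separately for each group label."""
    y_true, y_pred, groups = map(np.asarray, (y_true, y_pred, groups))
    return {
        g: float(np.mean(y_true[groups == g] != y_pred[groups == g]))
        for g in np.unique(groups)
    }
```

A large gap between groups doesn't prove bad bias on its own, but it is exactly the flag that sends you back to how the data was collected, which is Bill's point about incomplete samples.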
Third-party data and big data in insurance
Actually, third-party data is going to become more and more critical. And what I'm really excited about is the ability to take words and numbers and put them together. With the explosion of generative AI, we're able to do things we really wanted to do 30 years ago, because we have a lot of text in insurance. We have a lot of numbers, but we also have a lot of text, and a lot of good text once you clean it up some. And individual insurance companies can do a lot with their own data, but there was a recent example of a product out there that takes medical information and litigation information and puts it all together to really hone a strategy for individual companies. And it's going to get more and more powerful as the computers are able to process more and more. So big data is going to play a role.
And that third-party data matters, because insurance companies, unfortunately, while they might be large, have their own personality. Every company has its own personality, whether it has a duck going "Aflac" or Flo in her nice white uniform. They take on these personas, and their data takes on that persona as well. Eventually, I would like to have something that crosses the industry, because I do think we're missing out on critical insights for the populace as a whole. People do jump between insurance companies, but what I see missing is a population-as-a-whole type of data structure. Now, a lot of insurance companies wouldn't like that, because it could mean some competition, but I think we need more of that to level the playing field.
Causal vs. prediction models in risk evaluation
I learned a long time ago, a number is just a number, and it's going to be wrong. Every model is wrong. What you have to understand is what's driving the model. If you don't understand what's driving the model, it could be the best model in the world, but it's still going to be wrong. I actually think you have to have a wide array of both in the process. Unless you can only have one model, there's no such thing as the one right model. What I like about using a platform like Posit's is that it allows us to do multiple things all around the prediction process, because it shouldn't be one or the other. It should be whatever makes sense for the problem at hand. Again, you always have to know the assumptions. As a good example, actuaries have very specific techniques in reserving. Each one will give you a different answer on the exact same data. If you understand how those models work and what assumptions go into them, you can actually level them to see if they're roughly the same. That's the important part. It's understanding what you're doing and what the question should be.
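To illustrate Bill's point that different reserving techniques give different answers on the same data, here is a rough sketch of one classic technique, a volume-weighted chain-ladder projection. The loss triangle is invented, and real reserving work layers far more judgment on top:

```python
import numpy as np

# Cumulative paid losses by accident year (rows) and development age
# (columns); NaN marks values not yet observed.
triangle = np.array([
    [100.0, 150.0, 180.0],
    [110.0, 165.0, np.nan],
    [120.0, np.nan, np.nan],
])

def chain_ladder_ultimates(tri):
    """Project each accident year to ultimate with age-to-age factors."""
    # Volume-weighted development factors between adjacent ages,
    # using only the years where both ages are observed.
    factors = []
    for j in range(tri.shape[1] - 1):
        both = ~np.isnan(tri[:, j]) & ~np.isnan(tri[:, j + 1])
        factors.append(tri[both, j + 1].sum() / tri[both, j].sum())
    # Roll each year's latest observed value through the remaining factors.
    ultimates = []
    for row in tri:
        observed = row[~np.isnan(row)]
        latest = observed[-1]
        for f in factors[len(observed) - 1:]:
            latest *= f
        ultimates.append(latest)
    return ultimates

ultimates = chain_ladder_ultimates(triangle)
```

A different factor selection (simple averages, or excluding an outlier year) would produce different ultimates from this same triangle, which is exactly why understanding the assumptions behind each technique matters.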
Adapting to AI and regulatory uncertainty
How are you adapting your actuarial models slash approaches to address the evolving risks introduced by recent advancements in AI and the resulting complexity, regulatory uncertainty, as well as ambiguity in liability slash accountability structures?
No, no. Actually, it's a very interesting question, because it's evolving. The one thing you don't want to do is overreact, because as folks are finding out now, one group of politicians took a path prior to the election, and now that path is going in a different direction. The uncertainties have always been there. That's where you want to use scenario models, scenario-planning types of activities, from a risk standpoint. Scenario modeling is not just good for climate change; it's good for all kinds of things. What you want to do is test the various things. Again, it ties back to the prior answer: you want to know what the assumptions are and what you're doing.
I've built a lot of models over the years, and they are getting more sophisticated. The problem is that when you get more sophisticated, it doesn't take much to create a real issue within the model. I do believe in keeping models as simple as possible. I would rather take 10 very simple models than one really good complex model. Within all the models, or whatever model you have, you want to really outline the assumptions, stress the assumptions, vary those assumptions, and figure out where all those things land. The uncertainty is always going to be there. The question is, have you laid out a pathway for that potential uncertainty? It's okay to say we don't know that this is going to be the answer. When I wrote the pandemic paper, I got lucky. I got about 80% of how the pandemic would work right and about 20% wrong. That's pretty good, but it was just a guess. You have to be comfortable knowing that you took it down a path. That's all you really need to do.
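The "outline the assumptions, stress the assumptions, vary those assumptions" advice can be sketched with a deliberately simple loss model run over a scenario grid. Every parameter below is made up for illustration:

```python
# Run one simple model under several scenarios rather than trusting a
# single complex model; compare the spread of outcomes.

def expected_annual_loss(frequency, severity, trend, years):
    """Expected loss under a simple compound severity-trend assumption."""
    return frequency * severity * (1 + trend) ** years

# Hypothetical scenario grid: each entry varies one assumption.
scenarios = {
    "baseline":        dict(frequency=100, severity=50_000, trend=0.03),
    "high_inflation":  dict(frequency=100, severity=50_000, trend=0.10),
    "shock_frequency": dict(frequency=150, severity=50_000, trend=0.03),
}

# Project each scenario five years out.
results = {
    name: expected_annual_loss(**params, years=5)
    for name, params in scenarios.items()
}
```

The point is not any single number but the spread: if one stressed assumption swings the result far more than the others, that is the assumption to go scrutinize.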
Ethics, regulations, and model accuracy
Well, if I didn't have to worry about ethics and regulations, we could get some very sophisticated, highly accurate models where no one would buy the insurance. Insurance is a pooling mechanism. That's how it started out. When people started to try to compete, that's when a lot of this segmentation really started to come in. You can only segment so far, because some people are just bad risks, and the rest of us have to make up for them. Again, insurance is actually a community product. We put our money into a pool just in case something bad happens. To some people, something bad happens all the time. The question is, how specific do you want to get? I mean, do you want people to have insurance? If you want people to have insurance, you have to lose some of that specificity. Again, it comes down to what your goal is. I believe in the mechanism of insurance. Do I want to pay more than I have to? No, but I'm going to have to, so everybody can get it. What's your passion for wanting to help other people?
Explainability vs. accuracy
Yeah. My comment in the chat was just that I find that models that impact actual people's health outcomes or safety outcomes or cost outcomes, speaking as a data scientist embedded in an actuarial department, need to be explainable before anything else. Those simple models, where you know exactly how the model got there, become so much more important than "we threw it into a black box model, we were 97% accurate, and we can't tell you why." If you're going to deny someone coverage, you need to be able to justify that. I was curious how you find that paradigm and that balance within some of the work that you do as an actuary or as an insurance provider. What does it look like for you to strike that balance between explainability and accuracy?
Without transparency, there is no model. You cannot charge someone something. You cannot deny someone something unless there's full transparency. There is absolutely no black box at all. I'm going to say our executive management is so on board with that. If we cannot explain it to a third grader, we can't do it.
If we cannot explain it to a third grader, we can't do it.
Posit tools and operational use cases
I'd rather have my data science team do that, because they're much smarter and better at it than I am. For us, I'm going to... I still call it RStudio. Sorry. They have gotten a lot of traction using those kinds of tools for what I call the operational side of the world. We have a premium audit department. For those who don't know, large accounts give us premium upfront, but depending on how many people they have or don't have employed, their premium can change either up or down. At times you could be talking hundreds of thousands of dollars swinging in either direction. We are obliged by law to make sure that they either pay, or we give back, the amount that we should. But because we have so many clients, one of my team's projects was to go out and basically figure out who they should talk to first: who has a significant variance from year to year and who doesn't. It's that type of use case, improving operational efficiency, because it allows us to get to those clients who we either owe money to or who owe money to us. It helps us figure out where we should be as an organization. That operational use case is really well suited, if you structure it right, for Posit types of tools.
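The premium-audit triage project Bill describes, figuring out who to talk to first based on year-to-year variance, could be sketched like this. Client names and premium figures are invented, and the real project's logic is surely richer:

```python
# Score each client by the size of its year-over-year premium swing and
# work the largest swings first. All data below is illustrative.

clients = {
    "client_a": [1_000_000, 1_050_000],  # modest swing
    "client_b": [800_000, 1_400_000],    # large swing: audit first
    "client_c": [500_000, 480_000],
}

def yoy_swing(premiums):
    """Absolute change between the two most recent annual premiums."""
    prior, current = premiums[-2], premiums[-1]
    return abs(current - prior)

# Work queue for the premium audit team, biggest variance first.
audit_queue = sorted(clients, key=lambda c: yoy_swing(clients[c]),
                     reverse=True)
```

Ranking by dollar swing is the simplest possible prioritization; a real version might normalize by account size or model expected audit adjustment, but the operational idea, sorting the queue by likely impact, is the same.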
AI and machine learning in the actuarial world
I'm old school. I still call it machine learning; there's no such thing as artificial intelligence. I think it's very useful. Actually, I just wrote an internal paper (I can't share it outside, because it's just for us) on how it should be used within the actuarial world in certain respects. The tools are going to make... this is a double-edged sword. The tools are great. I love them to death, because they make me more efficient, because I can do certain things. What I worry about with the tools is the lack of prior knowledge. I've been doing this 40 years. I've been able to see things and understand why they work. I've actually made the comment to a colleague: I started with punch cards. Then I went to a dumb terminal. Then I went to the first personal computer. Then I went to networked computers. Through all that technology, what I still have, and what I don't see in a lot of people today, is an understanding of the underlying concepts. You can take a tool and throw a lot of models at it, but you really do need that underlying conceptual stuff first. I think AI is going to really speed things up and make people better, as long as they have the underlying knowledge they need with the tool. My biggest concern is that we're not teaching people enough about some of the harder things they haven't done yet.
The future of insurance
Given your comments about how things are no longer, quote, unquote, once in a lifetime and will be speeding up, what do you see as the role of insurance, broadly speaking, going into the future? It seems like things are going to get harder and harder to pool in a way that works out in the math.
Actually, it is. I sit on what's called the General Research Council for the Society of Actuaries. We started talking about this because certain risks, I don't think, fit into the insurance mechanism anymore. We need, as a group of really smart people, to figure out how to get them taken care of, because they still need to be taken care of. You don't want to leave anyone in a bad state. The question becomes, who are the right people in the room, and what are the things that need to be done? Because you see it happening in Florida. You're going to see it really happening in California. The state is going to have to start taking a bigger burden, which means everyone in that state is going to be taking a bigger burden, because the insurance mechanism wasn't built for mass casualty events. If you look at all the mechanics behind it, it really is supposed to be the low-probability, higher-severity type of thing. Instead of an entire community burning down, it's a house in that community. That's how it was originally built. It's transformed over a generation. We, as a group, need to do something. You're going to need to involve regulators. You're going to need to involve the insurance industry. But you're going to need to involve the communities, too, to see how best to do it, because every community has something different. You're sometimes going to have to reach outside of our geopolitical sphere to get a solution. It's a complicated thing. We are trying to think about it now, but some things are going to be uninsurable.
Awesome. Thank you, Bill. Thank you so much for all of your insights and sharing everything with us today. I will wrap it up there because we have a minute left. I want to say thank you to Bill and to everybody for showing up. I would really like to let you know who's coming up next week as well. We have Victoria Prince, Senior Manager of Statistics at Takeda Pharmaceuticals. I'm going to put in the chat here for you a link to her page on the Data Science Hangout website, so you can go check her out. Find her on LinkedIn. Thank you so much, everybody, for making this a wonderful discussion. We will see you next week. Everyone have a wonderful day.
