Data Science Hangout | Christina Fillmore, GSK | Open source community collaboration in Pharma

Transcript#

This transcript was generated automatically and may contain errors.

Happy December 1st to everybody and welcome back to the Data Science Hangout. Hope you all had a great week. Last week, we were off for Thanksgiving. And so if you're joining us for the first time today, this is an open space for the whole data science community to connect and chat about data science leadership, questions you're facing and getting to learn about what's going on in the world of data science across different industries.

We share the recordings each week to our Posit YouTube, so you can always go back and rewatch or find helpful resources. We are putting them up to our new Posit Data Science Hangout site, just is taking a little bit longer, but they will be there. Together at the hangout, we're all dedicated to creating a welcoming environment for everybody here. So we love when you all can participate. And we can hear from everyone, no matter your level of experience or area of work.

And I had a dream last night that I forgot to say this part next and that nobody was talking. I don't know why I had this dream. But there's always three ways to ask questions and also provide your perspective. So it doesn't just have to be a question if you have something you want to weigh in on or a certain topic. But when you can jump in by raising your hand on zoom to you can put questions into the zoom chat and feel free to put a little star next to it if you want me to read it out instead it's in the zoom chat. And then third, we have a slide a link where you can ask questions anonymously.

One more thing we also have a slide, sorry, a LinkedIn group for the hangout too. So this also helps you connect with each other, we'll share that in the chat in a second here. We've learned that you do have to manually turn on the notifications for the group using the little bell at the top if you wanted to do that.

But with all that, thank you so much for joining us here. Happy to be joined by my co host for today, Christina Fillmore, data science leader at GSK. And Christina, thanks for joining us here. I'd love to have you maybe start by introducing yourself and sharing a little bit about your role. Maybe something you like to do outside of work too.

Not have become an IT company, so us developing software that helps everyone, like, deliver on the thing that we're all supposed to be doing. It's easier to argue that that's not intellectual property for us, because our intellectual property tends to be things like molecules and like, that's like, that's a thing that we make.

Starting a cross-company open source community

Libby and I, and a lot of other people have been talking about trying to do something that's inspired by the pharmaverse, but more for human resources and trying to sort of pull together people in different companies to sort of build packages together. Um, we have far less standardization and honestly, analytics is pretty poor. Um, so anything that can sort of help raise the bar would be really good. Any suggestions on, like, how to kick start a movement of getting people together or any lessons learned from the pharmaverse about what we should sort of do or avoid.

I don't personally believe you need to have a big splash in order to start. I think if you wait for a, like, a big, beautiful website and everything to be working before you try and get into anything, you're going to just, like, be waiting in the circle. Um, so I firmly believe finding some friends and start developing packages with them is, like, a great choice and it doesn't have to be a huge number.

I don't personally believe you need to have a big splash in order to start. I think if you wait for a, like, a big, beautiful website and everything to be working before you try and get into anything, you're going to just, like, be waiting in the circle.

It honestly, like, I wasn't necessarily run for the whole origin story of the pharmaverse, but, like, part of it grew out of something like Michael and Michael and Mike Stackhouse, two people who are not here. And probably you don't know, but basically, like, they were like, we should build some packages together. And I was like, okay. And they were like, Christina come help. And I was like, okay. And I built a package for them with a tourist. So, a different company that is not even a big pharma. It's a CRO. So, like, we, and a CRO, like, built a package or two together. And then, like, the Mikes were having more conversations with other people. And all of a sudden there was a beautiful website. And I was like, wow, everything's coming together. And so, like, to some extent, it's just like, talking to people and then starting to build stuff. Eventually, you'll have enough stuff that you will have your own verse.

But I highly recommend starting. And in terms of the, like, things aren't standard, what do you do? And how do you start? Sometimes a nice starting place is a package that helps standardize stuff. Metadata is not really in a standard format in pharma. Everyone has their own, like, way of holding this metadata, it kind of gets standardized to a thing called a define XML, but that's only like right at the end of the day, typically that define XML comes together. As you're like sending the thing to the FDA, it's not like that's, it's, it's not ready at the beginning.

And so, one of the first packages that I built as part of the pharmaverse was this package called MetaCore. As I said, I'm like, I hold many metadata things, don't know why. But part of the reason was, we wanted to build automation off this metadata. But everyone held it in a different Excel format, typically, or like, it was in a database, but like, it was their own way, they designed it their way. So, we needed to have a thing that we could all look at and be like, this is the thing, and we all use this thing, and this is how it works. So, like, what MetaCore came for us, it's just like an R6 object. It's just an object, like, it's the most boring package in the whole wide world. It does almost no things, it sits there, and you can poke at it. But like, that's it. But that enabled additional automation to be run off. So then, like, we have a different package called MetaTools that does a lot of this automation. But the first step was getting that thing we could all agree on, was like, the thing we were using. And it doesn't have to be perfect.