Resources

With a little help from my friends: Tools for developing & deploying algorithms in the hospital

video
Nov 15, 2022
1:11:19

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Hi, everybody. Happy Tuesday. Welcome to the Posit Enterprise Community Meetup. If you just joined now, feel free to say hi through the chat window and maybe where you're calling in from. I'd also like to say, if you want to connect with each other in the chat, that's all for you, too. So feel free to use the chat to meet other people who are joining us today as well.

Special welcome to those joining us for their first time today. This is a friendly and open meetup environment for teams to share use cases, teach lessons learned, and just meet each other and ask questions. These happen on Tuesdays at 12 Eastern Time, not every single Tuesday, but we do have a community event calendar, which I'll share in the chat in just a second here. Together, we're all dedicated to making this space inclusive and open for everybody, no matter your experience, industry, or background.

During the event, we'd love to hear what questions you have as well, so you're able to ask questions on YouTube Live and also anonymously through Slido, which I will share on the screen in just a second here as well, the link, and I'll put it in the chat. To address one common question up front, yes, the recording will be available, and it will actually be ready right away, like immediately as the presentation ends at the same exact YouTube Live link.

At one of the recent data science hangouts, which happen on Thursdays at noon Eastern Time, I was able to get a bit of a view into the amazing work the team at Unity Health Toronto is doing to bring predictive analytics to different clinical problems and resource problems in the hospital. And I'm so excited to be able to dive into that deeper today with Chloe Pou-Prom.

Chloe is a data scientist with the Data Science and Advanced Analytics team at Unity Health Toronto. Their team uses high-quality healthcare data in innovative ways to catalyze communities of data users and decision makers in making transformative changes that improve patient outcomes and healthcare system efficiency. I'd love to bring Chloe up here on stage with me, our virtual stage here. Thanks so much for joining us, Chloe.

Thank you for the introduction, Rachel, and thank you all for joining. I'm really excited to be talking about tools and insights for developing and deploying algorithms in the hospital.

So actually, as I was developing the slides, there were a few alternate titles I considered. Another one was this one right here: how we got our model out of one person's computer and deployed it to the hospital, which I think is an apt description of the things I'll be talking about for the next hour or so.

So I guess for the next while, I'll spend a bit of time going over DSAA, who we are, what we do, where we started versus where we are now, and then dive into the main content of the talk, which is: what are the things that you need for development and what are the things that you need for deployment? And all of this, of course, is in the context of working in the hospital, working in the healthcare setting.

About DSAA and Unity Health Toronto

So hi, everyone. I'm Chloe and I'm a data scientist for the DSAA team at Unity Health Toronto. DSAA, or rather Data Science and Advanced Analytics, is a healthcare data analytics group at Unity Health Toronto, also known as UHT. Throughout the slides, there are a few acronyms. I'll try my best to expand them. I think acronyms might be a data science thing or maybe a hospital thing.

So basically, DSAA's aim is really to work with the hospital, work with our collaborators and also our partners to make better decisions, increase hospital efficiency and improve patient care and patient outcomes. In particular, because we're working in the hospital, the projects that we work on will require process and change management. So that's why we work directly with clinicians and with administrative decision makers to develop and deploy our solutions.

Now, what kind of solutions do we work on? A little bit of everything. So we have solutions that will rely on statistics, some that focus more on artificial intelligence, some on machine learning, some on optimization, like operations research, for example. Currently, we have more than 40 active solutions at Unity Health Toronto, and these are, for example, solutions for predicting patient outcomes for enhanced clinical management, planning for hospital bed capacity. We have some medical imaging AI tools and we also have some solutions for assignment and scheduling.

Here is an example of some of the things that we've worked on. So right here, I have a screenshot of our COVID dashboard. Here we're reporting the counts of people who are COVID positive, people who have been exposed to COVID, as well as people who previously had COVID and are now recovering. And in this case, we're showing the counts for the three different sites that make up our hospital network.

So Unity Health Toronto is based in the Greater Toronto area and it is made up of three different hospitals. So we're showing the numbers for those three hospitals.

Here we have an example of our warfarin dosing tool. This is an application for calculating the warfarin dose based on various input features. Warfarin is a medication, a blood thinner, and we're calculating the dosing based off different input features. And then we have our emergency department forecasting tool. So here we're counting the number of patients that are arriving at the ED. On the right hand side, you can see we have our forecasted numbers, and we're also reporting the historical numbers so you can get a sense of how well the model is doing.

So who are the people that work within the Data Science and Advanced Analytics group? Our team is actually made up of four different teams. First, we have a data integration and governance team. They are the ones who connect to the databases and transform the data in a way that is amenable to data science. And then we have the advanced analytics team. They're the ones who take the data and build a model on top of it. Next we have the product development team. They're the ones who deploy the model.

So they're also the ones that are handling any kind of like downstream integrations. So, for example, if you have a model that is supposed to send a push notification, the product development team will be the ones who are taking care of that. And finally, we have the support team who are really the glue that hold everything together. So they're the ones who take care of project management and coordination at both an internal and external level. So because we're collaborating with lots of different groups within the hospital, this is where having the support team is really helpful.

ChartWatch: an early warning system

Okay, so I briefly talked about some of the different solutions that we've worked on. I want to dive in and focus on one solution in particular, a successful deployment known as ChartWatch. In August 2020, we deployed ChartWatch, which is an early warning system for detecting patient deterioration, and to this day it's still running in the hospital. ChartWatch was deployed to the general internal medicine ward within the hospital.

I guess, as an aside, why is it called ChartWatch? Really, it's a branding thing. Before becoming known as DSAA, we were actually known as CHART, which is the Centre for Healthcare Analytics Research and Training. So one thing you may notice throughout the talk is that I do mention a thing known as CHART, and it's really because of who we were initially known as.

So we developed ChartWatch. It's an early warning system. And really, it's a model that predicts which patients are at risk of deterioration. So what I mean by deterioration is any of the following. So either transfer to the intensive care unit or ICU, death in the hospital, or transfer to the palliative care unit.

So the way that the ChartWatch model was trained is it was trained on about 20,000 patient visits. And these data consisted of laboratory values, vital measurements and demographics. So these are all things that you would typically find in an electronic health record. And the way the model works is it outputs a risk group. So for every patient, you would generate a risk group. So the patient will be classified as either being high risk versus medium risk versus low risk. And then these risk groups are then delivered to different end users. And you have different methods of delivery based on who the predictions are going out to.
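To make the risk groups concrete, here's a minimal sketch of mapping a model's predicted risk score onto high, medium, and low groups. The cutoffs here are hypothetical, for illustration only, not ChartWatch's actual thresholds.

```r
# Hypothetical cutoffs; ChartWatch's real thresholds are not described
# in the talk.
assign_risk_group <- function(score) {
  cut(
    score,
    breaks = c(-Inf, 0.1, 0.3, Inf),
    labels = c("low", "medium", "high")
  )
}

assign_risk_group(c(0.05, 0.2, 0.6))
#> [1] low    medium high
#> Levels: low medium high
```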

So, for example, we have emails. These are sent to our charge nurses and our palliative care team. What these emails contain is basically a list of patients with their risk group listed. So if you look at the third column from the right, you'll see the ChartWatch risk groups are included here. Another method of delivery is updates to the front end tool. Here we have a screenshot of our electronic sign-out tool, which is typically used by clinicians. And if you'll notice, the screenshot has a column for ChartWatch. Once again, the third column from the right is a ChartWatch column, and it contains the ChartWatch risk groups. In this case, the risk groups have been color coded: the high risk group as red, the low risk group as green, and the medium risk group as yellow. So we're following the traffic light system.

And then finally, the last method of delivery is alerts that are sent to phones. For patients that are flagged as high risk, we actually send out an alert to clinician phones to let them know that the patient is at a high risk of deterioration. So going through the different methods of delivery, really, they vary in level of urgency based off the end users that they're going to and based off the information that's being communicated.

From before to now: the development journey

OK, so I've gone over ChartWatch, but I have this slide just to really pause, take a step back and reflect on how we got from where we were to where we are now. I should probably title this slide back to the past, because we are going to the past. So just to give a bit of context, this is where things were before. Before, we had scripts that were running from one person's laptop. It works, but there are lots of ways in which this can fail. So, you know, what if this person is sick? What if they go on vacation? Most of the laptops we use are Windows laptops, so what if we get that Windows update? The laptop will be out for a while. We had no logging. We had no development environments. That's how things were before.

Now, here's where things are at. Well, you know, things are much better. We have service accounts, which basically removes the reliance on one person. We log all the things. We have staging versus production environments, and so much more. So how did we get from before to now?

I'd say really it's a combination of hard work, trial and error, lots of talking, talking to the right people, figuring out what works and also what doesn't work. And I'd say out of all the tools and findings, I think we can summarize them into two categories. So we have the tools for development and also the tools for deployment. So let's dive right in. What do you need for development?

Tools for development

So the first thing that we need for development is really connections to databases. So within the hospital, there are various data systems. So, for example, we have a different system for labs, a different one for surgical procedures and so on. So in order to help the data science team work better, we developed a package called ChartDB. So ChartDB is an internal R package to interact with hospital databases.

And with regards to connecting to databases, we wrote a bunch of connection functions and they all follow the same pattern. So, for example, if I want to connect to database A, I just need to give as input my username and my password. If I want to connect to database B, I also give as input my username and my password. And if I want to connect to the hospital's enterprise data warehouse, or rather the EDW, I would give, as you expect, my username and my password as input to the function.

So internally, this is actually what's happening. Internally, we have a bunch of retries because, you know, the connection may not work the very first time. We also have a bunch of checks, so making sure that the connection is successful and that what's being returned is what you expect. So ChartDB offers connection functions and it also offers other utility functions. So, for example, I have this get patient ADT function right here. And what ADT stands for is Admit Discharge Transfer. So, basically, when I call this function, it will return a list of patient movement events.
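As an illustration, here's a minimal sketch of what a ChartDB-style connection function with retries and checks might look like, assuming the DBI and odbc packages. The DSN, retry logic, and checks are hypothetical; the actual ChartDB internals aren't shown in the talk.

```r
library(DBI)
library(odbc)

connect_to_edw <- function(username, password, max_retries = 3) {
  for (attempt in seq_len(max_retries)) {
    con <- tryCatch(
      DBI::dbConnect(
        odbc::odbc(),
        dsn = "edw",      # hypothetical data source name
        uid = username,
        pwd = password
      ),
      error = function(e) NULL
    )
    # Check that the connection was made and is still valid
    if (!is.null(con) && DBI::dbIsValid(con)) {
      return(con)
    }
    Sys.sleep(2^attempt)  # back off before retrying
  }
  stop("Could not connect to the EDW after ", max_retries, " attempts")
}
```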

All right, so we have database connections. Another thing that we need for development is an environment that lets multiple people collaborate on projects: a really reproducible development environment.

So in order to help with that, we ended up using renv. Here I have a link to renv and here's a screenshot of the documentation. As I said before, at the end, all of the links will be included in the slides. So renv is an R package for R dependency management.

And how are we using renv in our projects? Well, here's an example of our project structure. So, you know, we have our main folder, we have a bunch of files, a bunch of subfolders. And really, I want to bring attention to that last file at the end, which is the renv.lock file, which is where all of the renv magic happens. So this is what the renv.lock file looks like. And basically what it does is it specifies the different packages that your project relies on, as well as the different package versions. So, for example, here, my project relies on dplyr and I'm using version 1.0.7.

So, you know, I could then give this renv.lock file to a teammate, and they can ensure they're downloading the same packages at all the same package versions to contribute to the project.
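For reference, the typical renv workflow looks something like this (a minimal sketch using renv's documented functions):

```r
renv::init()       # set up a project-local library and create renv.lock
install.packages("dplyr")
renv::snapshot()   # record dplyr and its exact version in renv.lock

# A teammate who receives the project can then run:
renv::restore()    # install the exact package versions from renv.lock
```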

Okay, so I've highlighted a few things for development. So I have my database connections, I have my reproducible environment, which is facilitated through renv. Another thing that I need is functions and utilities that are reused. So really what I mean by that is package-based development.

So here I have a link to the R Packages textbook, as well as a screenshot of the documentation. I highly recommend it. So really, why write a package? There are a bunch of reasons, but I'd say that for us specifically, what we found the most relevant was being able to share code and knowledge with others, as well as having no more copy-pasting. I find that as I'm developing, if I'm copy-pasting the same bit of code over and over, it rings an alarm: maybe I should take a step back and think about what I'm doing.

So if we revisit the project structure that I introduced a few slides ago, this R subfolder right here is where you have all of your package functions. And then there is the description file right here, which is where you would define the attributes of your package. So for example, this is a description file that I have for ChartWatch. So, you know, it's listing out a bunch of attributes like the package name right here. So it's ChartWatch. The package version 1.11.10. And at the very end, I have the imports right here, which specifies the packages that my package relies on. So the dependencies.
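As a sketch of what package-based development looks like in practice, here's how a package like this could be scaffolded with the usethis and devtools packages; the package and file names are illustrative, not the actual ChartWatch setup.

```r
usethis::create_package("chartwatch")  # creates R/, DESCRIPTION, etc.
usethis::use_package("dplyr")          # records a dependency under Imports
usethis::use_r("alerts")               # creates R/alerts.R for shared functions
devtools::document()                   # builds documentation from roxygen comments
devtools::check()                      # runs R CMD check on the package
```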

Okay, so we're adding more to our development checklist. We have our database connections. We have an environment that allows multiple people to collaborate. We have package-based development. Another thing I'm going to add is environments, environments, environments. The reason I say the same word three times is because we actually work with three different environments: the development environment, the staging environment, and the production environment.

So the development environment, this could be your local computer. But in our case, we usually use a development server. Our development server includes GPUs, which allow for any projects that require deep learning. And also, the development server works nicely with ChartDB. That means that anyone who wants to develop something is guaranteed that database connections will work for them, so they'll be able to access the data that they need for their projects.

Next, you have the staging environment, which is as close to the real deployment environment as possible. So for example, when we need to make updates to ChartWatch, we would first apply it to the staging environment to make sure everything is working as we expect it. And then finally, there's a production environment, which is where things actually get deployed.

Tools for deployment

So actually, before I dive in, I'm just going to take a small intermission so I can reflect a bit on ChartWatch's deployment. ChartWatch was deployed in August 2020, but it was actually first silently deployed in November 2019. And I guess the reason for this large gap is because initially ChartWatch was supposed to go live in early 2020. But then the pandemic happened and it affected plans. The hospital's priorities shifted at that point; they wanted to focus on containing the rising number of COVID cases in the hospital. And then ChartWatch kind of took a bit of a backseat.

So just to recap how things looked back then, here's the old project structure. There are a bunch of subfolders, numbered starting with 00. At the very bottom, there is that error_check.py file, so there is Python code here. And then I also have my .Rproj file at the very bottom, so there is R code. And actually, if you look at each of these different subfolders, you'll find a mixture of R and a mixture of Python. I believe we also had some Java code at that point.

Yeah, so, actually, I just want to address some really great points in the chat. I really like what you're saying about branding. That's one of the things that we try to do with our ChartDB package. Yeah, definitely like that. So, going back to how things were before: when we first tried to deploy ChartWatch, we had a mixture of Python and R scripts and also some Java. And the way we were running everything was this complicated thing.

So the way things were running is we had a mixture of Python and R scripts, and we had this cron job, which was calling different bash scripts, which were then calling different Python and R scripts. And I guess the biggest red flag is that our test environment was our staging environment, which was our production environment. So, for example, in ChartWatch, one of our outputs was an email which contained a list of patients with their risk group. And you can imagine, as I'm developing this, I would send that email to my own email and make sure things look as they should. But because our test environment was our staging environment, was our production environment, when it came time to finally deploy, I would have to comment out the line that had my email and then set it to whatever test email we had. So if I'm very diligent, if I'm very careful, things will work out. But this method is very prone to errors.

So, with that being said, what are the things that you need for deployment? The first thing that we really need for deployment is authentication. All of our deployed applications run on RStudio Connect, which is actually now Posit Connect. One of the nice things about Posit Connect is that it interacts with the hospital's Active Directory. What this means is that users can authenticate using their hospital username and password. So, for the developer, we don't need to keep track of an extra server username and password. And for the end users, in order to access applications, they can just log in with their hospital credentials. A nice thing about being able to interact with Active Directory is that we can just use the existing groups to manage permissions.

So, for example, if I have an application that I only want to give access to DSAA, so the data science group, I could just use the existing Active Directory group to enable that. Another thing that RStudio Connect, or rather Posit Connect, allows us to do is to schedule scripts. We run our applications from service accounts, which is basically a non-user account in which we can run automated services. And it's this combination of automatic scheduling and using a service account that really makes it so that deployments don't rely on one person.
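For context, publishing content to Posit Connect from R typically goes through the rsconnect package; here's a minimal sketch, with a hypothetical app directory and server nickname:

```r
library(rsconnect)

deployApp(
  appDir  = "chartwatch-dashboard",  # folder containing the app or report
  appName = "chartwatch-dashboard",
  server  = "hospital-connect"       # nickname of a registered Connect server
)
# Scheduling, access control, and the service account the content runs
# under are then configured in the Connect dashboard, as described above.
```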

So, you know, this is how things were before: we had this cron job, which would call different bash scripts, which would then call different Python or R scripts. But now that we're using Posit Connect, we can leverage a lot of different things.

Sorry, the screenshot is a bit small, so I listed all the different tabs on the side. So, you know, we have the application information. We have the access tab, which is what we use to specify, for example, who can access a specific dashboard. We have the runtime settings and the scheduling settings; the scheduling settings are what we use to schedule our scripts. Then there are the tags, which are really for organizational purposes. So, for example, all the ChartWatch scripts would get tagged as being ChartWatch. And there are environment variables and logs.

So, we're adding a few things for deployment. We have authentication, we have scheduling. Another thing that's useful for deployment is being able to know when there's a downtime. So, for example, here is a screenshot of one of our Slack channels where we received an alert that, for example, ChartWatch is down. In order to enable this, we created Jarvis, which is an R package for helping us monitor our production applications.

Jarvis is Iron Man's robot assistant; that's where the package name came from. So we use Jarvis for a few notifications. We can use it to send email alerts. In this example here, Jarvis sent an email letting me know that there's an issue with the COVID dashboard. As shown previously, we can use Jarvis to send Slack messages, like here. And we also use Jarvis to check the health of systems that we depend on. So, for example, if I want to check a specific database, I can check that my connection to the enterprise data warehouse still works. And I can also check that I can access different file systems with Jarvis.
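To illustrate the idea, here's a minimal sketch of a Jarvis-style health check that verifies a database is reachable and posts to Slack if it isn't. The webhook URL and DSN are hypothetical, and this is not the actual Jarvis API.

```r
library(DBI)
library(odbc)
library(httr)

notify_slack <- function(message, webhook_url) {
  # Slack incoming webhooks accept a JSON body with a "text" field
  httr::POST(webhook_url, body = list(text = message), encode = "json")
}

check_edw <- function(webhook_url) {
  # dbCanConnect() returns TRUE if a connection can be established
  ok <- DBI::dbCanConnect(odbc::odbc(), dsn = "edw")
  if (!isTRUE(ok)) {
    notify_slack("ChartWatch alert: cannot reach the EDW", webhook_url)
  }
  invisible(ok)
}
```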

So, on top of using Jarvis for different notifications, we also developed a few downtime protocols, which are pretty project specific. For example, with ChartWatch, if ever something goes down, we would send this kind of email to all the hospital users. We have a specific email template that we use here, and this email template actually follows the one that IT uses. So it's color coded at the top, where red is for unplanned downtime, blue is for planned downtime, and so on. There's a list of things that we need to specify, and this will go out to all the hospital users. Not all projects will require something like this, but something like ChartWatch, which is embedded into clinical care, does require something a bit more robust. Basically, when things go down with ChartWatch, it's not just the DSAA team that needs to be notified. It's other people within the hospital.

All right, so you have authentication, scheduling, downtime. I'd say another thing that you need for deployment is a secure way to download internally developed packages. To give a bit of context, around the holiday time last year, we started seeing these kinds of messages in our Slack channels: not able to receive emails, check log4j. Sorry, running late because I was in a meeting with IT because of log4j. If you use PyCharm, be careful, there's a log4j file in there. I found log4j files in my VS Code folder. There's something going on with log4j.

Log4j, for those who don't know, is a Java-based logging utility. And in December 2021, there was the log4j flaw. Basically what happened is this could allow malicious users to access internal networks. And just to highlight the severity of the log4j flaw, here are some headlines that were coming out around that time: log4j is a pervasive vulnerability, update your devices now. A hole in a popular piece of code is an open window for criminals. The log4j software flaw is endemic, according to the Cyber Safety Review Board.

So really, this whole thing about log4j is to motivate the next point, which is that we need to limit who can access the hospital network. And this is where something like Posit Package Manager comes in. Posit Package Manager, formerly RStudio Package Manager, is a repository management server. And what it allows us to do is download packages while being disconnected from the Internet.

So if I go back to renv, which I mentioned a few slides ago, remember we had our renv.lock file, which specifies the different packages and versions. Here it also specifies where the package is being downloaded from. So instead of being downloaded from, say, GitHub or from CRAN, we're downloading from RSPM, which stands for RStudio Package Manager, which is also Posit Package Manager.
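In practice, this amounts to pointing R at the Package Manager instance instead of CRAN; a minimal sketch, with a hypothetical internal URL:

```r
# Resolve packages through an internal Package Manager instance
options(repos = c(RSPM = "https://rspm.hospital.example.org/cran/latest"))

install.packages("dplyr")  # downloaded via Package Manager, not CRAN
renv::snapshot()           # the lockfile records the RSPM repository
```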

Implementation plan and clinical workflow

So a few more things that we've added for deployment. We just added a secure way to download internally developed packages. One more thing I want to add is an implementation plan. So far, all the things that we've discussed can apply to data science in general. The implementation plan is probably a bit more focused on the healthcare setting and on deploying within the hospital.

So just to recap how ChartWatch delivers predictions, they're going to different end users. We have our emails that are going to charge nurses and the palliative care team. We're updating the electronic sign-out front end tool that clinicians access. And we're sending alerts directly to clinician phones whenever a patient is flagged as high risk. And really, just looking at the way things are delivered, there are a lot of people involved. You have your charge nurses, your palliative care team, you have IT, you have residents and physicians.

And it's with that in mind that, when we put together the implementation team, we wanted to make sure that people from all of these different end user groups were represented and included in the implementation team meetings. So we made sure we had people from general internal medicine, from the intensive care unit, from palliative care, from clinical informatics, from IT, and then from the data science team. I just want to highlight that collaboration is really important here. Also, the whole idea for ChartWatch actually came from a general internal medicine doctor. So they were the ones leading the project throughout the whole course and really making sure we went from development all the way to deployment.

So one of the things that the implementation team was tasked with doing was coming up with an intervention. After ChartWatch flags a patient as being high risk, what happens then? So the implementation team developed a clinical workflow. This is what the clinical workflow looks like. I won't go into too much detail, but I will point out a few things. When a patient is flagged as being high risk by ChartWatch, there are a few things that need to happen after that. One of them is placing orders to increase vital sign measurements, reviewing recent labs, reviewing recent medications and recent imaging orders. And really, all the things that you find in this clinical workflow are all things that a clinician would normally do if they suspected a patient of deteriorating.

So when developing this checklist, this clinical workflow, it was really important to make sure that it wasn't disrupting any existing processes. You want to consider existing resources; you don't want to reimplement a workflow that already exists, for example. And also, the alerting notifications all fit within existing processes. So, for example, when it came to the timing of emails: the email that goes to the charge nurse is sent out twice a day, at times that make the most sense with how the shifts work. So once again, we don't want to disrupt how things are working currently.

And with regards to the clinical pathway, we really ensured that there are time targets. So, for example, after ChartWatch flags a patient as being high risk, one of the things that the clinicians must do is increase the number of vital measurements within 24 hours of the alert being sent. Having this 24 hour time target means that we can then go into the electronic health record and measure whether or not this is happening. And, of course, leaving room for clinical judgment.

And so when working on the implementation plan, we included a silent deployment period. Basically, this is where your entire pipeline is running, but the final outputs are silenced. This is useful for identifying bugs and addressing unexpected changes, such as a change to how the troponin lab test is done. The troponin test is one that's commonly used to diagnose heart attacks and other heart related problems. During the silent deployment of ChartWatch, the way this test is measured got changed, and this is something that we had to address in our data extraction code and also in our data processing code. Other changes came from deploying at the beginning of the pandemic. During that time, lots of rooms were being made available or unavailable based on whether patients were testing positive for COVID, and patients were being moved to completely different units in the hospital just based on capacity. So this had to be addressed in our data extraction code.

Another thing that the silent deployment period was useful for was catching bugs. This is a really interesting bug that we encountered. I'm guessing that most people watching this talk are R users, so I can ask the following question: how are missing values represented in R? I think most of you know that in R, missing values are represented as NA, which means not available. Now, next question: what's the chemical symbol for sodium? I think some of you can see where this is going. The chemical symbol for sodium is also Na.

So we work in the hospital. In the hospital, there are blood tests that are done, and one of the things that gets measured in blood tests is sodium. So when we silently deployed ChartWatch, we noticed that there were no sodium tests being measured. This is an example: this is a graph that shows all the electrolyte counts. In blue is sodium, and if you notice here, the blue is set to zero. OK, this doesn't seem right. What's going on? So when we went and checked the code, what we found was that the R code that we were using to read some of the labs data was interpreting all of the sodium tests as being NA, not available. So yeah, this is what happened. The fix was actually pretty straightforward. It was just updating one of the function parameters to not read NA as not available. It was just a very interesting bug that we ran into.
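For R users, the bug and the fix look roughly like this (a minimal sketch assuming the labs arrive as a CSV with a test code column; the file and column names are hypothetical):

```r
library(readr)

# By default, readr treats the literal string "NA" as missing, so every
# sodium test (test code "NA") silently becomes a missing value:
labs <- read_csv("labs.csv")

# The fix: only treat empty strings as missing, so "NA" survives as sodium
labs <- read_csv("labs.csv", na = "")
```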

So once we were done with the silent period, it was time to go live. Actually, for us, we had a pilot phase in which we first deployed to two general internal medicine teams only. And during that time, we had weekly meetings which were used to debug any issues we ran into and go over the patients that were flagged by ChartWatch, checking: does it make sense, or is there anything weird or unexpected? So that was the approach that we took for going live. And then we rolled out in a phased approach: we started off with two teams and then deployed to the rest of the teams.

And really, throughout this whole process, so throughout the pilot phase, throughout deployment, throughout developing the clinical workflow, end user engagement is important. This is not something that could have been possible with just a data science team. When it came to developing the clinical pathway, we really relied on the general internal medicine team, on the palliative care team, and on the nurses we were working with to develop that pathway. And also, when it came to developing education and training processes, that was, once again, a highly collaborative effort. So yeah, among the things you need for deployment, an implementation plan is super important.

Monitoring in production

One last thing that I will mention for deployment is something about monitoring. There's a question mark here because this could be a whole talk, and we're already nearing the end of this talk. It really is a whole talk; plenty of other people have spoken about monitoring, so I would redirect you to watch these amazing talks. Lots of great insights. With the time remaining, I will mention a few things specific to ChartWatch, though.

So with ChartWatch, one of the things that we want to look at is model performance. But that's a bit hard to do once the model has been deployed. One of the things the clinicians asked for was a target positive predictive value of 0.33. What this means is that for every three alarms, at least one of them should be correct. But that is kind of hard to measure once you are deployed and in production.
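As a quick refresher, positive predictive value is the share of alarms that turn out to be true positives:

```r
ppv <- function(true_positives, false_positives) {
  true_positives / (true_positives + false_positives)
}

ppv(1, 2)  # one correct alarm out of every three meets the 0.33 target
#> [1] 0.3333333
```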

What I mean by that is, let's say ChartWatch flags a patient as being high risk and sends out an alarm, and then that patient doesn't deteriorate: they don't transfer to the ICU, they don't die. Does that mean that the ChartWatch prediction was incorrect? Or does that mean that the ChartWatch prediction was correct, and because clinicians received that alert, they were able to intervene and prevent that outcome from happening? So, yeah, in deployment we get these cases where there are these false alarms, which we're not sure if they're actually false alarms or if they're averted outcomes, thanks to a successful intervention.

Another thing we measure in ChartWatch is the daily number of alerts. On average, we're sending two alerts per day, which is a reasonable number that the clinicians are happy with. There are some days where there are no alerts and there are some days where there are lots of alerts. For example, I think there was a day with eight or nine alerts sometime in April or May.

So, yeah, one of the things that we were really concerned with was making sure there weren't too many alerts. So we worked with the clinicians to ensure that we were minimizing alerts. Right now, the way things work is that a patient will receive an alert if ChartWatch classifies them as being high risk, and we added some additional rules on top of that. Those rules we developed with the help of the end users. So we don't re-alert a patient unless it's been 48 hours since their previous alert. This just ensures that we're not alerting for the same patient every hour, for example. And we added this additional rule, which is that we don't alert for a patient if they just came from the intensive care unit. Because if a patient is coming from the ICU, the clinicians already know that they may not be doing too well, and it's expected that their vital measurements and their labs might be looking a little off because they just came from intensive care.
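Here's a minimal sketch of those suppression rules as a dplyr filter, assuming a data frame of candidate alerts; the column names are hypothetical:

```r
library(dplyr)

# candidates: one row per patient, with columns risk_group ("high"/
# "medium"/"low"), last_alert_time (POSIXct or NA), recently_from_icu
# (logical)
filter_alerts <- function(candidates) {
  candidates %>%
    filter(risk_group == "high") %>%
    # Don't re-alert unless 48 hours have passed since the previous alert
    filter(
      is.na(last_alert_time) |
        difftime(Sys.time(), last_alert_time, units = "hours") >= 48
    ) %>%
    # Don't alert for patients who just came from the ICU
    filter(!recently_from_icu)
}
```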

The last thing I will say about monitoring is regarding clinical adherence. How do we know that when a patient receives a ChartWatch alert, the clinician is actually doing something and not just ignoring that alert? One of the things that we look at is the number of vitals that are measured after an alert is sent. We do that because, as part of the clinical workflow, one of the things that the clinicians have to do when they receive a ChartWatch alert is increase the number of vital measurements. This is something that we can easily measure just by looking at the electronic health record and counting the number of vitals measured after an alert has been sent. It's not a perfect measure, because the number of vitals can increase for reasons unrelated to ChartWatch, but it's a proxy and it gives us a good enough idea of clinical adherence.
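A sketch of that adherence proxy, counting vitals in the 24 hours after each alert; the data frames and column names are hypothetical:

```r
library(dplyr)

# alerts: patient_id, alert_time (POSIXct)
# vitals: patient_id, measured_at (POSIXct)
adherence <- alerts %>%
  left_join(vitals, by = "patient_id") %>%
  filter(
    measured_at > alert_time,
    measured_at <= alert_time + 24 * 3600  # within 24 hours of the alert
  ) %>%
  count(patient_id, alert_time, name = "n_vitals_24h")
```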

Summary and Q&A

So this brings us towards the end of our talk. Let me just summarize some of the things that we talked about. We looked at the different tools for development and deployment of an early warning system in the hospital. On the development side, the things that we found useful were database connections, a reproducible environment facilitated through renv, package-based development, and the different development, staging and production environments. And then on the deployment side, we had authentication and scheduling, which were facilitated by Posit Connect. We had downtime protocols, a secure way to download internally developed packages with the help of Posit Package Manager, an implementation plan in which we worked with our different collaborators and end users, and then we had monitoring at the very end. And, you know, all of this would not have been possible if it weren't for collaboration. I think it's something I keep saying throughout the talk: collaboration is really important. Developing and deploying ChartWatch in the hospital is not something that we could have done on our own. It was really by working with a great team and a lot of different people within the hospital that this was possible.

So this brings us to the end of what I prepared. And as I said before, the slides will be available. And at the end, I do have the list of all the papers, talks, blogs that I mentioned. You can all find them at the end. Awesome. Thank you so much, Chloe.

One from the beginning, which we touched on for a second, is: can you talk more about the development process? What came first? Were the steps dependent or parallel, or did these changes happen simultaneously?

A lot of things were happening at the same time. In the talk, I think I mostly focused on the implementation team and all the things that the implementation team was doing to get this from development to deployment. But there were different sub-teams that were working on this. There was the evaluation team, which was trying to address the question: how do we ensure that the ChartWatch deployment is successful? So they're the ones coming up with things like, we should measure vitals, because you've got to measure clinical adherence in one way or another. And there was the modeling team, which is mostly the data science people and IT, who were focused on actually building the model. So different teams working all at once. I'd say the way this all started is really from the general internal medicine doctor who came up with the idea first and came to the DSAA team and said, hey, it would be cool if we had an early warning system. And then it all kind of snowballed into this.

Thank you. I see somebody else asked, and I guess this is a follow up here, but did the team come first or did you architect these workflows and then build the team?

So the DSAA team, or rather at that time the CHART team, we did have that first, so we did have a data science team. I feel like the different groups, like the implementation team, are something that came as we progressed, as we started working towards developing the tool. We had all of the building blocks; assembling them together kind of happened a bit later.

So on the topic of the team building, then, can you speak to any lessons learned from building four teams within DSAA? What did it take for leadership or management to take on a big scale project like ChartWatch?

Yeah, that's a great question. So when we first started off, we did not have the four team structure. We just had data scientists doing a bit of everything. We found that there were certain people who were a lot more knowledgeable in one aspect, so I think it just made sense to split it up into these four different teams. And then there is a director for each of these teams. I think it makes the management and coordination a lot more streamlined. So as a data scientist, I can focus on doing data science work; I don't need to think about all the other unrelated things.

Absolutely. And I see Rebecca asked a question over on YouTube: what is going on under the hood for those alerts that you showed us? How do you do it, with functions for emailing or for checking database status?

Yeah, yeah. So I guess more specifically, when we're checking the database status, we are just checking that a connection can be made. So I just try to connect using my usual connection function, and if that doesn't work, then I know that something is going on. With regards to actually sending out the alerts: for sending Slack alerts, we are working with the Slack API. And then for sending out emails, there are a bunch of R packages for that, actually. The one we use is sendmailR. This is what we use to send out our emails.
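For example, sending an email with sendmailR looks roughly like this; the addresses and SMTP server are hypothetical:

```r
library(sendmailR)

sendmail(
  from = "<chartwatch@hospital.example.org>",
  to = "<charge.nurse@hospital.example.org>",
  subject = "ChartWatch daily risk groups",
  msg = "See the list of patients and their risk groups below.",
  control = list(smtpServer = "smtp.hospital.example.org")
)
```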

I see Ezra also asked over on YouTube, how early in a project do you typically shift to developing an R package?

Oh, that's a good question. That's something that we sometimes struggle with, because I guess you don't want to do it too early, because it might seem like you're over-engineering things. I think for us specifically, for ChartWatch, we first deployed in November 2019, and then everything was put on pause. And it was over the summer that we decided, okay, let's just build this all as an R package. So I'd say by that time, we had all of the requirements nailed down: we knew what the alerting schema would look like and which groups we would be interacting with. So I'd say wait until you know what the final requirements look like before shifting to developing an R package.

I see Toyin just asked, how long did implementation planning take? Large stakeholder groups can be unwieldy.

So the idea for ChartWatch initially started back in 2018. And yeah, large stakeholder groups can be unwieldy. It's lots of schedule managing and trying to find a time that works for everyone. But I think the sort of thing that we wanted to make clear is: if you want this to actually get implemented, you need to be committed to working towards it. You need to be able to show up to our monthly or bi-weekly meetings. So yeah, it was hard, but we managed to get it working. And I think a lot of credit goes to the support team, who were able to make all of these meetings happen and ensure that the project coordination actually followed through.

Jumping back over to... Actually, somebody asked this, I think, over on LinkedIn. How does DSAA fit into the overall organizational structure?

So we are directly part of the hospital. So I guess when we started off, we were a bit more research-based. But then I think it was in 2019 or 2020, we officially became part of the hospital. So advanced analytics is a priority for the hospital.

I know a little bit ago you were mentioning the idea for ChartWatch came from one of the clinicians. Mark asked, in what way are clinicians involved in the development process? How do you prioritize what you work on as a team?

Yeah, so clinicians are completely involved. They're involved from day one. Our project intake model is very clinician-driven. So it's not the DSAA group that comes up with the ideas; it's the end user that comes to us with an idea, and then we work together. In this case, we had Dr. Amol Verma who came to us: hey, I want to develop an early warning system, let's work together to do that. So yeah, clinicians are involved from day one.

Another question was, how did you succeed in gaining the confidence of decision makers in the application?

I think for this, we also leveraged the clinicians we were working with. Amol was really useful in talking to the clinicians and telling them, hey, the ChartWatch model is something that is reliable, that works. I think the education and training piece was also useful in getting the different end users accustomed to ChartWatch. And also, when we first deployed, we did have regular meetings, so it was a good way to kind of get a vibe check: hey, how are things going? Are people still receptive to it? So just checking in regularly and ensuring that everyone was happy with how things were going.

I see another question, and I might need help with an acronym here. How do you deal with medical device regulation if applications, for example ChartWatch, fall under the category of SaMD, Software as a Medical Device?

Yeah. So I believe ChartWatch doesn't actually count as a medical device. The way that we deployed it, this was all under the umbrella of quality improvement. I know that we had to write a bunch of documentation, and it all fell under quality improvement.

A question that I had is if somebody maybe works in a hospital today and is inspired by the talk that you gave and is thinking like, I want to get started and I want to do this at our hospitals.