David Granjon & Bo Wang @ Novartis | User-friendly, self-serve tools | Data Science Hangout

Transcript

This transcript was generated automatically and may contain errors.

Welcome to the Data Science Hangout. Hope everybody's having a great week and is enjoying the Shiny conference. Thank you so much to the Appsilon team for putting on this amazing conference and for asking for the Data Science Hangout to be a part of that. It's really cool to be included in the conference.

But I know we have quite a few people here joining the Data Science Hangout for the very first time. So thank you all so much for joining us. I'm Rachel Dempsey and I lead our pro community at Posit. So we host this Data Science Hangout every Thursday at the same time, same place.

So what is it? What's going on here? This is our open space to chat about data science leadership, questions you're facing, getting to hear about what's going on in the world of data across different industries. And so every week we feature a different data science leader as my co-host to help lead our discussion and answer questions from you all. There's no presentations or slides, just all conversation with everybody.

We're all dedicated to making this a welcoming environment for everyone. And so we always love to hear from everybody, no matter your level of experience or area of work. This is a bit different than the Hopin platform that you're in for the Shiny conference. So I just want to make people aware of that to be a bit careful with unmuting and those things that come with Zoom.

It is totally, totally OK to just listen in here. But there's also three ways that you can jump in so you can ask questions or provide your own perspective on certain topics by raising your hand here on Zoom. If you're not familiar with Zoom, it's in the bar below where it says reactions. If you click on that, you can raise your hand. You can also put questions in the Zoom chat. And if you want, just put a little star next to it if you don't want to jump in and read it yourself. I'll know that maybe you're in a coffee shop or something and you want me to read it. And then lastly, we also have a Slido link where you can ask questions anonymously.

And our team here at Posit will share that into the chat in just a second. We'll keep posting it in there so that people can find it as well. It's great to see all the messages in the chat. We do share the recordings of each session to the Posit YouTube and our data science hangout site. So you can always go back and rewatch or share with a friend.

But with all of that, thank you so much for joining. It's so amazing to see this big group that we have here today. And thank you so much, David, for joining us as our featured leader. David Granjon is a senior full stack Shiny developer at Novartis and the founder and maintainer of the open source RinteRface organization, where he develops Shiny extensions like bs4Dash and shinyMobile. We also have Bo Wang, senior expert data science at Novartis, who's also joining us in the audience today. He'll be helping with any questions that come up on their data monitoring committees at Novartis as well.

Too many options, they didn't know where to click. It depends if you're already familiar with these kind of tools, you know, if you're familiar with web application or not, I would say.

Personal aspirations and career advice

But David, I'm curious, what is the best career advice you've ever received or advice that you've given?

That's a tough one. It's not technical advice I gave. It's just mindset advice, probably: never give up. It's hard today, it will be better tomorrow. You know, these kinds of things. Like Colleen said in the keynote, and I always go back to the keynote, there are days you want to quit, right? It's hard to make things work well using best practices. But in the long run, yeah, you learn a lot about yourself. You collaborate better with people. But yeah, I mean, for everything in life, it's hard. If you want to raise your kids, it's hard. Do sport, it's hard. So that's like a common point for everything. So believe in yourself and never give up.

CI/CD and deployment with Posit Connect

So David, you were talking a little bit about renv and using Connect for CI/CD sort of deployment. So the one thing with Connect is it's a pull. It's designed to pull things in. Sometimes that's different from an enterprise sort of scenario, where they're expecting everything to be packaged up and sent to the server. Did you use renv to perhaps address some of those challenges around that?

Yeah. So for instance, when we have your Shiny app that we want to send to Connect, first of all, we make sure it's using renv, right? We have frozen dependencies. Then we have the CI/CD pipeline. We have a specific section in the YAML file, which is restoring packages in the runner. And then you have some instructions to deploy the app on Posit Connect. So you need to use connectapi. So yeah, you need an API key and different things. I'm not going to go too much into details.

Posit Connect doesn't use renv. You know, when it restores the application, it uses packrat. But if you have frozen dependencies in the runner, it's much, much easier than if there are packages and you don't even know where they are coming from, and you try to just push everything, and then you have error messages everywhere in Posit Connect. So yeah, from time to time you still have problems, right? But they are significantly reduced.
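As a rough illustration of the "frozen dependencies" idea, a CI runner could inspect the lockfile before deploying and refuse to continue if a package has no pinned version or no known origin. The sketch below is not the pipeline described here. It assumes the lockfile is JSON with a top-level "Packages" object whose entries carry "Version" and "Source" fields, and it treats a "Source" of "unknown" as a red flag, which is an assumption about how such packages get recorded. The function name and the example data are hypothetical.

```python
import json


def unpinned_packages(lockfile_text):
    """Return names of packages lacking a version or a known source.

    Assumes a JSON lockfile with a top-level "Packages" mapping
    (an assumption made for this sketch).
    """
    lock = json.loads(lockfile_text)
    problems = []
    for name, record in sorted(lock.get("Packages", {}).items()):
        version = record.get("Version")
        source = record.get("Source", "unknown")
        # A missing version or an unknown origin means the dependency
        # is not really frozen, so flag it before any deployment step.
        if not version or source.lower() == "unknown":
            problems.append(name)
    return problems


# Hypothetical lockfile content, for illustration only.
example_lock = json.dumps({
    "R": {"Version": "4.3.1"},
    "Packages": {
        "shiny": {"Package": "shiny", "Version": "1.7.5",
                  "Source": "Repository", "Repository": "CRAN"},
        "internalpkg": {"Package": "internalpkg", "Version": "0.1.0",
                        "Source": "unknown"},
    },
})

if __name__ == "__main__":
    print(unpinned_packages(example_lock))
```

A pipeline step could fail the build whenever the returned list is non-empty, so that problems surface in the runner rather than as restore errors on the server.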

Shiny modules as reusable building blocks

Super impressed by what's happening at Novartis where you've built essentially the, the data science team have built these Lego bricks for modules for building Shiny apps. Then other people who want to build Shiny apps aren't starting from a blank sheet, but they're kind of reusing those Lego pieces. Do you want to talk us through that? Cause I, I think that's a novel approach.

So Mike, when you talk about the Lego pieces, are you talking about data pipeline presentation from.

Yeah. We had some presentation at our, some years ago where, yeah, there are some modules to develop visualizations for clinical trials. And then you can combine these modules together to create your application. So you don't need to, you know, develop them yourself.

So the modules we, I don't know how many modules we have, but we probably have like 30, 42 modules, which you can basically plug into your app. And then depending on what you want to do, if you're going to do like safety explorer, yeah, patient profile, whatever. Yeah. They're very still. And in that case, people need to dive into the code. And at some point that's inevitable, but for sure, that's a big change for us. The main challenge is also like making rules, modules, following best practices in terms of, like, software development.

You know, we are not even 10 in our team. So that's a lot on our shoulders. As Colleen said before, you know, from time to time you hear people who are not super satisfied. But in the meantime, you also have people that are really thankful for what you are doing. So that's what's so rewarding as a developer.

Tracking app usage with the connectapi

I wanted to ask you about the users. Because the beauty of Shiny is that we can build apps very fast, in small iterations, and we can keep getting feedback and upgrading them. And this makes it like a bleeding-edge kind of software that we can provide to the users. And I know that you're also sure that it is not that easy to get feedback from the users directly. So I wanted to hear a little bit more about it. How do you cope with this?

So, yeah, just to know whether your applications are properly used, or, you know, these kinds of things, right? And how people are using the applications, typically.

So, I'll be honest, this is something we will start to do at Novartis. I did it in the open source, but I'm really talking about what we are doing at Novartis. So what we have is the tracker in Posit Connect, which is basically an application that we developed last year, when I was in Japan. So, you know, when you travel, you can do many things. And yeah, this basically queries the API to get all applications. For each application you get who is accessing the application and for how much time, which basically gives you some insights. So when you develop an application for several teams, you can have some insight into which team is using this application the most, and how you could target your support. Like if you see one team is using the application 20 times more than another one, maybe invest more people to provide support here.

You can apply it to all applications that are deployed. That's not something you need to add to the application. And that's important. That's something that works automatically as soon as the app is on the server.
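The kind of summary described here (who uses which application, and for how long, rolled up per team) could be sketched as below. This is not the Novartis tracker. It only shows the aggregation step on records that have already been fetched. The field names (content_guid, user_guid, started, ended) are assumptions about the shape of the usage records, and the user-to-team mapping is hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def summarize_usage(records, team_of_user):
    """Count sessions and total minutes per (application, team)."""
    summary = defaultdict(lambda: {"sessions": 0, "minutes": 0.0})
    for rec in records:
        if rec.get("ended") is None:
            continue  # session still open, so no duration yet
        started = datetime.fromisoformat(rec["started"])
        ended = datetime.fromisoformat(rec["ended"])
        # Users missing from the mapping are grouped under "unknown".
        team = team_of_user.get(rec["user_guid"], "unknown")
        entry = summary[(rec["content_guid"], team)]
        entry["sessions"] += 1
        entry["minutes"] += (ended - started).total_seconds() / 60
    return dict(summary)


# Hypothetical usage records and team mapping, for illustration only.
records = [
    {"content_guid": "app-1", "user_guid": "u1",
     "started": "2023-04-20T10:00:00", "ended": "2023-04-20T10:30:00"},
    {"content_guid": "app-1", "user_guid": "u2",
     "started": "2023-04-20T11:00:00", "ended": "2023-04-20T11:15:00"},
    {"content_guid": "app-1", "user_guid": "u1",
     "started": "2023-04-21T09:00:00", "ended": "2023-04-21T09:10:00"},
    {"content_guid": "app-1", "user_guid": "u3",
     "started": "2023-04-21T09:00:00", "ended": None},
]
teams = {"u1": "biostatistics", "u2": "safety"}

if __name__ == "__main__":
    for key, stats in sorted(summarize_usage(records, teams).items()):
        print(key, stats)
```

Because the input comes from the server rather than from the apps, nothing has to be added to each application, which matches the point made above.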

And on the other side, if you want to know how people are using the app, so like the in-app usage, you may know, like, shinyHeatmap. That started from an experiment I made around the JS library. Because in the past, I was using ... to track on my website. And one day, yeah, I was just wondering why, why should I just use ... So I tried the tracking, to put it in the ... And then I couldn't see anything. It didn't work. And I contacted the support. And they told me, yes, it won't work, because that's quite specific technology. And then I took and wrote some very simple code, put it in a package. Yeah, the shinyHeatmap. So basically each time you click in your app, it will record the X and Y location and then aggregate the data at the end of the day. So you can see where people are clicking. For instance, you can identify some dead zones in the application, which will help you to refactor the design at some point. You know, if you realize there is a path that nobody visits, maybe that's because it's badly designed.
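The record-then-aggregate idea can be illustrated with a small grid: each click's X and Y location is dropped into a cell, and cells that never receive a click are candidate dead zones. This is a minimal sketch of the general technique, not the package's implementation. The function names, grid size, and sample clicks are all hypothetical.

```python
def bin_clicks(clicks, width, height, cell):
    """Count clicks per grid cell for a page of width x height pixels."""
    cols = -(-width // cell)   # ceiling division
    rows = -(-height // cell)
    grid = [[0] * cols for _ in range(rows)]
    for x, y in clicks:
        # Ignore clicks recorded outside the page area.
        if 0 <= x < width and 0 <= y < height:
            grid[int(y // cell)][int(x // cell)] += 1
    return grid


def dead_zones(grid):
    """Return (row, col) of every cell that received no clicks."""
    return [(r, c)
            for r, row in enumerate(grid)
            for c, count in enumerate(row)
            if count == 0]


# Hypothetical click log: (x, y) pixel positions.
clicks = [(10, 20), (50, 60), (250, 150), (120, 30), (999, 10)]

if __name__ == "__main__":
    grid = bin_clicks(clicks, width=300, height=200, cell=100)
    print(grid)
    print(dead_zones(grid))
```

With a real click log, the empty cells would point at regions of the interface that may deserve a design review.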
