Resources

Saumiitha Leelakrishnan - Partnering with Posit for progress on Environmental Stewardship

video
Oct 31, 2024
20:09

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Good afternoon, everyone. My name is Saumitha Leelakrishnan. I'm a technical specialist on the Global Emissions Center of Excellence team at Cummins.

I'm here to share with you a story, a journey involving a very interesting example of wider adoption of R for business applications. And this story, surprisingly, starts with a deep breath.

If all of you could take a deep breath with me, if that's OK?

Every day, we are breathing, and we need fresh and clean air. We at Cummins recognize it. Cummins leads and invests in innovative data-driven solutions to enhance environmental stewardship through emissions analysis and sustainable practices.

Cummins, a Fortune 120 company with a 100-year history, historically a diesel engine manufacturer, has embraced stricter emission standards over the years and has now broadened its portfolio to include diverse energy sources. The company's investment in clean diesel, natural gas, and now electrified products and hydrogen and fuel cell technology is commendable.

So in this talk, I'll be sharing how partnering with Posit has helped the global product compliance team at Cummins deliver solutions for our customers that are safe and lead to a cleaner environment. Even if you have a different data set, some of the lessons that I've learned in emissions analysis and environmental stewardship will definitely improve your workflow. So let's dive in.

Step one: grasp your needs

So step one, grasp your needs. For all of you in this room to understand the needs of this specific project, I would have to give a picture of what our team does and what the data is. We are a functional excellence team responsible for creating tools and processes for onboard diagnostics and emissions compliance data.

So what is onboard diagnostics? They are systems and algorithms to detect failures that adversely affect engine emissions. Within that data, one metric is very prominent: the in-use monitor performance ratio (IUMPR), which allows us to verify whether the diagnostics are actively making decisions.
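The idea behind the in-use monitor performance ratio can be sketched very simply: the numerator counts the times a diagnostic monitor completed its check, and the denominator counts the drive cycles where conditions allowed it to run. The function and numbers below are illustrative only, not Cummins' actual computation.

```python
def iumpr(monitor_completions: int, eligible_drive_cycles: int) -> float:
    """Ratio of completed monitor checks to eligible drive cycles.

    A low ratio suggests the monitor rarely got a chance to run
    (or rarely completed), which is what the engineers investigate.
    """
    if eligible_drive_cycles == 0:
        return 0.0
    return monitor_completions / eligible_drive_cycles

# Hypothetical engine whose monitor completed 12 times over 80 eligible cycles:
ratio = iumpr(12, 80)  # 0.15
```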

So this onboard diagnostic data, along with a huge amount of engineering data that gets collected from the engine units out there in the real world through telematics and data logger devices, gets uploaded to the cloud.

So I utilized extract, transform, load to begin with. For some of you here who are wondering about the ETL term, it is the process of combining data from multiple sources and moving it into a large centralized repository, often called a data warehouse. ETL uses a set of logical rules to clean and organize the data and prepare it for storage, data handling, data analytics, and machine learning.

So why did we use ETL? High-volume data sets necessitate ETL for the telematics and data logger data collected from the various engine units, which goes to the cloud. The onboard diagnostics data has multiple disparate sources: a data lake, Power BI models, a relational data dictionary, Excel, and many more. Data cleaning and enrichment using ETL in this example means picking the latest image data for each engine unit and associating it with its equipment manufacturer's details.
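The cleaning-and-enrichment step described above ("latest record per engine unit, joined to manufacturer details") can be sketched in a few lines. All field names and values here are made up for illustration; the real pipeline runs at much larger scale on Databricks.

```python
from operator import itemgetter

# Hypothetical raw uploads: multiple records per engine serial number (ESN).
raw_records = [
    {"esn": "E100", "uploaded": "2024-09-01", "idle_pct": 31.0},
    {"esn": "E100", "uploaded": "2024-10-01", "idle_pct": 28.5},
    {"esn": "E200", "uploaded": "2024-10-01", "idle_pct": 12.0},
]
# Hypothetical equipment-manufacturer (OEM) lookup.
oem_details = {"E100": "Acme Trucks", "E200": "Metro Coach"}

# Keep only the latest upload for each engine serial number:
# iterating in upload order means the newest record overwrites older ones.
latest = {}
for rec in sorted(raw_records, key=itemgetter("uploaded")):
    latest[rec["esn"]] = rec

# Enrich each surviving record with its manufacturer details.
enriched = [
    {**rec, "oem": oem_details.get(esn, "unknown")}
    for esn, rec in latest.items()
]
```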

So now that I've identified and utilized ETL, the next step is to move this data to a centralized repository. To achieve this step, I worked with the analytics engineering team to identify and set up Databricks jobs to automate the execution of the data processing.

So the next step is to read this data from the Delta Lake, which is an open-source storage layer. For that, I utilized PySpark, which is an API for Apache Spark, to read the data from the Delta Lake and move it to a centralized repository, which in this case was a SQL database.

Because it is relational data. This relational data is organized into multiple engine platforms, like light duty, medium duty, and heavy duty. As you see, it's further segmented into model years. Different model years have their own engine families, which are grouped based on fuel type, transmission, et cetera. Each engine family can be applicable to various applications, be it a truck, shuttle bus, motor coach, RV, street sweeper, ambulance, any number of applications. And each application has numerous engine serial numbers.
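The hierarchy described above (platform → model year → engine family → application → engine serial numbers) can be pictured as a nested structure. All names here are invented for the example; the real data lives in normalized SQL tables.

```python
# Illustrative shape of the relational hierarchy.
fleet = {
    "heavy_duty": {
        2024: {
            "HD-X15-DSL": {                      # engine family (fuel type, etc.)
                "truck": ["ESN-0001", "ESN-0002"],
                "RV":    ["ESN-0003"],
            },
        },
    },
}

def serials_for(platform, model_year, family, application):
    """Walk the hierarchy; return [] when any level is missing."""
    return (
        fleet.get(platform, {})
             .get(model_year, {})
             .get(family, {})
             .get(application, [])
    )
```

In SQL this becomes a chain of foreign keys (serial number → application → engine family → model year → platform), which is why a relational database was the natural landing zone.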

Step two: use your needs as a compass

So what were the primary needs? We had to move from merely describing the data to making inferences and drawing conclusions, and secondly, to embrace machine learning algorithms to extract meaningful insights. With these primary needs in mind, the idea was to build a full-blown R Shiny web application and host it on Posit Connect so that the 600-plus engineers who have to look into this data globally would be able to utilize it.

So the web application was built. It was first able to highlight the applications that required attention, then show the population distribution of the engine serial numbers based on application type, duty cycle, equipment manufacturer details, and so on, along with multiple interactive visualizations and an export that works offline as well.

Advancing with machine learning

So now we have a web application. Engineers can use it. They can derive insights from it. They have action items to work on. So what is the next step? Is that all there is? No, we would have to advance and adapt.

So in this example, for the engine serial numbers that have been identified as presumably not having accumulated enough of that IUMPR ratio I mentioned earlier, you need some key attributes, like duty cycle and application, so that the engineering teams can go and analyze the data. So how do I find the duty cycle? First, what is a duty cycle? A duty cycle is the history of speed and load conditions over which an engine is operating over a period of time for any selected application.

So I wanted to find the duty cycle with the engineering data I had in hand. In that case, I jumped into using the simplest and most popular unsupervised machine learning model, which is k-means clustering, and I utilized a Python library for it. I worked with subject matter experts to identify five engine features, based on their characteristics, to determine the duty cycle: idle percent, torque curve percent, motoring percent, key cycles per hour, and drive cycle average percent.

So once I identified these engine features, here is how I used them in the model. The k-means clustering model works like this: it first randomly assigns each data point to one of your clusters, then computes the centroid of each cluster. Then each data point is reassigned to the nearest centroid. This keeps going until the cluster assignments stop changing. Thus, finally, you're able to identify the duty cycle of each of the engine serial numbers out there.
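The assign-recompute-repeat loop described above can be written in a few dependency-free lines. In practice a library implementation (for example, scikit-learn's KMeans) would be run on the five engine features; this sketch, with made-up 2-D points and fixed starting centroids, just shows the mechanics.

```python
def kmeans(points, centroids, max_iter=100):
    """Plain k-means: iterate assignment and centroid updates to convergence."""
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(len(centroids)),
                key=lambda c: sum((p - q) ** 2 for p, q in zip(pt, centroids[c])))
            for pt in points
        ]
        # Update step: each centroid becomes the mean of its assigned points.
        new_centroids = []
        for c in range(len(centroids)):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster put
        if new_centroids == centroids:  # converged: no assignments changed
            break
        centroids = new_centroids
    return labels, centroids

# Two obvious groups of points and two starting centroids:
pts = [(1.0, 1.0), (1.2, 0.8), (8.0, 8.0), (8.2, 7.9)]
labels, centers = kmeans(pts, [(0.0, 0.0), (9.0, 9.0)])
# labels -> [0, 0, 1, 1]
```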

That gives an insight into the conditions under which those engines were not able to accumulate enough of the IUMPR ratio.

Generating and analyzing reports with Quarto

So the predictive modeling is complete, and we have some actionable insights. Is that all there is, or is our compass taking us further? Looks like it is, because the next step is to generate and analyze reports. Proactive reporting anticipates issues and opportunities in a timely manner so that you can make informed decisions.

So in this case, I utilized Quarto to generate monthly and quarterly reports for each engine family, highlighting the applications and the duty cycle for which the engine units were not able to accumulate enough of the IUMPR ratio we were talking about earlier. So Quarto came in very handy, and these are some of the highlight packages I was able to utilize with Quarto: knitr, so that you can integrate your code, text, and graphs, because the report had not just text but tables, graphs, and interactive visualizations; kableExtra, to bring in pretty tables; and webshot, for the interactive HTML widgets.

And so that these reports were not static, you could go back to the Posit Connect app that we had published earlier, which has many details, whereas this report is something high level, right? So the engineers and the product compliance managers have a way to interact directly with your app and through your report as well.

So why did we use Quarto? Quarto seamlessly integrates code, text, and graphs. Quarto's batch processing automates the generation of multiple reports seamlessly, because at any single point in time you have a minimum of 60 different engine families that you have to generate reports for. It smoothly integrates with Git for version control, and it enables automated report generation.
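Batch-rendering one report per engine family typically comes down to looping over the families and invoking `quarto render` with a parameter per run. The template name, parameter name, and family names below are hypothetical; the sketch only builds the command lines without executing them.

```python
# Hypothetical engine families (around 60 in practice).
engine_families = ["HD-X15-DSL", "MD-B6.7-DSL", "LD-R2.8-DSL"]

# One `quarto render` invocation per family: -P passes an execute
# parameter into the .qmd, --output names the rendered PDF.
commands = [
    ["quarto", "render", "iumpr_report.qmd",
     "-P", f"engine_family:{family}",
     "--output", f"iumpr_{family}.pdf"]
    for family in engine_families
]
# Each command could then be run with subprocess.run(cmd, check=True).
```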

Monitor and maintain

So now we come to the last, but not the least, step as we wrap up this whole journey and pipeline: monitor and maintain. Only once we are able to set up this process to go on seamlessly and automatically will we be able to reap the best of the benefits.

So a monthly and quarterly data refresh was set up using ETL jobs and scripts so that the reports get generated monthly and quarterly. The reports included average IUMPR computations for each engine family based on duty cycle, application, equipment manufacturer details, and so on. And the report included clear actions directed to the engineering teams who are working on fine-tuning the system performance of the engines, so that you can reduce emissions. These were generated as PDFs and made available in the Posit Connect apps and also in a secure SharePoint location.

Data reflections and closing thoughts

And this is a glimpse of the whole journey. By embracing the right tools and technologies, you will be able to build sustainable solutions.

So I also wanted to share my data reflections on this journey. Take the plunge: stepping out of the ordinary and taking bold initiatives drives impactful results and growth. Bring the productive synergy: integrating the best of tools and methodologies, be it R, Python, Databricks, SQL, Quarto, and many more, creates comprehensive solutions. Proactive automation is paramount: as demonstrated by this Quarto report example, it seamlessly builds efficiency into your process.

I hope this talk will benefit the data science community in understanding a real-world example of how to harness the power of both R and Python when you are faced with solving a challenging problem, and how to drive innovation within your organization. Whether you are a data scientist, data engineer, developer, or researcher, some of the lessons that I've learned and shared with you will definitely help your workflow.

And I would like to give thanks for the partnership with Posit, because partnering with Posit has helped the global product compliance team at Cummins deliver solutions for our customers that are safe and lead to a cleaner environment. A special thanks to our Global Emissions Center of Excellence team, Product Compliance and Regulatory Affairs at Cummins, and a big shout-out to the brilliant brains at Posit who have been continuously supporting us. And last but not least, thank you, everyone, for your participation and attention today. Thank you.

Q&A

Thank you, Saumitha. We do have a couple of questions. By the way, this was outstanding.

So what tools do you use to move between R and Python? For example, Reticulate or something else?

No, though going forward, I might. I was kind of convinced at this conf by some of the tools and packages, even thinking about Posit bringing an integration of both worlds, right? But in this case, I had to use Visual Studio Code for anything in Python, and I directly used Posit Workbench for anything in R. And for Quarto, I was using RStudio. So I was completely convinced by Posit yesterday, because, as you see, it's complicated; there are so many things you have to do just to achieve some results. But then the results are sustainable, right? So I went into three different environments, basically, for this project. But going forward, I think I'm going to embrace a few things, for example, Positron and others.

Did the results from the k-means analysis that you did, did that end up in Connect at one point?

Yes, yes. From the k-means data, we were able to identify the duty cycle for each engine serial number. I was taking that data back to SQL itself and relationally connecting it with all of the engine serial numbers so that, in real time, the Posit Connect app can read that data back. And this is real time. The data refresh is monthly, given the amount of data we had to get from the cloud, but anything that is up to date in the back end is live in the Posit Connect app.

There's another question here. How does the Posit ecosystem help with data transparency?

Data transparency. I'm also struggling a bit with what that would mean. So, the Posit ecosystem and data transparency.

The Posit ecosystem was very compatible with bringing a crystal-clear idea to the respective people, in terms of the engineering insights, the regulatory insights, or even to the decision makers, on what this data says. So I did not experience any gaps there, at least. Posit was able to deliver what I envisioned, and even what the regulators and the product compliance team envisioned, very transparently to the whole engineering community.

Well, thank you so much. And that's all the time we have for questions. I appreciate it. Yeah, thank you, everyone.