Regina Lionheart - Making Waves with R, Python, and Quarto

Transcript

This transcript was generated automatically and may contain errors.

Hey, thank you everyone so much for coming to see my talk, and let's dive right in. That will be my only water-related pun for this talk.

Okay, so how would you build a sandcastle? You could make it really simple, you could make it really complex, lots of turrets and all kinds of interesting things on it, but all sandcastles have something in common. Eventually, the waves are going to get them.

So how can we give ourselves some extra time to make our sandcastles last? We could add a moat that'll buy us some time as the tide comes up. Materials matter as well. If we use loose, dry sand, that is not going to last as long as if we use something a bit sturdier, like these kids may have used concrete for theirs. But how do we know what's going to happen to our sandcastle? How can we provide something that is going to protect it from all of the many things that can happen to sand-based structures on water?

And I did that with a single script and a single click out of those 3,024 extractions.

So zooming in on this image, you can see the red dot on that right hand side was an area of high erosion risk. And so we extracted data from that point and then we could start plotting it and asking questions about it.

So the existing dredged surface, this is energy dissipation as wave height increases. So you can see that as time goes up, the wave height is increasing and the dissipation itself is increasing. So small waves do not carry a lot of energy. You can't really transport anything on them and you're not really going to face much of an erosion risk. But as those waves grow in height, they are transporting a lot of energy and that energy is being imparted into the sediments resulting in erosion.
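The link between wave height and energy can be made concrete with the textbook linear-wave-theory relation, in which energy per unit sea-surface area scales with the square of wave height: E = ρ g H² / 8. The sketch below is only that standard relation, not the dissipation output of the model shown in the talk; the seawater density and the sample wave heights are assumed values for illustration.

```python
# Textbook linear wave theory: energy density grows with the square of wave height.
# This is NOT the talk's hydrodynamic model; density and heights are assumed values.

RHO_SEAWATER = 1025.0  # kg/m^3, a typical seawater density (assumption)
G = 9.81               # m/s^2, gravitational acceleration


def wave_energy_density(height_m: float) -> float:
    """Return wave energy per unit sea-surface area (J/m^2) for wave height H."""
    return RHO_SEAWATER * G * height_m ** 2 / 8.0


# Hypothetical wave heights, chosen only to show the scaling.
for h in (0.25, 0.5, 1.0, 2.0):
    print(f"H = {h:4.2f} m -> E = {wave_energy_density(h):8.1f} J/m^2")
```

Doubling the wave height quadruples the energy available to move sediment, which is why the larger waves dominate the erosion risk.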

So we knew that wave height was a problem. But what happens if we restore the surface? Now with a naturally restored and filled surface created by our engineers, we can see that with increasing wave height that dissipation itself is way lower. So we have some data now about what restoration we might be able to do.

And we can do that across any variable. That was for energy dissipation, but we can also do wave steepness. Wave steepness is what I was talking about earlier. As that wave starts to break, it trips over itself and high steep waves impart a lot of energy. So you can see on that left hand side, it's a little dark, but that dark level and the dark color is indicating that the natural breaking of waves as they hit the shoreline wasn't happening over that dredged area. The waves never felt the bottom. So instead, that high dissipation energy was happening right on shore. But when we restored the conditions, that breaking of energy is being dissipated across the whole beach and we are not having this high energy spot where the waves are breaking and becoming really steep.
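Wave steepness is commonly defined as wave height divided by wavelength. A minimal sketch follows, assuming the deep-water wavelength relation L = g T² / (2π) and the usual deep-water rule of thumb that waves become unstable near a steepness of about 1/7. Close to shore, breaking is controlled by water depth instead, so this is only an illustration of the quantity, not of the model in the talk; the wave period and heights are hypothetical.

```python
import math

G = 9.81                      # m/s^2, gravitational acceleration
STEEPNESS_LIMIT = 1.0 / 7.0   # deep-water rule-of-thumb breaking limit


def deep_water_wavelength(period_s: float) -> float:
    """Deep-water wavelength (m) from wave period, L = g * T^2 / (2 * pi)."""
    return G * period_s ** 2 / (2.0 * math.pi)


def steepness(height_m: float, wavelength_m: float) -> float:
    """Wave steepness: height divided by wavelength (dimensionless)."""
    return height_m / wavelength_m


period = 4.0  # seconds, hypothetical wind-wave period
wavelength = deep_water_wavelength(period)
for h in (0.5, 1.5, 4.0):  # hypothetical wave heights in metres
    s = steepness(h, wavelength)
    label = "over the limit" if s > STEEPNESS_LIMIT else "stable"
    print(f"H = {h} m, L = {wavelength:.1f} m, steepness = {s:.3f} ({label})")
```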

Combining R and Python for analysis

So this is great and all, but I'm fundamentally an R user and I needed the power of ggplot as well. So Python helped me leverage that data from a spatial approach to a more temporal approach and just more flexibility of graphs altogether.

There's Clayton Beach zooming in on that spot. These are two transects that we extracted so we can replace time on the x-axis with geography on the x-axis. And we can see snapshots of time on a spatial approach.

So ggplot, which I love, this is those two profiles that we were looking at. So on the left-hand side, you can see the bathymetry of that dredged trench from those two extracted profiles. And that's showing offshore dredged. When we fill in that offshore dredged with our engineering surfaces, we can see now that the bathymetry is a lot more natural, like how a normal beach would look.

And now across those profiles, we can again extract anything we want. So this is energy dissipation again, and there's our two profiles. On the top set of graphs here, we have the existing beach surface. So you can see that that high peak of energy dissipation is happening really close to shore because on the x-axis, we have distance from shore. So right on those sensitive onshore structures, we're getting a really high energy dissipation. On the bottom, when we fill in the restored beach surface, that energy is shifted offshore. So we can never destroy the energy, but we can move it somewhere else.
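The comparison described here, finding where dissipation peaks along a transect and checking how far that peak sits from shore, could be sketched as follows. The function name `peak_location` and all of the numbers are hypothetical; the values are made up for illustration and are not output from the Clayton Beach model.

```python
def peak_location(distance_m, dissipation):
    """Return (distance, value) at the largest dissipation along one transect."""
    if not distance_m or len(distance_m) != len(dissipation):
        raise ValueError("distance and dissipation must be non-empty and equal length")
    # Index of the maximum dissipation value along the profile.
    i = max(range(len(dissipation)), key=dissipation.__getitem__)
    return distance_m[i], dissipation[i]


# Illustrative values only (not model results): distance from shore in metres,
# dissipation in arbitrary units for the two surfaces.
distance = [0, 10, 20, 30, 40, 50, 60]
existing = [0.2, 3.1, 1.4, 0.6, 0.3, 0.2, 0.1]   # peak close to shore
restored = [0.1, 0.4, 0.7, 1.1, 1.6, 0.9, 0.3]   # peak shifted offshore

for name, profile in (("existing", existing), ("restored", restored)):
    where, value = peak_location(distance, profile)
    print(f"{name}: peak dissipation {value} at {where} m from shore")
```

In this toy data the peak both moves offshore and gets lower, which mirrors the qualitative pattern described for the restored surface.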

And that vertical black line that you're seeing is a reference point for those rocks from the image earlier, that old sidecast fill. So we can just actually see that the energy dissipation is being moved away. And this was the selling image for our client to convince them that natural restoration is an option.

Making results accessible with Quarto

Lastly, these kinds of results need to be accessible to a lot of different kinds of people. We have engineers, coastal scientists, government officials, and the coastal communities who are actually dealing with this shoreline change. So Quarto ended up being the perfect tool for that. I was able to use my R chunks and my Python chunks in addition to all of that extra scientific, hydrodynamic information that you need to have to give this kind of work context.

So it went from that raw version to a nicely rendered sheet that I could pretty much render indefinitely for multiple different audiences with very few clicks. And we were able to get a single Quarto document that had all of the information that we needed for any audience that wanted to look at it.
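One way to render a single Quarto document for several audiences is to pass a parameter on the command line with `quarto render ... -P name:value`. The sketch below only builds the commands; the file name `shoreline_report.qmd`, the `audience` parameter, and the audience names are hypothetical and would need to match parameters actually declared in the document. It is not necessarily how the report in the talk was set up.

```python
from pathlib import Path


def build_render_command(qmd_file: str, audience: str, fmt: str = "html") -> list:
    """Build a `quarto render` command for one audience (hypothetical parameter)."""
    output_name = f"{Path(qmd_file).stem}-{audience}.{fmt}"
    return [
        "quarto", "render", qmd_file,
        "--to", fmt,
        "-P", f"audience:{audience}",   # parameter must be declared in the document
        "--output", output_name,
    ]


# Hypothetical audiences, loosely based on the groups named in the talk.
for audience in ("engineers", "officials", "community"):
    print(" ".join(build_render_command("shoreline_report.qmd", audience)))
```

Each printed command can be run in a shell, or passed to `subprocess.run(cmd, check=True)` on a machine where Quarto is installed.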

And everything is looking bright for this field of research because Python is now producing packages that can handle pre-processing as well as post-processing. This is output from a package that can render the computational grid automatically instead of doing it by hand, and making that grid by hand is one of the most time-consuming parts of hydrodynamic modeling. So these are tools to protect our shorelines for the next generation.

This is my daughter, Nadia, making a cameo in this. Introducing her to shoreline work early. But it just makes this previously inaccessible type of work more accessible to lots of different kinds of people and saves us a ton of time as well.

So thank you so much for listening to this. This is my LinkedIn if you have any questions. And I would love to talk to you guys more. Thank you so much for listening to my talk.

Q&A

All right, a couple of questions. How did you become interested in this data or in this field? And how do you see this work developing in the future?

So my background is in biochemical oceanography and metabolomics in a lab-based setting. And I honestly wanted to spend less time at sea. So I moved to a more coastal setting. And I've always been really interested in wave dynamics and shoreline protection from a natural perspective, watching the shorelines change over Puget Sound for the time that I've lived here.

So as for where I see this going, I just think that the packages that accompany this kind of modeling are just exploding. It's amazing what people are doing. And some classic packages like xarray and the tidyverse are playing really nicely with some of the new ones that are doing really complex pre- and post-processing steps for hydrodynamic modeling. So I think there's a bright future for this kind of stuff.

I guess a follow-up to that is how much of this type of work needs to be done by domain experts? And how much can be done by a new data science graduate?

I think a lot of the post-processing is accessible to pretty much anyone who has a decent understanding of data structures and how they work. The modeling and the pre-processing require a lot of domain expertise to understand where to pull out data for erosion risk areas, and an understanding of sediment transport to use your resources and the cost of modeling effectively. This is a very small model. This model took maybe 10 minutes to run. I've worked on another that took six hours to run because it uses 3D dynamics and water shearing over time. So some of those pre-processing steps require domain expertise. But as for a lot of the analysis and the images that I was showing up there, I'm relatively new to R and Python. And I think you guys are probably more experienced. So I think it's really accessible.

I think this is the last question. How far in advance do engineers need to interfere with issues like this? And how do they keep up with all the different problems along the shore?

That is often up to the clients. The eternal problem with shoreline armoring is that erosion is a natural process, especially here in Puget Sound because of our sedimentary rock formations. So there is always a trade-off between natural restoration and what we call hard armoring or bulkhead installation. And engineers have to weigh the pros and cons of hard armoring. One of the effects of hard armoring is the destruction of the habitat that feeds salmon, which has a really big effect on the salmon industry. So there are a lot of people and opinions at play. But in general, I would say it depends on the location. So anywhere from about a year to, if you're in the North Cove location, days to think about what you're going to do before the water starts reaching your house.

All right. Yeah, that's all of our questions. Let's thank Regina one more time.

Featured software

Quarto