Richard Iannone - Great Tables for Everyone | SciPy 2024

Transcript

This transcript was generated automatically and may contain errors.

Thank you. So my talk is Great Tables for Everyone, because I'm talking about the package Great Tables. Incidentally, I've got three goals for this talk: make you like tables by the end of the talk, especially Great Tables tables (the package, and the tables you make with it), and convince you of their goodness in science. We can use Great Tables for science. We're at SciPy after all, so I want to show you some examples of that.

Okay, to start, why did we make Great Tables? We did it to enable the building of tables, obviously, but it's also focused purely on the display of tables. If you've ever had to share a table with somebody, you want to make it look good, you want to make it presentable. So that's the whole idea here. So again, this is not the only approach to table making in Python, but what we have is something that's comprehensive, it's actively developed, and it's really great for science-y tables, as you'll soon see.

And then there's a bit of confusion with tables, right? What are we talking about, really? We can give them other sort of alternate names, like display tables, summary tables, presentation tables. These are just kind of like tables you want to show to people that don't look terrible. That's the idea.

So what I want to see in the world is less of the stuff on the left, which is basically just a DataFrame printout in a console, and more of the table on the right, which looks super nice. It has a title, and pretty much everything is nicely formatted. So that's the world I want to live in.

Why not just use a DataFrame or Excel?

So how do we make tables today? Okay. So we could do this. Raw DataFrame, give it to somebody, maybe it's fine, probably it isn't. I don't recommend it. I wouldn't do it. Because you can't even see all the data, for one thing. And it looks kind of like not so great.

Another idea. This is something I'm guilty of. Take your data, dump it to a CSV, and then bring it to Excel, make your table there, and then bring it back to wherever you need to bring it back to. That's okay. But the problem is your reproducible workflow, if you want one, is now broken. So terrible. I don't like it.

What I'm recommending, the whole purpose of this talk, is to use Great Tables, make really great tables entirely in Python. So it's reproducible. Probably less effort once you learn how to use Great Tables. And the tables, they look good. I'll show you more good-looking tables soon.

And the cool thing about nanoplots is that you can see right away that, for this patient, the white blood cell count increased beyond the normal range. And that's super important. If you just had numbers, the time to that insight would probably be a little bit longer.

And nanoplots, of course, you can probably think of a million ways to use them. Here's just three more examples. Stock prices. Weather data. Not too dissimilar from what you've already seen. And some sales data based on pizza sales.