Resources

Aleksander Dietrichson - AI for Gaming: How I Built a Bot to Play a Video-Game with R and Python

video
Oct 31, 2024
19:34

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Thank you. So I'll be speaking about AI for gaming. What you see here is a piece of software called Gatai playing a video game called Christmas 3. This is from one of our training sessions, when we tried to do this by brute force, something that didn't work. We'll get back to why shortly.

First, a little bit of history. Gato is a web portal for game creators. When they approached me about this project, Gato became Gato AI, which turned into Gatai, and then Gatai the AI. Gato means cat in Spanish, and I live and work in Argentina, which is why Gatai is a feline, as you can see here. That stuck throughout the project.

Christmas 3 is a puzzle game that you can play online; you can look it up and play it directly in the browser. The object of the game is to move any of these tiles, either vertically or horizontally, so that they line up in groups of three, four, or five, and you get different scores based on that. So it's not a very complex game. These would be two valid moves to make on this board. There are 10 different types of tiles, and the board is 10 by 6.

The state space problem

That is why the brute-force approach will not work: with 10 tile types on 60 cells, you are looking at 10 to the 60th possible combinations. So we had to look for strategies to reduce this. This is what is called the state space, the total number of possible configurations of the board, and it's an important concept in reinforcement learning, which is what we're going to use to play this game.

To put this number in context: it's more than the number of atoms on Earth, though less than in the observable universe. And it's actually quite a bit bigger than the state space of chess, which surprised me when I looked it up. We can talk about why afterwards, but it surprised me: this game is more complex than chess when you just look at the state spaces.


So what is a state? It's a configuration of the board. We use reinforcement learning with an algorithm called Q-learning, which works in terms of a state space, an action space, a reward, which is essentially the score, and the resulting state. The state is what is on the board; the action space is what you can do with it; the reward is the score; and the resulting state is what the board looks like once you have made your move. These go into an update formula, and the algorithm can converge and then essentially solve the game.
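To make the state/action/reward/resulting-state loop concrete, here is a minimal sketch of tabular Q-learning in Python. This is an illustration of the general algorithm, not the talk's actual R implementation; the hyperparameter values and the state/action representations are assumptions.

```python
import random
from collections import defaultdict

ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration rate

Q = defaultdict(float)  # maps (state, action) -> estimated long-run value

def q_update(state, action, reward, next_state, next_actions):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max((Q[(next_state, a)] for a in next_actions), default=0.0)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])

def choose_action(state, actions):
    """Epsilon-greedy policy: mostly exploit the best known move, sometimes explore."""
    if random.random() < EPSILON:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])
```

With enough plays, repeated `q_update` calls are what make the value estimates converge so the policy can "solve" the game.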

Reducing the state space

So we had to reduce the state space, and we had three strategies for that. The first was to play one tile type at a time: do the stars first, then the red balls, then the little candy sticks, and so on. I've arbitrarily labeled the types A through G on the board here. This reduces the space to 2 to the 60th, which is still far too much for a brute-force approach, but several orders of magnitude less. It's also quite convenient if you're going to program it, because the board for one type is a binary number that you can pass through functions and have fun with.
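The "binary number" observation can be sketched as a bitboard: once you only care about one tile type, each of the 60 cells is a single bit, and the whole configuration fits in one integer. The function names and board representation here are hypothetical, not from the talk's code.

```python
ROWS, COLS = 6, 10  # the 10-by-6 board

def encode(board, tile_type):
    """board: ROWS lists of COLS tile labels. Returns a 60-bit int where each
    set bit means 'the chosen tile type occupies this cell'."""
    bits = 0
    for r in range(ROWS):
        for c in range(COLS):
            if board[r][c] == tile_type:
                bits |= 1 << (r * COLS + c)
    return bits

def has_tile(bits, r, c):
    """Check one cell of the encoded single-type board."""
    return bool(bits >> (r * COLS + c) & 1)
```

An encoded board compares, hashes, and copies as cheaply as any integer, which is handy when it becomes the key of a Q-table.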

The second strategy, probably the most important one, is windowing: if you create a small sub-board, say 3 by 2, you can slide it across the bigger board. If you know how to play the pattern on the left, you can use that on any part of the bigger board. The third is transposition: because the game is symmetrical, if you know how to play the position on the left, you can also play the position on the right. Together these reduce the space quite a bit, and we can take advantage of that.
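Windowing and transposition can be sketched in a few lines: enumerate every small sub-board of the full board, and mirror a window so a policy trained on one orientation covers the other. This is an illustrative sketch, not the project's actual code.

```python
def windows(board, h, w):
    """Yield every h-by-w sub-board of `board` with its top-left coordinate,
    so a policy trained on the small pattern can be applied anywhere."""
    rows, cols = len(board), len(board[0])
    for r in range(rows - h + 1):
        for c in range(cols - w + 1):
            yield (r, c), [row[c:c + w] for row in board[r:r + h]]

def transpose(window):
    """Flip rows and columns: by the game's symmetry, one trained window
    then also covers the transposed position."""
    return [list(col) for col in zip(*window)]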

So I trained the model on these sub-boards, synthetically, and I got as far as 5 by 4. After that, the algorithm would not converge; I left it running overnight and it still hadn't. I used Posit Cloud for this, so standard commodity hardware. You can probably tweak more out of it, but it's going to start costing money right about at that point.

So those were the models we had available. I trained the first two to begin with, which doesn't take very long, and then you want to do some spot checks. Here's one where I want to know that it makes an intelligent move: you call predict on the state you've created, and it says B1 to C1, which is the blue move you see on the screen here. And here's one where there isn't a reasonable move to make in a 3 by 3 grid, and it comes back with a pass. So it has been trained to pass when there isn't a reasonable move to make.

The other interesting question was: does it actually think ahead? It turns out that it does. If you give it the board on the left, it will propose A1 to B1, and we assume that's in preparation for the next move, which then scores 50 points in this case. And what happens when you have competing moves? The move to the left here scores 50 points; the move down scores 100 points the way the scoring system works. It actually chooses the higher-scoring move.

So at this point we had trained the 3 by 3 and the 3 by 4 sub-boards, and it seemed reasonable that this actually worked as a concept. We trained the rest of the sub-boards up to 5 by 4, which is as far as we got.

Browser automation and the vision model

Then I wanted to interact with the GUI and actually play the game in the browser. For that, I used Selenium with a Chrome browser, with Python for the connecting code, because I couldn't get the R package to work. There is an R package, RSelenium, but it didn't work on my hardware at the time. It actually works on the hardware I'm bringing today, because I've switched in the meantime: this is a MacBook, and I was working on a Chromebook then. So it depends on whether you have the right drivers; you're somewhat beholden to the hardware here.

The way this works is that the AI starts the game: it sends a URL to the Selenium driver, which loads the game in Chrome. The driver then takes a screenshot that gets sent back to the AI. The AI analyzes the screenshot and sends moves back to Chrome through the driver.
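The loop just described can be sketched as below. `driver` can be any object exposing the small Selenium-like surface used here (`get`, `get_screenshot_as_png`, `execute_script`); with Selenium installed you would pass `webdriver.Chrome()`. The URL and the `analyze` callback are placeholders, not the project's real code.

```python
GAME_URL = "https://example.com/game"  # placeholder, not the real game URL

def play_session(driver, analyze, n_moves=10, url=GAME_URL):
    """Load the game, then alternate screenshot -> analysis -> move."""
    driver.get(url)                            # AI sends the URL to the driver
    for _ in range(n_moves):
        png = driver.get_screenshot_as_png()   # screenshot comes back to the AI
        move_js = analyze(png)                 # vision model + policy -> a JS move
        if move_js is None:                    # no reasonable move: stop/pass
            break
        driver.execute_script(move_js)         # send the move back into Chrome
```

Sending moves via `execute_script` is the same mechanism the assistant mode uses later to draw hints directly onto the page.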

So this is what we get in; this is our raw material. Of course, we have to cut it into individual tiles to be able to evaluate what moves would be reasonable. The problem is that when you calculate the tile coordinates, you come up with decimals, which means that depending on where you are on the board, a tile is going to be slightly smaller or bigger, so you can't do a direct pixel comparison. We therefore had to create a vision model as well. We used magick, a package for image manipulation, together with multinomial logistic regression. It was relatively easy in that we could just count the amounts of red, green, and blue in the image, and that was enough information to distinguish the tile types.
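The feature idea is simple enough to sketch: the per-tile totals of red, green, and blue are the entire feature vector. The talk used R's magick package plus multinomial logistic regression; in this sketch a nearest-centroid classifier stands in for the regression, and a "tile" is just a list of (r, g, b) pixel tuples. All names here are illustrative.

```python
def rgb_totals(pixels):
    """Sum each color channel over the tile's pixels: the whole feature vector."""
    return tuple(sum(p[i] for p in pixels) for i in range(3))

def classify(pixels, centroids):
    """centroids maps label -> a typical (r, g, b) total for that tile type;
    return the label whose centroid is closest (squared Euclidean distance)."""
    f = rgb_totals(pixels)
    return min(centroids,
               key=lambda lab: sum((x - y) ** 2 for x, y in zip(f, centroids[lab])))
```

Because the features ignore pixel positions entirely, the slightly varying tile sizes from the fractional crop coordinates stop mattering.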

The way this works is that you give it an image file or an image in memory. Reading it here and printing it out, it's a star. You can also call predict, since this is a regression model, and if you ask for probabilities, it will give you a probability for every category. For our purposes we just choose the highest one; you obviously get a non-zero probability for every category, because that's how the algorithm works.

Results and business cases

So when I had this, I was able to play the game on the board, and we tried out the different strategies. We got 42.5. That's the mean score per move, which I think is the only reasonable metric to use here. There's a version of the game where you play against a timer, but then you're really testing your hardware, not the AI you created. So I chalked this up as a victory for Gatai.

Then comes the question: why do all this? The main one for me was bragging rights. Some nerds play video games, and bigger nerds create them. But if you write the code that plays the video games that these super-nerds write, then I think you are a member of the geekhood.


The other thing was that I had taken their product from their website, a wild-caught product from their catalog, and applied AI to it. That is what I wanted to do, and I wanted this to be a portfolio project. I wanted to go back to the company and say: here are some business cases where you can use AI.

The game I analyzed is not one where you have an opponent, but it's relatively easy to see how you could handle that. I did, however, implement an assistant. Let's see if I can get this to run. Here I am actually playing the game on the board, deliberately not making very good moves and stalling a little, and at some point Gatai detects that I'm stuck and, like Clippy, shows up and proposes some moves. This is also done with Selenium, because you can send JavaScript all the way through into the browser, and the JavaScript paints the Gatai logo and draws the text and the arrows and everything. From that point of view, it's quite powerful.

But what really caught their interest was the idea of applying generative AI to this at scale: we can scan in all the games in your catalog and build an LLM that creates custom games for your users. I know what kind of game you'd like to play; here is one made exactly for you by our system. Then they asked me how much that would cost, and I said, you know, maybe 5 million dollars, maybe 10. Then they asked when I could submit my last invoice. And that was the end of my career in the gaming industry.

Using the project in teaching

Now, I have a day job: I am a teacher, and I teach data science at the university. I thought about how to reuse this project, or a project like it, in my teaching, because we did a lot of work. We ran a lot of simulations and got results back. With an empirical approach, you can say: from these results we can extrapolate these rules, with a certain p-value. For example, one of them was that there is always going to be a reasonable two-move combination on this board; we have a p-value for that. Now you can take that to the next level and say: well, that's a conjecture; can we prove it mathematically? And you can. And if you want to follow that trail: what's the minimum number of tile types you need for that to be true?

When you're done with that, you can play with the geometry of the board. We're doing 6 by 10; what if you do 1 by 10? What are the rules then, mathematically? Or 1 by infinity? Infinity is always fun to work with when you're proving things mathematically.

Of course, all the code we wrote should sit in a package, so we can talk about package development. I implemented the state spaces as S3 classes and the robot as R6 classes, and there is a reason for that: what are the pros and cons of each type of class system we have available? Graphics processing is an important topic, because images are data too, and we should be able to analyze them, manipulate them, and use them in data science. And browser automation, I think, is a cool topic.

So there are a couple of tracks here: one very theoretical track, and one more engineering track, maybe with some systems integration. And of course, we have to write this up; we have to create the presentation, using Quarto for that. So I think there's enough here for a one-semester seminar for a group of students. I'm hoping to distribute the work so that the people who like mathematical proofs get to work on those, and the people who want to do more engineering can do that. And since we do this as a group project, we also learn how to work in an interdisciplinary team. So this is how I intend to use it going forward as a teacher. And that's what I had for you today. Thank you for your attention.

Q&A

Very interesting. I can't wait to go and play some Christmas 3.

A quick question from the audience: you mentioned that the model got an average score of 42.5 per move. Did you get a chance to compare that to an average human player? Only to myself, yeah. And I got, you know, 20 per move or something like that.

Okay, interesting. So do you have any thoughts about applying this concept to other kinds of games, or have you looked into that for the future? Yes. This is a puzzle game, and this kind of algorithm works really well for a puzzle game. And you can also use the Selenium connector to actually play it in the browser, which I think is very powerful.

If you want to do Space Invaders, for example, then things need to move faster, so you need to intercept. You can't really play it in the browser with a Selenium setup, take a screenshot, and evaluate; you're not going to have that much time. For that, the solution I have would be a platform called Unity, which is used to create games and actually has a plugin for AI agents. Then you bypass the rendering to the screen and play the game synthetically, just before it goes to the screen and the hardware, so you can actually train models for faster games, so to speak.

Very cool. One last question, a short one: how long did this project take, start to finish, if you're willing to say? Yeah, three weeks.

Very interesting. Thank you so much. Yeah, absolutely.