Appendix

Basic game theory

Cooperative AI by Center for AI Safety

5:05 to 20:32

25 mins

Multi-Agent Reinforcement Learning Foundations And Modern Approaches

'3 Games: Models of Multi-Agent Interaction' up to and excluding '3.3 Stochastic Games'

20 mins

Standard Games

Game theory posts from nonzerosum.games

The articles under 'The Classics' except 'The Dilemma's Dilemma'

Prerequisites
30 mins

Intermediate game theory

Extensive form games

The following resource introduces extensive-form games, which model scenarios where players act in sequence over time.

Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations

'5.1 Perfect-information extensive-form games' excluding '5.1.3 Subgame-perfect equilibrium' and '5.1.4 Computing equilibria: backward induction'

Prerequisites
20 mins

Repeated games

The basic game theory resources above introduce repeated games. The following is a more rigorous treatment that uses the extensive-form representation of games.

Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations

'6.1 Repeated games' excluding '6.1.3 “Bounded rationality”: repeated games played by automata'

30 mins

Solution Concepts

This resource explains solution concepts. The chapter '3.4 Further solution concepts for normal-form games' includes descriptions of many important solution concepts and can be used as reference material.

Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations

'3.3 Analyzing games: from optimality to equilibrium' up to and excluding '3.3.3 Finding Nash equilibria'

Prerequisites
15 mins

Next is another look at solution concepts beyond the context of normal-form games. The chapter '4 Solution Concepts for Games' also includes descriptions of many important solution concepts and can be used as reference material.

Multi-Agent Reinforcement Learning Foundations And Modern Approaches

'4 Solution Concepts for Games' up to and including '4.2 Best Response', and '4.4 Nash Equilibrium'

30 mins

Markov Games

Note that Markov games and stochastic games are different terms for the same thing.

Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations

'6.2 Stochastic games' excluding '6.2.3 Computing equilibria'

Prerequisites
10 mins

Multi-Agent Reinforcement Learning Foundations And Modern Approaches

'3.3 Stochastic Games' and '3.4 Partially Observable Stochastic Games'

Prerequisites
20 mins

Fictitious Play and Regret Matching

Fictitious Play and Regret Matching

From 1:02

Prerequisites
15 mins

In the following resource you can skip most of '2.4 Worked Example: Rock-Paper-Scissors'.

An Introduction to Counterfactual Regret Minimization

'2 Regret in Games'

Prerequisites
40 mins

Machine Learning

But What Is a Neural Network?

All parts

25 mins

A short introduction to machine learning

All parts

20 mins

Reinforcement Learning

Reinforcement Learning: Machine Learning Meets Control Theory

All parts

40 mins

Hugging Face Introduction to Deep Reinforcement Learning

Unit 1. Introduction to Deep Reinforcement Learning

1 hr

Q-Learning

Hugging Face Introduction to Q-Learning

Unit 2. Introduction to Q-Learning

1 hr 30 mins

Proximal Policy Optimisation

Hugging Face Introduction to Deep Reinforcement Learning

Unit 8. Part 1 Proximal Policy Optimization (PPO)

35 mins

Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning Foundations And Modern Approaches

'1 Introduction' up to and excluding '1.4 Challenges of MARL'

20 mins

Large Language Models

Intro to Large Language Models

Up until 21:05

30 mins

LLM-Based Agents

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Unit 1. Introduction to Deep Reinforcement Learning

Prerequisites
20 mins