Tic-Tac-Toe (Reinforcement Learning)
Pick a mode, then click a square. X always starts.
Mode
Vs Agent
Player vs Player
New Game
X
O
Training (advanced)
Train (50k episodes)
Teacher ability