- 教授:吳毅成
- Lab1: Temporal Difference Learning Demo for Game 2048
- 117/120
- Lab2: Deep Q-Network for Atari MsPacman-v5
- 116/120
- Lab3: Proximal Policy Optimization for Atari Enduro-v5
- 120/120
- Lab4: Twin Delayed DDPG for CarRacing-v2
- 119/130
- Project: Racecar_gym
- 95.6/100