Super Mario Bros RL Agent
Production-ready PPO implementation that trains an agent to play Super Mario Bros. It features a custom CNN architecture (4 layers, with a 512-unit fully connected layer), vectorized training across 8 parallel environments, and comprehensive evaluation tools. The agent reaches an 80% success rate after 10M timesteps. Docker support, TensorBoard and W&B integration, and a modular codebase are included.
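To make the numbers above more concrete, here is a rough sizing sketch in plain Python. Only the 512-unit fully connected layer, the 8 environments, and the 10M timesteps come from the description. The reading of "4-layer" as three convolutions plus one fully connected layer, the kernel and stride values, the 84x84 input, and the idea that the 10M timesteps are counted across all environments are assumptions made for illustration, not the project's actual configuration.

```python
# Hypothetical layout: 3 unpadded conv layers followed by one 512-unit FC layer.
# The (out_channels, kernel, stride) values and the 84x84 input are assumptions.
CONV_LAYERS = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]
INPUT_SIZE = 84

FC_UNITS = 512                 # from the description
NUM_ENVS = 8                   # from the description
TOTAL_TIMESTEPS = 10_000_000   # from the description


def conv_out(size: int, kernel: int, stride: int) -> int:
    """Output side length of a square, unpadded convolution."""
    return (size - kernel) // stride + 1


def flat_features(input_size: int, layers) -> int:
    """Number of features left after the conv stack is flattened."""
    size, channels = input_size, 0
    for channels, kernel, stride in layers:
        size = conv_out(size, kernel, stride)
    return channels * size * size


features = flat_features(INPUT_SIZE, CONV_LAYERS)
# Weights plus biases of the fully connected layer fed by the conv stack.
fc_params = features * FC_UNITS + FC_UNITS
# Assumes the 10M timesteps are summed over all parallel environments.
steps_per_env = TOTAL_TIMESTEPS // NUM_ENVS

print(features)       # 3136
print(fc_params)      # 1606144
print(steps_per_env)  # 1250000
```

Under these assumptions the fully connected layer holds most of the network's parameters, and each of the 8 environments contributes 1.25M steps to the total.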