Environment Wrapper Integration

- Create Gym-compatible environment wrapper for OGM
- Integrate multi-agent observation and action spaces
- Implement single-agent-per-timestep movement constraint
- Add reset, step, and render methods

**OUTPUT**: A working `env/pivoting_cubes_env.py` that wraps OGM in OpenAI Gym interface, handles multi-agent coordination, and provides standardized RL environment API.