Text this: Comparative study of SAC and PPO in multi-agent reinforcement learning using unity ML-agents