 
Intermediate
      
        
            Example
      
        
            Application
      
      Link to External Site
              
          This example demonstrates a multi-agent collaborative-competitive task in which you train three proximal policy optimization (PPO) agents to explore all areas within a grid-world environment.
Radioactivity
              0
           
    