In many multi-agent spatiotemporal systems, the agents are under theinfluence of shared, unobserved variables (e.g., the play a team is executingin a game of basketball). As a result, the trajectories of the agents are oftenstatistically dependent at any given time step; however, almost universally,multi-agent models implicitly assume the agents' trajectories are statisticallyindependent at each time step. In this paper, we introduce baller2vec++, amulti-entity Transformer that can effectively model coordinated agents.Specifically, baller2vec++ applies a specially designed self-attention mask toa mixture of location and "look-ahead" trajectory sequences to learn thedistributions of statistically dependent agent trajectories. We show that,unlike baller2vec (baller2vec++'s predecessor), baller2vec++ can learn toemulate the behavior of perfectly coordinated agents in a simulated toydataset. Additionally, when modeling the trajectories of professionalbasketball players, baller2vec++ outperforms baller2vec by a wide margin.