Large-scale synthetic video dataset of physically simulated multi-object interaction scenes for training and evaluating models on physical reasoning, depth and optical-flow estimation, instance segmentation, and physics-grounded captioning. Provides RGB + lossless depth, per-frame instance masks, per-object physics annotations (NPZ), VLM-grounded captions, and USD scene files — useful for world-model and simulation-to-real work; commercial use permitted.