Discover and explore top open-source AI tools and projects—updated daily.
facebookresearchSelf-supervised pretraining for intuitive physics understanding
Top 98.6% on SourcePulse
This repository provides the code and data to reproduce the findings of the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos." It enables researchers and engineers to evaluate models on intuitive physics tasks using self-supervised pretraining on natural videos, facilitating reproducibility and further research in video understanding and AI reasoning.
How It Works
The project leverages self-supervised pretraining, specifically building upon the JEPA (Joint Embedding Predictive Architecture) framework. It evaluates models by extracting "surprise metrics" from natural videos, quantifying their intuitive physics understanding. The approach supports V-JEPA and VideoMAEv2 models, processing raw surprise outputs into performance metrics for analysis and figure generation.
Quick Start & Requirements
data_intphys.tar.gz. Install dependencies via requirements.txt.utils.py) to specify cluster environment and dataset paths.python -m evals.main or distributed runs with submitit via python -m evals.main_distributed. Configuration files (.yaml) specify model checkpoints and evaluation tasks.Highlighted Details
.pth, .csv).intphys_test evaluation.Maintenance & Community
Authored by researchers from Meta AI (formerly Facebook AI Research), including Yann LeCun. No specific community channels (Discord, Slack) or roadmap links are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The setup requires manual adaptation of cluster configurations and dataset paths. The non-commercial license restricts its use in industry or for profit-driven projects. The focus is on reproducing specific research findings rather than providing a general-purpose library.
1 week ago
Inactive
cambrian-mllm
microsoft