Model merging via evolutionary optimization research
Top 30.3% on sourcepulse
This repository provides code and models for SakanaAI's Evolutionary Model Merge series, focusing on optimizing model merging recipes for improved performance. It targets researchers and developers working with large language and vision-language models, offering a method to create superior merged models from existing ones.
How It Works
The project employs an evolutionary optimization approach to discover effective model merging strategies. It iteratively merges base models using a defined recipe and evaluates the resulting merged model against specific benchmarks. The process then selects and combines the best-performing merges to generate the next generation of recipes, aiming to discover optimal merging configurations that outperform individual source models.
Quick Start & Requirements
pip install -e .
lid.176.ftz
fastext model.python evaluate.py --config_path {path-to-config}
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project explicitly states that tests were conducted with Python 3.10.12 and CUDA 12.3, and compatibility is not guaranteed in other environments.
8 months ago
Inactive