FEDOT  by aimclub

AutoML framework for automated modeling and machine learning

Created 5 years ago
692 stars

Top 49.2% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

FEDOT is an open-source AutoML framework designed for automated machine learning pipeline generation. It targets data scientists and researchers seeking to automate the process of building, optimizing, and deploying ML models for classification, regression, and time series forecasting tasks. FEDOT leverages evolutionary algorithms to design complex, composite pipelines, offering flexibility, extensibility, and integrability with popular ML libraries.

How It Works

FEDOT employs a generative design approach based on evolutionary algorithms to construct machine learning pipelines. These pipelines are represented as graphs, enabling the management of complex interactions between data preprocessing steps and modeling blocks. This evolutionary core allows for the exploration of diverse pipeline structures and hyperparameters, aiming to discover optimal solutions for various data types and tasks.

Quick Start & Requirements

Highlighted Details

  • Supports classification, regression, and time series forecasting.
  • Integrates with Scikit-learn, CatBoost, XGBoost, and custom libraries.
  • Offers various hyperparameter tuning methods.
  • Pipelines can be exported to JSON for reproducibility.

Maintenance & Community

  • Supported by the Natural Systems Simulation Lab at ITMO University.
  • Active development on LLM integration, meta-learning, and optimization core.
  • Community: Telegram Channel
  • Roadmap: Implied through ongoing research and development tasks.

Licensing & Compatibility

  • License: 3-Clause BSD.
  • Compatibility: Permissive license suitable for commercial use and integration with closed-source projects.

Limitations & Caveats

The framework is under active development, with ongoing research into advanced features like LLM integration and meta-learning. Specific performance benchmarks or detailed comparisons against other AutoML tools are not explicitly detailed in the README.

Health Check
Last Commit

3 weeks ago

Responsiveness

1 week

Pull Requests (30d)
1
Issues (30d)
1
Star History
7 stars in the last 30 days

Explore Similar Projects

Starred by Edward Sun Edward Sun(Research Scientist at Meta Superintelligence Lab), Phil Wang Phil Wang(Prolific Research Paper Implementer), and
1 more.

grain by google

0.9%
536
Python library for ML training data pipelines
Created 3 years ago
Updated 1 day ago
Feedback? Help us improve.