FEDOT  by aimclub

AutoML framework for automated modeling and machine learning

created 5 years ago
682 stars

Top 50.7% on sourcepulse

GitHubView on GitHub
Project Summary

FEDOT is an open-source AutoML framework designed for automated machine learning pipeline generation. It targets data scientists and researchers seeking to automate the process of building, optimizing, and deploying ML models for classification, regression, and time series forecasting tasks. FEDOT leverages evolutionary algorithms to design complex, composite pipelines, offering flexibility, extensibility, and integrability with popular ML libraries.

How It Works

FEDOT employs a generative design approach based on evolutionary algorithms to construct machine learning pipelines. These pipelines are represented as graphs, enabling the management of complex interactions between data preprocessing steps and modeling blocks. This evolutionary core allows for the exploration of diverse pipeline structures and hyperparameters, aiming to discover optimal solutions for various data types and tasks.

Quick Start & Requirements

Highlighted Details

  • Supports classification, regression, and time series forecasting.
  • Integrates with Scikit-learn, CatBoost, XGBoost, and custom libraries.
  • Offers various hyperparameter tuning methods.
  • Pipelines can be exported to JSON for reproducibility.

Maintenance & Community

  • Supported by the Natural Systems Simulation Lab at ITMO University.
  • Active development on LLM integration, meta-learning, and optimization core.
  • Community: Telegram Channel
  • Roadmap: Implied through ongoing research and development tasks.

Licensing & Compatibility

  • License: 3-Clause BSD.
  • Compatibility: Permissive license suitable for commercial use and integration with closed-source projects.

Limitations & Caveats

The framework is under active development, with ongoing research into advanced features like LLM integration and meta-learning. Specific performance benchmarks or detailed comparisons against other AutoML tools are not explicitly detailed in the README.

Health Check
Last commit

1 week ago

Responsiveness

1 day

Pull Requests (30d)
1
Issues (30d)
1
Star History
17 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera).

towhee by towhee-io

0.2%
3k
Framework for neural data processing pipelines
created 4 years ago
updated 9 months ago
Starred by Logan Kilpatrick Logan Kilpatrick(Product Lead on Google AI Studio), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
3 more.

catalyst by catalyst-team

0%
3k
PyTorch framework for accelerated deep learning R&D
created 7 years ago
updated 1 month ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
10 more.

open-r1 by huggingface

0.2%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 3 days ago
Feedback? Help us improve.