BigDL by intel

AI scaling library for Spark/Flink/Ray, from laptop to cloud

created 8 years ago
2,680 stars

Top 18.0% on sourcepulse

View on GitHub
Project Summary

BigDL is a unified distributed AI framework designed to scale data analytics and AI applications from laptops to clusters. It offers specialized libraries for large language models (LLMs), big data AI pipelines, transparent program acceleration, deep learning on Spark, time series analysis, recommendation systems, and hardware-secured AI.

How It Works

BigDL leverages Apache Spark and Ray for distributed execution, enabling users to scale single-node Python or Scala/Java programs to a cluster. Its core strength lies in high-level APIs that abstract away the complexities of distributed computing, allowing seamless integration with popular deep learning frameworks such as TensorFlow, PyTorch, and Keras. The Nano library further enhances performance by transparently applying CPU optimizations.
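
As an illustration of the "laptop to cluster" idea, here is a minimal sketch of scaling a plain single-node PyTorch script with the Orca Estimator. The API names follow the BigDL 2.x Orca documentation, but the toy model, data, and exact creator-function signatures are illustrative assumptions and should be checked against the installed version.

    # Illustrative sketch only: scaling a single-node PyTorch script with Orca.
    # API names follow the BigDL 2.x Orca docs; verify signatures for your version.
    import torch
    import torch.nn as nn
    from torch.utils.data import DataLoader, TensorDataset

    from bigdl.orca import init_orca_context, stop_orca_context
    from bigdl.orca.learn.pytorch import Estimator

    def model_creator(config):
        # An ordinary PyTorch model; nothing here is distribution-aware.
        return nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))

    def optimizer_creator(model, config):
        return torch.optim.SGD(model.parameters(), lr=config.get("lr", 1e-2))

    def train_loader_creator(config, batch_size):
        # Toy in-memory dataset standing in for real training data.
        X, y = torch.randn(1024, 10), torch.randn(1024, 1)
        return DataLoader(TensorDataset(X, y), batch_size=batch_size, shuffle=True)

    # "local" runs on a laptop; switching cluster_mode (e.g. to "yarn-client" or
    # "k8s") is the main change needed to run the same code on a cluster.
    init_orca_context(cluster_mode="local", cores=4, memory="4g")

    est = Estimator.from_torch(model=model_creator,
                               optimizer=optimizer_creator,
                               loss=nn.MSELoss(),
                               config={"lr": 1e-2})
    est.fit(data=train_loader_creator, epochs=2, batch_size=64)

    stop_orca_context()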

Quick Start & Requirements

  • Installation: pip install bigdl (installing inside a conda environment is recommended).
  • Prerequisites: Python, Spark/Ray (for distributed modes). Specific libraries may have additional requirements.
  • Documentation: BigDL Docs

Highlighted Details

  • LLM Support: Optimized for Intel CPUs and GPUs, though the bigdl-llm library is deprecated in favor of ipex-llm.
  • Orca: Scales TensorFlow, PyTorch, and OpenVINO programs on Spark/Ray clusters, supporting distributed data processing and model training.
  • Nano: Transparently accelerates TensorFlow and PyTorch programs on CPUs with optimizations such as BF16, INT8 quantization, and JIT compilation, offering up to 10x speedup (a minimal sketch follows this list).
  • PPML: Provides hardware-secured AI execution using Intel SGX/TDX for enhanced data privacy.
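
As a rough illustration of Nano's transparent acceleration, the sketch below applies InferenceOptimizer to an unmodified PyTorch model. The class and method names follow the BigDL-Nano documentation, but they are assumptions here; the toy model and calibration data are placeholders, and exact argument names (e.g. calib_data) may differ between BigDL releases.

    # Illustrative sketch only: accelerating an unmodified PyTorch model with Nano.
    # InferenceOptimizer.trace / quantize follow the BigDL-Nano docs; argument
    # names (e.g. calib_data) are assumptions and may differ between releases.
    import torch
    import torch.nn as nn
    from torch.utils.data import DataLoader, TensorDataset

    from bigdl.nano.pytorch import InferenceOptimizer

    model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1)).eval()
    sample = torch.randn(1, 10)

    # JIT-compile the model for faster CPU inference without changing its code.
    jit_model = InferenceOptimizer.trace(model, accelerator="jit", input_sample=sample)

    # Post-training INT8 quantization driven by a small calibration dataset.
    calib_loader = DataLoader(TensorDataset(torch.randn(256, 10), torch.randn(256, 1)),
                              batch_size=32)
    int8_model = InferenceOptimizer.quantize(model, precision="int8",
                                             calib_data=calib_loader)

    with torch.no_grad():
        print(jit_model(sample))
        print(int8_model(sample))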

Maintenance & Community

  • Development Focus: Future development for LLMs is directed to the ipex-llm project.
  • Community: Support via Mail List, User Group, and GitHub Issues.

Licensing & Compatibility

  • License: Apache 2.0.
  • Compatibility: Generally compatible with commercial and closed-source applications.

Limitations & Caveats

  • The bigdl-llm library is deprecated; users should migrate to ipex-llm.
  • Performance optimizations primarily target Intel hardware.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

10 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake) and Zhiqiang Xie (Author of SGLang).

veScale by volcengine

0.1%
839 stars
PyTorch-native framework for LLM training
created 1 year ago
updated 3 weeks ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera) and Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

1.0%
402 stars
Lightweight training framework for model pre-training
created 1 year ago
updated 1 week ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Stas Bekman (Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and 2 more.

gpustack by gpustack

1.6%
3k stars
GPU cluster manager for AI model deployment
created 1 year ago
updated 3 days ago
Starred by Patrick von Platen (Core Contributor to Hugging Face Transformers and Diffusers), Michael Han (Cofounder of Unsloth), and 1 more.

ktransformers by kvcache-ai

0.4%
15k stars
Framework for LLM inference optimization experimentation
created 1 year ago
updated 3 days ago
Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero (DevRel at Google DeepMind), and 5 more.

TensorRT-LLM by NVIDIA

0.6%
11k stars
LLM inference optimization SDK for NVIDIA GPUs
created 1 year ago
updated 1 day ago
Starred by George Hotz (Author of tinygrad; Founder of the tiny corp, comma.ai), Anton Bukov (Cofounder of 1inch Network), and 16 more.

tinygrad by tinygrad

0.1%
30k stars
Minimalist deep learning framework for education and exploration
created 4 years ago
updated 1 day ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Nat Friedman (Former CEO of GitHub), and 32 more.

llama.cpp by ggml-org

0.4%
84k stars
C/C++ library for local LLM inference
created 2 years ago
updated 23 hours ago