Transformer library for code LLMs and code intelligence tasks
Top 28.4% on sourcepulse
CodeTF is a comprehensive Python library for code Large Language Models (Code LLMs) and code intelligence, targeting researchers and developers. It simplifies training, fine-tuning, and inference for tasks like code generation, summarization, and translation, offering a unified interface to state-of-the-art models and benchmarks.
How It Works
CodeTF builds on the HuggingFace Transformers ecosystem. It provides pipelines for serving pre-quantized models (int8, int16, float16) and supports weight sharding for models too large to load in one piece. For fine-tuning it integrates HuggingFace PEFT, which trains a small set of added parameters instead of the full model. For code analysis it uses tree-sitter to parse source into Abstract Syntax Trees (ASTs) across 15+ programming languages, which makes it possible to extract and manipulate code attributes such as function names, comments, and variable names.
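To make the idea of AST-based attribute extraction concrete, here is a minimal sketch. It deliberately uses Python's built-in `ast` module as a stand-in, not tree-sitter and not CodeTF's own utilities, so it only handles Python source. The function name `extract_code_attributes` and the `SAMPLE` snippet are hypothetical, chosen for illustration.

```python
import ast


def extract_code_attributes(source: str) -> dict:
    """Collect function, class, and import names from Python source.

    Illustration only: uses the stdlib `ast` module, not tree-sitter
    and not CodeTF's API.
    """
    tree = ast.parse(source)
    attributes = {"functions": [], "classes": [], "imports": []}
    # ast.walk visits every node in the tree (in no guaranteed order).
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            attributes["functions"].append(node.name)
        elif isinstance(node, ast.ClassDef):
            attributes["classes"].append(node.name)
        elif isinstance(node, ast.Import):
            attributes["imports"].extend(alias.name for alias in node.names)
        elif isinstance(node, ast.ImportFrom):
            attributes["imports"].append(node.module or "")
    return attributes


# Hypothetical sample input for the sketch.
SAMPLE = '''
import os
from math import sqrt

class Greeter:
    def greet(self, name):
        return "Hello, " + name

def area(r):
    return 3.14159 * r * r
'''

if __name__ == "__main__":
    print(extract_code_attributes(SAMPLE))
```

A multi-language parser such as tree-sitter follows the same pattern (parse to a tree, walk the nodes, filter by node type), but with a separate grammar per language.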
Quick Start & Requirements
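Install from PyPI: pip install salesforce-codetf
Upgrade transformers, peft, and accelerate to their latest development versions: pip install -U git+https://github.com/huggingface/transformers.git git+https://github.com/huggingface/peft.git git+https://github.com/huggingface/accelerate.git
Authenticate with the HuggingFace Hub (needed for gated models): huggingface-cli login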
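After installation, a quick sanity check can confirm that the packages are importable. The sketch below uses only the standard library. The helper name `check_installed` is hypothetical, and the import name `codetf` for the `salesforce-codetf` package is an assumption.

```python
from importlib.util import find_spec

# Import names assumed for the packages installed above.
REQUIRED_MODULES = ("codetf", "transformers", "peft", "accelerate")


def check_installed(modules=REQUIRED_MODULES) -> dict:
    """Map each top-level module name to True if it can be located."""
    # find_spec returns None when a top-level module is not installed.
    return {name: find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    for name, ok in check_installed().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

This only checks that each module can be found; it does not verify versions or that quantized loading works on the current hardware.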
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
CodeTF is designed to complement HuggingFace Transformers, not replace it; users who need extensive customization may prefer to build from scratch. The library makes no guarantee that its code intelligence output is correct, and users are advised to examine models for potential inaccuracies, biases, and security risks before adopting them.