CodeGeeX4 by zai-org

Code generation model for versatile AI software development

Created 1 year ago

2,360 stars

Top 19.1% on SourcePulse

Project Summary

CodeGeeX4-ALL-9B is an open-source, multilingual code generation model designed for comprehensive AI software development tasks. It targets developers seeking a powerful, efficient, and versatile coding assistant, offering capabilities from code completion and interpretation to repository-level Q&A and function calling.

How It Works

This model is a fine-tuned version of GLM-4-9B, leveraging a large-scale, multilingual code dataset. Its architecture supports a 128K sequence length, enabling it to process extensive code contexts for tasks like repository-wide Q&A and "needle in a haystack" retrieval. It uniquely supports function calling, outperforming GPT-4 in execution success rates on this specific capability.

Quick Start & Requirements

Ollama: ollama run codegeex4 (requires Ollama 0.2+)
Huggingface Transformers: transformers>=4.39.0,<4.41.0
vLLM: vllm==0.5.1
Hardware: GPU recommended for optimal performance. CUDA 12 is supported.
Resources: Requires significant VRAM for the 9B model, especially with the 128K context.
Docs: Homepage, VS Code Extension, Jetbrains Extension, HF Demo

Highlighted Details

Achieves state-of-the-art performance for models under 10B parameters on benchmarks like BigCodeBench and NaturalCodeBench.
Supports a 128K context window, demonstrating 100% retrieval accuracy in "Code Needle In A Haystack" evaluations.
Unique function calling capability with higher execution success rates than GPT-4.
Offers extensions for VS Code and Jetbrains, and supports local deployment via Ollama and vLLM.

Maintenance & Community

The project is associated with THUDM (Tsinghua University). Community interaction channels are not explicitly listed in the README.

Licensing & Compatibility

Code: Apache-2.0 license.
Model Weights: Custom "Model License". Academic research is permitted. Commercial use requires registration via a provided form.

Limitations & Caveats

Commercial use of model weights is restricted and requires explicit registration. The README does not detail specific limitations regarding unsupported platforms or known bugs.

CodeGeeX4 by zai-org

Explore Similar Projects

prompt-tower by backnotprop

brokk by BrokkAi

codiumai-vscode-release by Codium-ai

naturalcc by CGCL-codes

build-your-ai-coding-assistant by unit-mesh

awesome-ai-coding by wsxiaoys

IQuest-Coder-V1 by IQuestLab

copilot-clone by hieunc229

CodeXGLUE by microsoft

CodeGen by salesforce

CodeGeeX2 by zai-org

DeepSeek-Coder by deepseek-ai