This repository provides a collection of LoRA models and training code for enhancing AI-driven software development efficiency. It targets engineers and researchers interested in fine-tuning large language models like LLaMA and ChatGLM for tasks such as user story generation, test code creation, code completion, and text-to-SQL conversion. The project offers pre-trained LoRA models and detailed tutorials for replicating the training process.
How It Works
The project leverages LoRA (Low-Rank Adaptation) to fine-tune pre-trained models on datasets tailored to specific software engineering tasks. It standardizes the AI-assisted development process by breaking work into granular steps and feeding step-specific data to the models. The goal is to maximize the "copy-paste" effect of AI, i.e., producing outputs for each micro-task that are accurate enough to use directly. Datasets are prepared with OpenAI, which generates user tasks and stories, as well as code and test cases derived from class information.
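The fine-tuning step itself follows the standard LoRA recipe. Below is a minimal sketch using the Hugging Face `peft` and `transformers` libraries; the base model name, dataset file, and hyperparameters are illustrative placeholders, not the repository's exact configuration:

```python
# Sketch: attach LoRA adapters to a causal LM and fine-tune on an
# instruction dataset. Names and hyperparameters are placeholders.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

base_model = "decapoda-research/llama-7b-hf"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA tokenizers ship without a pad token
# alpaca-lora loads the base model in 8-bit via bitsandbytes to save memory;
# a plain load is used here for simplicity.
model = AutoModelForCausalLM.from_pretrained(base_model)

# LoRA injects small low-rank matrices into the attention projections,
# so only a tiny fraction of the parameters is trained.
lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,                        # scaling factor for the update
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()        # typically well under 1% of all weights

# Instruction/output pairs prepared beforehand (e.g. user-story data);
# "user_story_train.json" is a hypothetical file name.
dataset = load_dataset("json", data_files="user_story_train.json")["train"]

def tokenize(example):
    text = f"{example['instruction']}\n{example['output']}"
    return tokenizer(text, truncation=True, max_length=512)

dataset = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    train_dataset=dataset,
    args=TrainingArguments(
        output_dir="lora-out",
        per_device_train_batch_size=4,
        num_train_epochs=3,
        learning_rate=3e-4,
        fp16=True,
    ),
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("lora-out")  # writes only the small adapter weights
```

Because only the low-rank adapter matrices are trained and saved, the resulting artifact is a few megabytes rather than a full checkpoint, which is what makes distributing per-task LoRA models (user stories, test code, text-to-SQL) practical.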
Quick Start & Requirements
- Installation: Primarily through provided Jupyter notebooks (`alpaca-lora.ipynb`, `chatglm-tuning.ipynb`) and Python scripts.
- Prerequisites: Python, PyTorch, Hugging Face Transformers, and CUDA for GPU acceleration. Specific base models are required per task (e.g., LLaMA-7B, ChatGLM-6B). Access to cloud GPUs (e.g., OpenBayes) is recommended for training; a minimal sketch of running a trained adapter follows the links below.
- Resources: Training LoRA models can be resource-intensive, with training times varying from 25 minutes to several hours depending on dataset size and hardware.
- Links:
  - LLaMA Alpaca LoRA Training: https://github.com/tloen/alpaca-lora
  - ChatGLM Tuning: https://github.com/unit-mesh/unit-minions/blob/main/chatglm-tuning.ipynb
  - Data Preparation: https://github.com/unit-mesh/minions-data-prepare
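As noted in the prerequisites, a trained adapter is applied on top of its base model at inference time. A minimal sketch, again assuming `peft` and `transformers`; the adapter path `lora-out` is a placeholder, not one of the project's published adapter IDs:

```python
# Sketch: load a trained LoRA adapter onto its base model and generate.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model = "decapoda-research/llama-7b-hf"  # must match the adapter's training base
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(base_model)
model = PeftModel.from_pretrained(model, "lora-out")  # adapter weights only
model.eval()

prompt = "Write a user story for a login page."
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```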
Highlighted Details
- Focuses on practical AI applications across the software development lifecycle.
- Provides specific LoRA models for user story generation, test code generation, code assistance, and text-to-SQL.
- Demonstrates a methodology for fine-grained task decomposition for AI training.
- Includes video tutorials and example outputs for various AI-assisted tasks.
Maintenance & Community
- The project is associated with unit-mesh.
- Sponsors include AIOS Club (for OpenAI Key) and OpenBayes (for Cloud GPU).
- The roadmap indicates that several key training tasks have been completed.
Licensing & Compatibility
- The README does not explicitly state a license for the repository's code or datasets. It does mention the use of OpenAI-generated data and publicly available projects, and notes that users are responsible for the consequences of their own training. Suitability for commercial use is not specified.
Limitations & Caveats
- The project relies heavily on OpenAI for data generation, which incurs API costs and is subject to OpenAI's usage policies.
- OpenAI-generated test cases are noted as potentially unreliable, so human review is recommended.
- The datasets used for text-to-SQL and text-to-code are described as low quality but usable.