Discover and explore top open-source AI tools and projects—updated daily.
Semiconductor LLM for domain-specific tasks
Top 77.7% on SourcePulse
SEMIKONG is an open-source, industry-specific large language model (LLM) designed for the semiconductor manufacturing domain. It addresses the need for specialized AI capabilities in this complex field by providing models trained on a comprehensive corpus of semiconductor-related text, enabling better understanding of physics, chemistry, and processes. The project targets engineers, researchers, and companies in the semiconductor industry, offering a foundation for building proprietary AI solutions and improving productivity.
How It Works
SEMIKONG leverages the Transformer architecture and is based on the Llama model, allowing seamless integration with the existing Llama ecosystem. It utilizes a novel pre-training approach incorporating domain-specific knowledge to achieve superior performance on industry-relevant benchmarks compared to general-purpose LLMs. The project offers both 8B and 70B parameter instruct models, with weights available on Hugging Face.
Quick Start & Requirements
git clone https://github.com/aitomatic/semikong.git
), navigate into the directory (cd semikong
), and install dependencies (pip install -r requirements.txt
).Highlighted Details
LlamaForCausalLM
, LlamaTokenizer
).Maintenance & Community
The project is a collaborative effort involving Tokyo Electron, FPT Software AIC, and Aitomatic, with contributions from AI Alliance members. Discussions can be held on GitHub.
Licensing & Compatibility
The code and weights are distributed under the Apache 2.0 License, permitting personal, academic, and commercial use. Derivative works require attribution.
Limitations & Caveats
While efforts are made for data compliance, the model may still produce incorrect or problematic outputs due to data complexity and usage scenarios. The project disclaims responsibility for risks arising from misuse.
4 months ago
Inactive