Discover and explore top open-source AI tools and projects—updated daily.
leochlonLoRA-free fine-tuning for causal language models
New!
Top 89.7% on SourcePulse
NTK-Mirror offers a LoRA-free method for fine-tuning Hugging Face causal language models by learning a small, signed controller on top of a frozen Transformer. This approach is beneficial for users seeking efficient adaptation without permanent weight modifications, targeting researchers and power users working with LLMs.
How It Works
The core mechanism involves learning a sparse set of shared log-gates applied to decoder-layer output channels. These gates, represented as exp(s_layer, channel), modulate the hidden states h. The controller is trained using teacher-forced examples and then attached to the base model during inference. This method avoids introducing LoRA modules or altering the base model's weights, operating purely within the forward pass for efficient, modular adaptation.
Quick Start & Requirements
git clone https://github.com/leochlon/ntkmirror.git), navigate into the directory (cd ntkmirror), and install in editable mode (pip install -e .)..cuda() in the Python API example.transformers, torch. Install optional dataset dependencies with pip install -e '.[datasets]'.https://github.com/leochlon/ntkmirror.gitbash examples/run_demo.shdocs/composability.md, docs/persistent_memory.md, docs/method.mdHighlighted Details
Maintenance & Community
The project is authored by Leon Chlon and associated with Hassana Labs. No specific community channels (like Discord or Slack) or detailed roadmap information are provided in the README.
Licensing & Compatibility
The project is released under the MIT License, which is permissive and generally suitable for commercial use and integration into closed-source projects.
Limitations & Caveats
This package intentionally omits the full research harness for advanced diagnostics (e.g., NTK-vector diagnostics, oracle SGD-displacement fitting). The default retriever for persistent memory is a basic TF-IDF scorer, with recommendations to replace it with embedding-based solutions for production use. The documentation mentions "failure modes," suggesting areas that may require further investigation or represent known limitations.
3 days ago
Inactive
Ziems
tomaarsen
adapter-hub
NVIDIA-NeMo
guidance-ai