LLM for enhanced context utilization
Top 99.5% on sourcepulse
This repository provides the official implementation for FILM-7B, a 32K-context Large Language Model designed to overcome the "lost-in-the-middle" problem. It targets researchers and developers working with long-context LLMs, offering improved performance on both probing and real-world tasks without sacrificing short-context capabilities.
How It Works
FILM-7B is a fine-tuned version of Mistral-7B-Instruct-v0.2, utilizing Information-Intensive (In2) Training. This approach enhances the model's ability to effectively process and recall information from extended contexts, achieving near-perfect scores on probing tasks and state-of-the-art performance for its size class on real-world long-context benchmarks.
Quick Start & Requirements
git clone https://github.com/microsoft/FILM.git
cd FILM
conda create -n FILM python=3.10.11
conda activate FILM
pip install torch==2.0.1 # cuda11.7 and cudnn8
pip install -r requirements.txt
Highlighted Details
Maintenance & Community
This project is from Microsoft and welcomes contributions via pull requests, requiring agreement to a Contributor License Agreement (CLA). It adheres to the Microsoft Open Source Code of Conduct.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
This repository is strictly for research purposes and is not an official Microsoft product or service. The specific license for the model weights and code is not detailed in the README.
1 year ago
Inactive