Associative memory-enhanced GPT-2 model
Top 83.0% on sourcepulse
VardaGPT enhances GPT-2 with an associative memory powered by FAISS, aiming to improve context retrieval and text generation. It's designed for researchers and developers interested in memory-augmented language models.
How It Works
VardaGPT integrates a FAISS-based associative memory with a GPT-2 model. During inference and training, it retrieves relevant information from the memory based on input embeddings. This retrieved information is concatenated with the original input embeddings before being processed by the GPT-2 transformer. This approach allows the model to access and utilize a larger, external knowledge base, potentially leading to more coherent and contextually relevant text generation.
Quick Start & Requirements
pip install -r requirements.txt
followed by python train_varda_gpt_associative.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
VardaGPTAssociative
model with FAISS CPU; GPU support for FAISS is not explicitly detailed in the provided text.2 years ago
1 day