CLI tool for natural language question answering over text files
Top 74.2% on SourcePulse
fltr is a command-line tool that enables natural language querying of text files, acting as a "grep for questions." It leverages large language models (LLMs) like Mistral 7B and Mixtral 8x7B to understand and answer questions posed in natural language, filtering files based on the LLM's responses. This is beneficial for users who need to extract specific information from large text datasets without relying on complex regular expressions or keyword searches.
How It Works
fltr utilizes LLMs to process text files. Users provide a text file and a natural language prompt. The tool then feeds chunks of the text file along with the prompt to the LLM. The LLM evaluates each chunk against the prompt, and fltr outputs lines where the LLM's inferred answer is affirmative. This approach allows for semantic understanding and context-aware filtering, going beyond simple pattern matching.
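The flow described above can be sketched in Python. This is a minimal illustration under stated assumptions, not fltr's actual implementation: `ask_llm` is a hypothetical stand-in for the model call, `fake_llm` is a trivial keyword check so the example runs without a model, and treating each line as one chunk is an assumption made for simplicity.

```python
from typing import Callable, Iterable, Iterator, List


def batched(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most batch_size items."""
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly shorter, batch
        yield batch


def filter_lines(
    lines: Iterable[str],
    prompt: str,
    ask_llm: Callable[[List[str]], List[bool]],
    batch_size: int = 32,
) -> List[str]:
    """Keep the lines for which the (hypothetical) model answers yes."""
    kept: List[str] = []
    for batch in batched(lines, batch_size):
        # Each query is the user's prompt followed by one chunk of text.
        queries = [f"{prompt} {line}" for line in batch]
        answers = ask_llm(queries)
        kept.extend(line for line, yes in zip(batch, answers) if yes)
    return kept


def fake_llm(queries: List[str]) -> List[bool]:
    """Placeholder for a real model: answers yes if the text mentions 'free money'."""
    return ["free money" in query.lower() for query in queries]


emails = [
    "Meeting moved to 3pm",
    "Claim your FREE MONEY now",
    "Lunch on Friday?",
    "Free money inside, click here",
]
spam = filter_lines(emails, "Is the following email spam? Email:", fake_llm, batch_size=2)
print(spam)
```

Passing the model call in as a function keeps the filtering logic separate from whichever model backs it, so the keyword stub can be swapped for a real LLM without changing `filter_lines`.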
Quick Start & Requirements
Install: curl https://raw.githubusercontent.com/moritztng/fltr/main/install.sh -o install.sh && bash install.sh small
Use small for Mistral 7B (~7GB) or replace with large for Mixtral 8x7B (~48GB).
Run: fltr --file emails.txt --prompt "Is the following email spam? Email:" --batch-size 32
Add to PATH: export PATH=$PATH:~/Fltr
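For scripted use, the invocation above could be assembled from Python. This is a hedged sketch: `build_fltr_command` is a hypothetical helper, it uses only the three flags shown in the quick start, and actually running the command assumes fltr is installed and on PATH.

```python
import shlex
from typing import List


def build_fltr_command(file: str, prompt: str, batch_size: int = 32) -> List[str]:
    """Build the argument list for the fltr invocation shown in the quick start."""
    return [
        "fltr",
        "--file", file,
        "--prompt", prompt,
        "--batch-size", str(batch_size),
    ]


cmd = build_fltr_command("emails.txt", "Is the following email spam? Email:")

# Print a shell-safe version of the command for inspection.
print(shlex.join(cmd))

# To execute it (requires fltr to be installed and on PATH):
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when the prompt contains spaces or punctuation.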
Highlighted Details
Supports --batch-size for potentially faster processing.
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README does not specify the license, which is crucial for commercial use or integration into closed-source projects. Performance on CPU is significantly limited, making GPU acceleration a practical necessity for reasonable throughput. The tool's effectiveness is highly dependent on the LLM's ability to accurately interpret the prompt and the text content.
1 year ago
Inactive