Discover and explore top open-source AI tools and projects—updated daily.
moritztngCLI tool for natural language question answering over text files
Top 74.1% on SourcePulse
fltr is a command-line tool that enables natural language querying of text files, acting as a "grep for questions." It leverages large language models (LLMs) like Mistral 7B and Mixtral 8x7B to understand and answer questions posed in natural language, filtering files based on the LLM's responses. This is beneficial for users who need to extract specific information from large text datasets without relying on complex regular expressions or keyword searches.
How It Works
fltr utilizes LLMs to process text files. Users provide a text file and a natural language prompt. The tool then feeds chunks of the text file along with the prompt to the LLM. The LLM evaluates each chunk against the prompt, and fltr outputs lines where the LLM's inferred answer is affirmative. This approach allows for semantic understanding and context-aware filtering, going beyond simple pattern matching.
Quick Start & Requirements
curl https://raw.githubusercontent.com/moritztng/fltr/main/install.sh -o install.sh && bash install.sh smallsmall for Mistral 7B (~7GB) or replace with large for Mixtral 8x7B (~48GB).fltr --file emails.txt --prompt "Is the following email spam? Email:" --batch-size 32export PATH=$PATH:~/FltrHighlighted Details
--batch-size for potentially faster processing.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README does not specify the license, which is crucial for commercial use or integration into closed-source projects. Performance on CPU is significantly limited, making GPU acceleration a practical necessity for reasonable throughput. The tool's effectiveness is highly dependent on the LLM's ability to accurately interpret the prompt and the text content.
1 year ago
Inactive
finic-ai
JohnSnowLabs
fighting41love