Discover and explore top open-source AI tools and projects—updated daily.
pasquini-darioLLM fingerprinting for model identification
Top 99.8% on SourcePulse
LLMmap is a tool designed to identify Large Language Models (LLMs) by analyzing their behavioral traces through minimal, targeted queries. It aims to provide high-accuracy LLM fingerprinting for researchers and developers. The project has been rebuilt in PyTorch (v0.2), offering updated models and procedures for enhanced capabilities.
How It Works
The core methodology involves sending a set of carefully constructed queries, each wrapped in diverse prompt configurations, to target LLMs. LLMmap analyzes the resulting responses to generate a unique behavioral fingerprint for each model. This fingerprint is then used to identify unknown LLMs by comparing their behavioral patterns against a database of known models. The PyTorch-based approach focuses on efficiency and accuracy in this identification process.
Quick Start & Requirements
pip install -r requirements.txt../data/pretrained_models/default/, including PyTorch weights, configuration, and behavioral templates for 52 LLMs.main_interactive.py).Highlighted Details
make_dataset.py script automates the generation of training and testing datasets by querying specified LLMs with configurable prompts and queries.test_model.py) to evaluate the top-k accuracy of pre-trained models against a curated list of supported LLMs.Maintenance & Community
The project welcomes contributions to keep pace with the rapidly evolving LLM landscape, with an email address (chime.infant_0g@icloud.com) provided for collaboration inquiries. A paper detailing the research is available.
Licensing & Compatibility
The provided README does not specify a software license. This absence may pose compatibility concerns for commercial use or integration into closed-source projects.
Limitations & Caveats
The recent PyTorch rebuild (v0.2) is not a direct one-to-one conversion from previous versions, potentially leading to differences in models and procedures. Currently, the add_new_template.py script for extending the model's capabilities only supports Hugging Face LLMs, with plans for broader backend support. The lack of a specified license is a significant caveat.
8 months ago
Inactive
riccardomusmeci
instructlab