slop-forensics  by sam-paech

Analyze LLM output for repetitive patterns

Created 5 months ago
266 stars

Top 96.2% on SourcePulse

GitHubView on GitHub
Project Summary

This toolkit addresses the identification and analysis of "slop"—over-represented lexical patterns—in Large Language Model (LLM) outputs. It enables researchers and developers to generate standardized LLM outputs, profile their repetitive word usage, create canonical slop lists, and cluster models based on linguistic similarity.

How It Works

The toolkit operates in four main stages: dataset generation, slop profiling, slop list creation, and phylogenetic tree building. Slop profiling involves counting word and phrase frequencies, filtering common words and numbers, and calculating repetition scores and vocabulary complexity. Slop lists are created by aggregating these profiles across models to identify consistently overused terms. Phylogenetic trees are then generated by treating models as species and slop term usage as genetic traits, using bioinformatics tools like PHYLIP for parsimony analysis or falling back to hierarchical clustering.

Quick Start & Requirements

Highlighted Details

  • Generates standardized LLM outputs for comparative analysis.
  • Profiles include word, bigram, trigram usage, repetition scores, and vocabulary complexity.
  • Creates canonical slop lists for single words, bigrams, trigrams, and multi-word phrases.
  • Utilizes bioinformatics tools (PHYLIP) for clustering models based on output similarity.

Maintenance & Community

  • Contact: Sam Paech or create an issue on GitHub.
  • Citation: Provided in bibtex format.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive for commercial use and integration with closed-source projects.

Limitations & Caveats

The phylogenetic analysis relies on PHYLIP, which may require manual installation and configuration if not available via package managers. The effectiveness of slop analysis is dependent on the quality and diversity of the generated dataset prompts.

Health Check
Last Commit

3 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Stas Bekman Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and
1 more.

tokenmonster by alasdairforsythe

0.2%
600
Subword tokenizer and vocabulary trainer for multiple languages
Created 2 years ago
Updated 1 year ago
Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Tri Dao Tri Dao(Chief Scientist at Together AI), and
1 more.

hnet by goombalab

1.5%
722
Hierarchical sequence modeling with dynamic chunking
Created 2 months ago
Updated 1 month ago
Starred by Peter Norvig Peter Norvig(Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

Hands-On-Large-Language-Models by HandsOnLLM

1.4%
16k
Code examples for "Hands-On Large Language Models" book
Created 1 year ago
Updated 1 month ago
Feedback? Help us improve.