frames_of_mind by dhealy05

Tool to visualize LLM chain-of-thought "thought process"

Created 11 months ago

384 stars

Top 74.5% on SourcePulse

Project Summary

This project visualizes the "thought process" of AI models by converting chains of thought into embeddings and plotting them using t-SNE. It is targeted at researchers and developers interested in understanding and analyzing AI reasoning patterns. The primary benefit is providing a visual representation of an AI's internal "thinking" steps, potentially revealing phases like searching, stable thinking, and concluding.

How It Works

The project captures AI "chains of thought" as text, then utilizes the OpenAI API to convert these text sequences into numerical embeddings. These embeddings are subsequently processed with t-SNE for dimensionality reduction, allowing for sequential plotting. Cosine similarity between consecutive embeddings is calculated and normalized to a 0-1 range to quantify the "distance" or change between thought steps, highlighting significant shifts in the AI's reasoning.

Quick Start & Requirements

Install dependencies via pipenv install.
Run analysis with python run.py.
Requires Python 3.x and an OpenAI API key.
Chains of thought are provided in data/chains/.
A JavaScript snippet (pull_cot.js) is available for downloading chat data from Deepseek's interface.

Highlighted Details

Visualizes AI thought processes using t-SNE embeddings.
Analyzes consecutive embedding distances via normalized cosine similarity.
Offers aggregate distance plots to identify reasoning phases (search, thinking, concluding).
Includes example prompts and pre-collected chains of thought.

Maintenance & Community

No specific information on maintainers, community channels, or roadmap is provided in the README.

Licensing & Compatibility

The README does not specify a license.

Limitations & Caveats

The project relies on the OpenAI API, incurring costs and requiring an API key. The visualization method (t-SNE) is sensitive to parameter choices and may not always perfectly represent high-dimensional relationships. Data extraction from Deepseek requires manual execution of a JavaScript snippet in the browser console.

frames_of_mind by dhealy05

Explore Similar Projects

Luotuo-Text-Embedding by LC1332

LaVIT by jy0205

MindMap by wyl-willing

llm by ahyatt

WordLlama by dleemiller

graphologue by foundation-interface

bio_embeddings by sacdallago

DataChad by gustavz

aella-data-explorer by context-labs

fastembed by qdrant

lida by microsoft

OpenNE by thunlp