LDAvis  by cpsievert

R package for interactive topic model visualization

Created 11 years ago
566 stars

Top 56.8% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

LDAvis is an R package designed to help users interpret topic models by providing an interactive, web-based visualization. It extracts key information from fitted topic models, enabling users to explore relationships between topics, terms, and documents, thereby facilitating a deeper understanding of textual data.

How It Works

LDAvis leverages a scatter plot to represent topics, where proximity indicates semantic similarity. Terms are displayed as word clouds, sized by their relevance to the selected topic. The visualization uses a two-dimensional projection (often t-SNE or similar) of the topic-term distributions to position topics. Users can adjust a slider to dynamically re-weight terms based on their relevance and frequency, allowing for nuanced exploration of topic content.

Quick Start & Requirements

  • Install from CRAN: install.packages("LDAvis")
  • Install from GitHub: devtools::install_github("cpsievert/LDAvis")
  • Requires R.
  • Official documentation and examples are available via help(createJSON, package = "LDAvis") and the vignette vignette("details", package = "LDAvis").

Highlighted Details

  • Facilitates interpretation of LDA topic models.
  • Interactive visualization with topic-term relationships.
  • Supports dynamic term re-weighting via a lambda slider.
  • Can encode visualization state into URLs for sharing.

Maintenance & Community

  • Developed by Carson Sievert.
  • Additional data sets are available in the LDAvisData package.
  • Links to video demos and a research paper are provided.

Licensing & Compatibility

  • License: Not explicitly stated in the README.
  • Compatibility: Designed for R, can integrate with topic models generated by other tools like MALLET and gensim.

Limitations & Caveats

LDAvis itself does not perform topic model fitting; it requires pre-fitted models. The README does not specify the license, which could impact commercial use or closed-source integration.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 30 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
8 more.

BERTopic by MaartenGr

0.2%
7k
Topic modeling with transformers and c-TF-IDF
Created 5 years ago
Updated 1 month ago
Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Stas Bekman Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and
11 more.

pattern by clips

0.0%
9k
Python web mining module
Created 14 years ago
Updated 1 year ago
Feedback? Help us improve.