Discover and explore top open-source AI tools and projects—updated daily.
karpathyVisualizing US job market data with LLM-driven insights
New!
Top 28.1% on SourcePulse
Summary
This repository provides a research tool for visually exploring US Bureau of Labor Statistics (BLS) Occupational Outlook Handbook data. It enables developers and researchers to interactively visualize job market trends, including novel metrics like AI exposure, by processing detailed occupation data and leveraging LLM-powered analysis. The project aims to offer a flexible development tool for exploring BLS data visually, rather than a formal economic publication.
How It Works
The project employs a multi-stage data pipeline: scraping raw BLS HTML, parsing it into clean Markdown, and tabulating structured statistics (pay, education, job count, growth) into occupations.csv. A core innovation is the score.py script, which utilizes LLMs (Gemini Flash via OpenRouter) to assign custom scores and rationales to each occupation based on user-defined prompts, such as estimating "Digital AI Exposure." This LLM-generated data is then merged with BLS statistics to power an interactive treemap visualization, allowing dynamic exploration of various job market facets.
Quick Start & Requirements
uv sync for dependencies and uv run playwright install chromium for browser drivers..env file (OPENROUTER_API_KEY=your_key_here).scrape.py), processing (process.py), CSV generation (make_csv.py), LLM scoring (score.py), building site data (build_site_data.py), and serving the site locally (cd site && python -m http.server 8000).karpathy.ai/jobs.Highlighted Details
Maintenance & Community
No specific details on maintenance, contributors, or community channels were found in the provided README.
Licensing & Compatibility
The README does not specify a software license. This omission requires clarification for any adoption decision, particularly regarding commercial use or derivative works.
Limitations & Caveats
3 weeks ago
Inactive
comet-ml