StarWhisper  by Yu-Yang-Li

LLM for astronomy research

created 2 years ago
298 stars

Top 90.1% on sourcepulse

Project Summary

StarWhisper is a series of large language models (LLMs) tailored for astronomy. The models range from 7B to 72B parameters and cover language, time-series, and multimodal tasks. Developed with support from the National Astronomical Observatories and Zhijiang Laboratory, the project aims to serve as an AI tool for astronomical data processing, particularly for projects such as the SITIAN survey, by integrating astronomical knowledge and exploring multimodal solutions to specific problems.

How It Works

The project uses a data flywheel approach: training methods are refined on cleaned and corrected scientific and popular-science data to improve the models' astrophysics, coding, and agent capabilities. It has released technical reports on three specialized models: StarWhisper Pulsar, which uses multimodal LLMs for state-of-the-art pulsar identification; StarWhisper LC, which classifies light curves via transfer learning and LLMs; and StarWhisper Telescope, which uses LLM agents for telescope control workflows and has been applied to the SITIAN project.

Quick Start & Requirements

  • Weights for StarWhisper 4.0 are planned for release on the "ModelScope" platform.
  • The training dataset for StarWhisper 3 is available in the LLM_Data directory.
  • Code related to published papers is available for testing.
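
Because the summary does not say what file formats the LLM_Data directory contains, a reasonable first step after cloning is simply to take an inventory of it. The following is a minimal sketch, not part of the project: the helper name `summarize_data_dir` and the local path `StarWhisper/LLM_Data` are assumptions for illustration, and the code makes no assumption about the dataset's structure.

```python
from collections import Counter
from pathlib import Path


def summarize_data_dir(root):
    """Count files under `root` by lowercased extension ('' if none).

    Hypothetical helper for illustration; not part of StarWhisper.
    """
    root = Path(root)
    if not root.is_dir():
        raise FileNotFoundError(f"not a directory: {root}")
    counts = Counter()
    # rglob("*") walks the directory tree recursively.
    for path in root.rglob("*"):
        if path.is_file():
            counts[path.suffix.lower()] += 1
    return counts


if __name__ == "__main__":
    # Assumed location of a local clone; adjust to your checkout.
    data_dir = Path("StarWhisper/LLM_Data")
    if data_dir.is_dir():
        for ext, n in sorted(summarize_data_dir(data_dir).items()):
            print(f"{ext or '(no extension)'}: {n}")
    else:
        print(f"{data_dir} not found; clone the repository first")
```

Knowing which file types are present helps decide which loader to reach for before any training or evaluation code is written.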

Highlighted Details

  • StarWhisper Telescope is an agent-based observation assistant system designed to function as an AI astrophysicist.
  • The project has developed a multimodal LLM for pulsar identification and a method for light curve classification.
  • It aims to improve the models' agent capabilities for astronomy and to integrate with professional astronomy tools.

Maintenance & Community

  • The project has multiple authors and has published several technical reports and a paper on arXiv.
  • Further details on community engagement or roadmaps are not explicitly provided in the README.

Licensing & Compatibility

  • Project source code is licensed under Apache-2.0.
  • Use of the Qwen Chat model weights is subject to their respective licenses.

Limitations & Caveats

The project is actively under development, with a to-do list including further fine-tuning for scientific data, reinforcement learning from human feedback, and the development of an astronomical knowledge graph to mitigate hallucinations. Open-sourcing of multimodal fine-tuning weights is pending.

Health Check

  • Last commit: 3 weeks ago
  • Responsiveness: 1 week
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 15 stars in the last 90 days
