Spatio-temporal LLM for urban prediction tasks (KDD'24 paper)
Top 76.3% on sourcepulse
UrbanGPT is a spatio-temporal large language model designed for urban computing tasks, targeting researchers and practitioners in AI and urban planning. It aims to enhance generalization capabilities in data-scarce environments by integrating spatio-temporal encoding with instruction tuning.
How It Works
UrbanGPT integrates a spatio-temporal dependency encoder (TCN-based) with an instruction-tuning paradigm. This approach allows LLMs to understand complex inter-dependencies across time and space, enabling more accurate predictions, particularly in zero-shot scenarios, by leveraging pre-trained spatio-temporal data.
Quick Start & Requirements
pip install -r requirements.txt
. Key dependencies include PyTorch (2.0.1+cu117), fschat
, torch_geometric
, deepspeed
, ray
, einops
, wandb
, and flash-attn==2.3.5
.nnodes=1 --nproc_per_node=8
).Highlighted Details
Maintenance & Community
The project is associated with the Data Intelligence Lab at the University of Hong Kong and South China University of Technology. Further community engagement details are not explicitly provided in the README.
Licensing & Compatibility
The repository does not explicitly state a license. The project acknowledges inspirations from Vicuna, GraphGPT, NExT-GPT, gradio, and Baize.
Limitations & Caveats
The README mentions potential version compatibility issues with flash-attn
and transformers
. The project is presented as a research artifact with a "TODO" list including releasing baseline codes.
3 months ago
1 week