MentaLLaMA by SteveKGYang

Open-source LLM for interpretable mental health analysis

created 1 year ago
275 stars

Top 94.9% on sourcepulse

Project Summary

MentaLLaMA provides open-source instruction-following large language models for interpretable mental health analysis on social media. It targets researchers and developers needing to analyze mental health discourse and generate explanations, offering a novel dataset and benchmark for this specialized domain.

How It Works

MentaLLaMA is built upon LLaMA and Vicuna foundation models, fine-tuned on the Interpretable Mental Health Instruction (IMHI) dataset. This dataset comprises 105K instruction samples across 8 mental health analysis tasks derived from public social media data. The models are designed to follow instructions for mental health analysis and provide high-quality, interpretable explanations for their predictions.
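
For intuition, an IMHI-style instance pairs a social media post with a task instruction and asks for a label plus an explanation. The prompt below is a minimal illustrative sketch; the exact wording of the IMHI templates is defined in the dataset itself:

    # Hypothetical prompt in the spirit of an IMHI instruction (wording is illustrative).
    post = "I have not slept properly in weeks and nothing feels worth doing."
    prompt = (
        f'Consider this post: "{post}" '
        "Question: Does the poster suffer from depression? "
        "Give a label and explain your reasoning."
    )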

Quick Start & Requirements

  • Installation: Use the Hugging Face Transformers library.
  • Dependencies: Python, PyTorch, Transformers, and PEFT (for the LoRA model). A GPU is recommended for inference.
  • Model Loading (example; a fuller inference sketch follows this list):
    from transformers import LlamaTokenizer, LlamaForCausalLM

    # Hub ID or local path of a released checkpoint, e.g. MentaLLaMA-chat-7B.
    MODEL_PATH = 'klyang/MentaLLaMA-chat-7B'
    tokenizer = LlamaTokenizer.from_pretrained(MODEL_PATH)
    model = LlamaForCausalLM.from_pretrained(MODEL_PATH, device_map='auto')

  • MentaLLaMA-33B-lora: Requires downloading Vicuna-33B separately and placing it in ./vicuna-33B.
  • Resources: Larger models (13B, 33B) require significant VRAM.
  • Docs: MentaLLaMA Paper, Evaluation Paper
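
Putting these pieces together, a minimal end-to-end inference sketch might look like the following. The model ID, prompt wording, and generation settings are illustrative assumptions, not the repository's canonical pipeline:

    import torch
    from transformers import LlamaTokenizer, LlamaForCausalLM

    MODEL_PATH = 'klyang/MentaLLaMA-chat-7B'  # assumed hub ID; substitute your checkpoint
    tokenizer = LlamaTokenizer.from_pretrained(MODEL_PATH)
    model = LlamaForCausalLM.from_pretrained(
        MODEL_PATH, device_map='auto', torch_dtype=torch.float16
    )
    # For MentaLLaMA-33B-lora, load Vicuna-33B as the base model first and attach
    # the adapter weights, e.g. via peft.PeftModel.from_pretrained(base, adapter_path).

    prompt = (
        'Consider this post: "I cannot focus at work and I feel hopeless lately." '
        "Question: Does the poster suffer from depression? Explain your reasoning."
    )
    inputs = tokenizer(prompt, return_tensors='pt').to(model.device)
    output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
    # Decode only the newly generated tokens, skipping the echoed prompt.
    print(tokenizer.decode(output[0][inputs['input_ids'].shape[1]:], skip_special_tokens=True))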

Highlighted Details

  • Offers 5 model checkpoints: MentaLLaMA-33B-lora, MentaLLaMA-chat-13B, MentaLLaMA-chat-7B, MentalBART, MentalT5.
  • Includes the IMHI dataset (105K instruction samples) and a benchmark (19K test samples) for interpretable mental health analysis.
  • Provides 10 pre-trained classifiers (based on MentalBERT) for evaluating model output correctness on the IMHI benchmark.
  • Supports evaluation of explanation quality with BART-score (see the sketch after this list).
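
As a sketch of how explanation quality could be scored, the snippet below uses the reference BARTScore implementation from the neulab/BARTScore repository; the BARTScorer interface and checkpoint shown are assumptions based on that repo, and this project's own evaluation scripts may configure scoring differently:

    from bart_score import BARTScorer  # from the neulab/BARTScore repository

    scorer = BARTScorer(device='cuda:0', checkpoint='facebook/bart-large-cnn')
    golds = ["The poster likely suffers from depression; they describe persistent hopelessness."]
    preds = ["The post suggests depression, given the expressed hopelessness and loss of interest."]
    # score() returns per-pair log-likelihoods; higher (less negative) is better.
    print(scorer.score(golds, preds, batch_size=4))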

Maintenance & Community

  • Most recent updates: March 2024 (test data release) and February 2024 (paper acceptance).
  • Primary contributors are affiliated with the National Centre for Text Mining and The University of Manchester.

Licensing & Compatibility

  • Licensed under the MIT License.
  • Permits commercial use and linking with closed-source projects.

Limitations & Caveats

  • The project is strictly for non-clinical research; it does not provide diagnosis or advice.
  • Users assume all risk; authors disclaim responsibility for errors or consequences.
  • LLMs may introduce biases, incorrect predictions, or inappropriate explanations, posing challenges for real-world deployment.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 17 stars in the last 90 days
