pywhyllm by py-why

LLMs enhance causal analysis workflows

Created 2 years ago

279 stars

Top 93.2% on SourcePulse

Project Summary

PyWhy-LLM is an experimental Python library designed to integrate Large Language Models (LLMs) into causal analysis workflows. It aims to augment human expertise by providing LLM-powered insights, bridging knowledge gaps typically filled by domain experts, and enhancing the capabilities of the DoWhy ecosystem.

How It Works

The library uses LLMs such as GPT-4 to suggest key steps in causal inference. Its modules suggest relevant areas of domain expertise, potential confounders, causal relationships (DAGs), backdoor sets, mediators, and instrumental variables. A Retrieval-Augmented Generation (RAG) component built on CauseNet grounds relationship suggestions in external knowledge. Together these are intended to streamline causal discovery and identification.
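
As a rough illustration of this workflow, the sketch below collects pairwise cause-effect suggestions into a directed graph, checks that the graph is acyclic, and reads off candidate confounders as direct common causes of the treatment and outcome. It does not use PyWhy-LLM's API: `stub_llm_suggest_edge` is a hypothetical stand-in for an LLM call, and the hard-coded answers are illustrative assumptions only.

```python
from itertools import permutations

# Illustrative, hard-coded "answers"; a real workflow would query an LLM instead.
ASSUMED_ANSWERS = {
    ("age", "smoking"),
    ("age", "lung cancer"),
    ("smoking", "lung cancer"),
    ("exercise", "lung cancer"),
}


def stub_llm_suggest_edge(cause, effect):
    """Hypothetical stand-in for asking an LLM whether `cause` affects `effect`."""
    return (cause, effect) in ASSUMED_ANSWERS


def suggest_graph(factors):
    """Query every ordered pair of factors and collect the suggested edges."""
    return {(a, b) for a, b in permutations(factors, 2) if stub_llm_suggest_edge(a, b)}


def is_acyclic(factors, edges):
    """Kahn's algorithm: the graph is a DAG iff every node can be ordered."""
    indegree = {f: 0 for f in factors}
    for _, effect in edges:
        indegree[effect] += 1
    ready = [f for f, degree in indegree.items() if degree == 0]
    visited = 0
    while ready:
        node = ready.pop()
        visited += 1
        for cause, effect in edges:
            if cause == node:
                indegree[effect] -= 1
                if indegree[effect] == 0:
                    ready.append(effect)
    return visited == len(factors)


def common_causes(edges, treatment, outcome):
    """Direct parents shared by treatment and outcome (a simplified confounder check)."""
    parents_of_treatment = {c for c, e in edges if e == treatment}
    parents_of_outcome = {c for c, e in edges if e == outcome}
    return parents_of_treatment & parents_of_outcome


factors = ["age", "smoking", "exercise", "lung cancer"]
edges = suggest_graph(factors)
dag_ok = is_acyclic(factors, edges)
confounders = common_causes(edges, "smoking", "lung cancer")
```

The acyclicity check matters because independent pairwise answers from a model can contradict each other (A causes B and B causes A), and such a graph cannot be used as a causal DAG without human review.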

Quick Start & Requirements

Install with pip: pip install pywhyllm. The library calls hosted LLM APIs (e.g., GPT-4), so an API key and per-use costs are likely involved. The provided README does not state supported Python versions or hardware requirements such as GPUs. Detailed usage is covered in the "Walkthrough Notebook" and "Examples Notebook".
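
Since an API key is probably needed, a script can fail early with a clear message when the credential is missing. The sketch below is an assumption-laden example, not part of PyWhy-LLM: `require_api_key` is a hypothetical helper, and `OPENAI_API_KEY` is the variable the official OpenAI Python client reads by default; the summary above does not say which variable, if any, PyWhy-LLM itself expects.

```python
import os


def require_api_key(env_var="OPENAI_API_KEY"):
    """Return the key stored in `env_var`, or raise a clear error if it is unset.

    The default variable name is an assumption based on the OpenAI client.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable before running LLM-backed steps."
        )
    return key
```

Calling `require_api_key()` at the top of a notebook or script surfaces a configuration problem before any billable request is attempted.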

Highlighted Details

Seamless integration of LLM capabilities into established causal inference pipelines.
Automated suggestion of key causal components: confounders, mediators, IVs, and graph structures.
RAG-based modeler for enhanced relationship discovery using CauseNet.
Part of the broader DoWhy community, fostering ecosystem compatibility.
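
The RAG idea mentioned in the list above can be sketched in a few lines: retrieve known cause-effect facts that mention the variables of interest, then place them in the prompt so the model's answer is grounded in them. This is a minimal sketch under stated assumptions, not the library's implementation: `KNOWLEDGE_BASE`, `retrieve`, and `build_prompt` are hypothetical names, and the stored pairs are illustrative placeholders rather than actual CauseNet records.

```python
# Tiny in-memory stand-in for a causal knowledge base such as CauseNet.
# The pairs are illustrative assumptions, not real CauseNet entries.
KNOWLEDGE_BASE = [
    ("smoking", "lung cancer"),
    ("smoking", "heart disease"),
    ("air pollution", "asthma"),
]


def retrieve(var_a, var_b, kb=KNOWLEDGE_BASE):
    """Return stored (cause, effect) pairs that mention either variable."""
    wanted = {var_a.lower(), var_b.lower()}
    return [(cause, effect) for cause, effect in kb if cause in wanted or effect in wanted]


def build_prompt(var_a, var_b, kb=KNOWLEDGE_BASE):
    """Build a prompt that lists the retrieved facts before the question."""
    facts = retrieve(var_a, var_b, kb)
    lines = [f"- {cause} causes {effect}" for cause, effect in facts]
    if not lines:
        lines = ["- (no matching facts found)"]
    question = (
        f"Question: does {var_a} cause {var_b}, "
        f"does {var_b} cause {var_a}, or neither?"
    )
    return "Known causal facts:\n" + "\n".join(lines) + "\n\n" + question


prompt = build_prompt("smoking", "lung cancer")
```

The resulting `prompt` string would then be sent to whichever LLM is in use; that call is omitted here because it depends on the provider and credentials.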

Maintenance & Community

Contributions are welcomed, with guidelines provided in CONTRIBUTING.md. Users can report issues or make requests by raising an issue on the project's repository. A Code of Conduct is also available.

Licensing & Compatibility

The provided README does not specify a software license. Until the license is confirmed from the repository, the project's suitability for commercial use or derivative works is unknown.

Limitations & Caveats

As an experimental library, PyWhy-LLM may be subject to changes, bugs, or incomplete features. Its functionality is dependent on the performance and availability of external LLM services, introducing potential costs and reliability concerns. The effectiveness of suggestions relies heavily on the quality of the underlying LLM and the specific causal problem context.

Explore Similar Projects

LM-reasoning by jeffhj

MindMap by wyl-willing

MedAgents by gersteinlab

arXausality by logangraham

LLM-based-causal-discovery by WXY604

CausalNLP_Papers by zhijing-jin

causal-text-papers by causaltext

Generalization-Causality by yfzhang114

awesome-llm-interpretability by JShollaj

Chain-of-ThoughtsPapers by Timothyxxx

chain-of-thought-hub by FranxYao

pythia by EleutherAI