Discover and explore top open-source AI tools and projects—updated daily.
ICLR 2021 research paper on aligning AI with human values
Top 89.4% on SourcePulse
This repository provides the ETHICS benchmark dataset and fine-tuning scripts for evaluating AI alignment with human values across five ethical frameworks: Justice, Deontology, Virtue Ethics, Utilitarianism, and Commonsense. It targets AI researchers and developers seeking to measure and improve the ethical reasoning capabilities of their models.
How It Works
The project offers a benchmark dataset designed to test AI models on various ethical scenarios. It includes fine-tuning scripts for popular transformer models (e.g., BERT, RoBERTa, ALBERT) to adapt them to the benchmark tasks. The core approach involves evaluating model performance on specific ethical dimensions, enabling comparative analysis and identification of areas for improvement in AI ethical alignment.
Quick Start & Requirements
pip install -r requirements.txt
(specific installation commands for fine-tuning scripts are within subfolders).Highlighted Details
Maintenance & Community
The project is associated with ICLR 2021 and its authors are prominent researchers in AI safety and ethics. There is no explicit mention of ongoing maintenance or community channels like Discord/Slack.
Licensing & Compatibility
The repository does not explicitly state a license. The dataset is available for research purposes.
Limitations & Caveats
The project does not specify a license, which may impact commercial use or integration into closed-source projects. Ongoing maintenance and community support are not detailed.
2 years ago
1 day