multiwoz by budzianowski

Dataset for task-oriented dialogue modeling

Created 7 years ago

938 stars

Top 39.0% on SourcePulse

Project Summary

This repository provides the MultiWOZ dataset, a large-scale, human-human conversation corpus for task-oriented dialogue systems, along with baseline implementations and evaluation scripts. It is designed for researchers and developers working on dialogue state tracking and response generation.

How It Works

The project centers around the MultiWOZ dataset, which comprises over 10,000 dialogues across multiple domains, annotated with goals, utterances, and belief states. It supports end-to-end dialogue modeling and dialogue state tracking, offering various dataset versions (2.0, 2.1, 2.2) with corrections and improvements. The code includes preprocessing scripts and baseline models for training and evaluation.

Quick Start & Requirements

Install: Requires Python 2 with pip.
Dependencies: PyTorch 0.4.1.
Preprocessing: Run python create_delex_data.py.
Training: Run python train.py [--args=value].
Testing: Run python test.py [--args=value].
Dataset Access: Can be loaded through DialogStudio.

Highlighted Details

Comprehensive benchmarks for Dialogue State Tracking (DST) and Response Generation are provided, with results on MultiWOZ 2.0, 2.1, and 2.2.
Includes detailed hyperparameter settings for baseline models, enabling reproducibility.
Supports both end-to-end models and policy optimization models for response generation.
Offers bibtex citations for the dataset and related papers.

Maintenance & Community

The project was initiated by Paweł Budzianowski from Cambridge Dialogue Systems Group. Bug reports can be sent to budzianowski@gmail.com or jianguozhang@salesforce.com.

Licensing & Compatibility

Released under the MIT License, allowing for open-source use and modification.

Limitations & Caveats

The baseline code is specified for Python 2 and an older version of PyTorch (0.4.1), which may require significant adaptation for modern Python 3 environments. Some older benchmark results might not be directly comparable due to evaluation script inconsistencies.

multiwoz by budzianowski

Explore Similar Projects

WavChat by jishengpeng

BabelDuck by Orenoid

opensource_notebooklm by satvik314

ToD-BERT by jasonwu0731

dialogbot by shibing624

locomo by snap-research

UltraChat by thunlp

ConvLab-2 by thu-coai

Paper-Reading-ConvAI by iwangjian

mindmeld by cisco

dia by nari-labs

ChatTTS by 2noise