realistic-ssl-evaluation  by brain-research

Evaluation benchmark suite for semi-supervised learning algorithms

created 7 years ago
460 stars

Top 66.8% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

This repository provides the evaluation benchmark suite for deep semi-supervised learning (SSL) algorithms, as detailed in the paper "Realistic Evaluation of Deep Semi-Supervised Learning Algorithms." It is intended for researchers and practitioners in machine learning who need to rigorously evaluate and compare SSL methods.

How It Works

The suite automates the download, preprocessing, and splitting of datasets (CIFAR-10, SVHN, ImageNet 32x32) into labeled and unlabeled subsets using configurable label maps. It then facilitates running and evaluating various SSL algorithms, including VAT, by leveraging tmuxp for managing training and evaluation processes, ensuring a structured and reproducible experimental setup.

Quick Start & Requirements

  • Install dependencies: pip3 install -r requirements.txt
  • Datasets: CIFAR-10 and SVHN are automatically downloaded. ImageNet 32x32 requires manual download and conversion.
  • Experiment execution: Use tmuxp load <.yml file> or directly run provided Python scripts.
  • Requires TensorFlow and tmuxp.
  • Links: Paper

Highlighted Details

  • Provides scripts for dataset preparation and label map generation with reproducible random seeds.
  • Includes .yml configuration files for running experiments as described in the paper.
  • Offers functionality to simulate small validation sets for robust evaluation.
  • Supports multiple SSL methods and dataset configurations.

Maintenance & Community

This project is an open-source release associated with a specific research paper. No active community channels or ongoing maintenance efforts are indicated.

Licensing & Compatibility

The repository does not explicitly state a license. It is presented as an open-source release for research purposes. Commercial use or linking with closed-source projects may require clarification.

Limitations & Caveats

Exact reproducibility of paper results may be affected by subtle differences in TensorFlow versions and random seeds. The project is not an official Google product.

Health Check
Last commit

6 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
10 more.

open-r1 by huggingface

0.2%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 3 days ago
Feedback? Help us improve.