Targeted-Dropout  by Cohere-Labs-Community

Code for a targeted dropout research paper

created 6 years ago
255 stars

Top 99.2% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides complementary code for the Targeted Dropout paper, enabling researchers and practitioners to implement and experiment with a novel dropout technique for neural networks. The primary benefit is improved model generalization and robustness through more efficient regularization.

How It Works

Targeted Dropout introduces a method to selectively drop units based on their importance, aiming to improve generalization and reduce overfitting. The implementation likely involves modifications to standard dropout layers to incorporate this targeted selection mechanism, potentially leading to more efficient training and better performance on downstream tasks.

Quick Start & Requirements

  • Primary install / run command: python -m TD.train --hparams=resnet_default
  • Prerequisites: Python 3, Tensorflow 1.8.
  • The project supports different environments via the --env flag (local, gcp, tpu).
  • Hparams can be specified or overridden using --hparams and --hparam_override flags.

Highlighted Details

  • Code supports training and pruning models.
  • Hparam sets like resnet_default are available for quick experimentation.
  • Environment flags allow for flexible deployment on local machines, GCP, or TPUs.

Maintenance & Community

No specific community channels or maintenance details are provided in the README.

Licensing & Compatibility

The license is not specified in the README.

Limitations & Caveats

The project requires Tensorflow 1.8, which is an older version and may present compatibility challenges with modern Python environments or other libraries.

Health Check
Last commit

5 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

HALOs by ContextualAI

0.2%
873
Library for aligning LLMs using human-aware loss functions
created 1 year ago
updated 2 weeks ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Travis Fischer Travis Fischer(Founder of Agentic), and
1 more.

fine-tune-mistral by abacaj

0.3%
716
Fine-tuning script for Mistral-7B
created 1 year ago
updated 1 year ago
Feedback? Help us improve.