weightnorm by openai

Example code for weight normalization research paper

created 8 years ago
364 stars

Top 78.4% on sourcepulse

Project Summary

This repository provides example implementations of Weight Normalization, a technique to accelerate deep neural network training. It targets researchers and practitioners working with deep learning frameworks like Theano (via Lasagne), TensorFlow, and Keras. The primary benefit is faster convergence and improved training stability.

How It Works

Weight Normalization reparameterizes each weight vector w as w = g · v/‖v‖, where the vector v determines the direction and the scalar g the magnitude. Decoupling magnitude from direction improves the conditioning of the gradients, leading to faster convergence than the standard parameterization.
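The reparameterization above can be sketched in a few lines of NumPy (a minimal illustration of the formula from the paper, not code taken from this repository):

```python
import numpy as np

def weight_norm(v, g):
    """Weight Normalization: w = g * v / ||v||.

    v sets the direction of the weight vector and the scalar g its
    magnitude, so gradient descent can adjust the two independently.
    """
    return g * v / np.linalg.norm(v)

# Hypothetical example: one 4-dimensional weight vector.
v = np.array([3.0, 0.0, 4.0, 0.0])  # direction parameter (||v|| = 5)
g = 2.0                              # magnitude parameter

w = weight_norm(v, g)
print(np.linalg.norm(w))  # the norm of w equals g, regardless of ||v||
```

Note that rescaling v leaves w unchanged; only its direction matters, which is exactly the decoupling the technique exploits.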

Quick Start & Requirements

  • Lasagne/Theano example: requires Lasagne and Theano.
  • TensorFlow example: requires TensorFlow.
  • Keras example: requires Keras.
  • The Lasagne/Theano code is the version used for the CIFAR-10 experiments in the paper.

Highlighted Details

  • Implements Weight Normalization as described in the Salimans & Kingma (2016) paper.
  • Includes example code for Lasagne (Theano), TensorFlow, and Keras.
  • Code was used for CIFAR-10 experiments in the original paper.

Maintenance & Community

Status: Archive (code is provided as-is, no updates expected).

Licensing & Compatibility

The repository does not explicitly state a license.

Limitations & Caveats

The code is archived and no longer maintained or updated. Compatibility with current versions of the underlying frameworks (Theano, TensorFlow, Keras) is not guaranteed.

Health Check

  • Last commit: 6 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 90 days
