gluonnlp.initializer
This page describes initializers that are useful for multiple NLP model architectures.
Highway Bias Initializer
This module provides the Highway bias initializer described in the following work.
@inproceedings{srivastava2015training,
title={Training very deep networks},
author={Srivastava, Rupesh K and Greff, Klaus and Schmidhuber, J{\"u}rgen},
booktitle={Advances in neural information processing systems},
pages={2377--2385},
year={2015}}
Initialize all biases of a Highway layer by setting the biases of the nonlinear transform and the transform gate differently.
API Reference
NLP initializer.
class gluonnlp.initializer.HighwayBias(nonlinear_transform_bias=0.0, transform_gate_bias=-2.0, **kwargs)
Initialize all biases of a Highway layer by setting the biases of the nonlinear transform and the transform gate differently. The two sets of biases have the same dimension, \(arr.shape[0]/2\), where \(arr\) is the bias tensor.
The definition of the biases follows the work:
@inproceedings{srivastava2015training,
title={Training very deep networks},
author={Srivastava, Rupesh K and Greff, Klaus and Schmidhuber, J{\"u}rgen},
booktitle={Advances in neural information processing systems},
pages={2377--2385},
year={2015}}
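To illustrate the layout described above, here is a minimal pure-Python sketch of how a bias vector of length \(arr.shape[0]\) might be filled. It is not the gluonnlp implementation: the function name `highway_bias_sketch` is hypothetical, and the ordering (nonlinear transform biases in the first half, transform gate biases in the second half) and the even-length check are assumptions made for the example.

```python
import math


def highway_bias_sketch(size, nonlinear_transform_bias=0.0, transform_gate_bias=-2.0):
    """Hypothetical stand-in that mimics the documented bias layout.

    Assumption: the first half of the vector holds the nonlinear transform
    biases and the second half holds the transform gate biases.
    """
    if size % 2 != 0:
        # Assumption for this sketch: both halves must have equal length.
        raise ValueError("expected an even-length bias vector")
    half = size // 2
    return [nonlinear_transform_bias] * half + [transform_gate_bias] * half


bias = highway_bias_sketch(8)
half = len(bias) // 2

# Sigmoid of the gate biases alone, i.e. the gate value when the
# gate's weighted input is zero.
gate_at_init = [1.0 / (1.0 + math.exp(-b)) for b in bias[half:]]

print(bias)
print(gate_at_init)
```

With the default `transform_gate_bias=-2.0`, the sigmoid of the gate bias is roughly 0.12, so a gate whose weighted input is near zero starts out mostly passing the layer input through unchanged, which matches the negative gate bias suggested in the cited work.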