You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the feature and the current behavior/state.
A learning rate schedule implementing linear warmup with step decay, currently unavailable in tensorflow
Relevant information
Are you willing to contribute it (yes/no): Yes
Are you willing to maintain it going forward? (yes/no): Yes
Which API type would this fall under (layer, metric, optimizer, etc.)
Learning rate schedule
Who will benefit with this feature?
Considerable number of papers use warmup strategies, retinanet, efficientdet,
Users training on cloud tpus which need a high learning rate due to the large batch size, starting with a linear warmup is often helpful to achieve convergence sooner
Any other info.
Here is a prelimary code implementation, please let me know if this is something that can find a place in tf-addons.
TensorFlow Addons is transitioning to a minimal maintenance and release mode. New features will not be added to this repository. For more information, please see our public messaging on this decision: TensorFlow Addons Wind Down
Please consider sending feature requests / contributions to other repositories in the TF community with a similar charters to TFA: Keras Keras-CV Keras-NLP
Describe the feature and the current behavior/state.
A learning rate schedule implementing linear warmup with step decay, currently unavailable in tensorflow
Relevant information
Which API type would this fall under (layer, metric, optimizer, etc.)
Learning rate schedule
Who will benefit with this feature?
Any other info.
Here is a prelimary code implementation, please let me know if this is something that can find a place in tf-addons.
Possible extension
We could convert this into a generalized wrapper for all the existing lr schedules, hence adding the warmup functionality to them.
The text was updated successfully, but these errors were encountered: