Skip to content
Snippets Groups Projects
Commit b41c4d2d authored by Neta Zmora's avatar Neta Zmora
Browse files

apputils/checkpoint.py: load_checkpoint can be called w/o specifying the model

This is inspired by @barrh’s PR https://github.com/NervanaSystems/distiller/pull/246
but it at a “slower-integration-pace” and w/o changing APIs.

1. create_model() attaches model attributes (arch, dataset, is_parallel) to created models.
2. save_checkpoint() stores the new model attributes with checkpoint metadata
3. load_checkpoint() can be invoked with model=None, in which case we attempt
to create the model from the stored checkpoint metadata.
parent 99124355
No related branches found
No related tags found
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment