  1. Nov 25, 2019
    • Resnet50 early-exit update · 8b341593
      Neta Zmora authored
      Update the definition of the exits using info from Haim.
      
      This is still very unsatisfactory because we don't have working
      examples to show users :-(
  2. Nov 16, 2019
    • Add README.md files for APG and DropFilter · 70e26735
      Neta Zmora authored
    • Remove duplicate YAML file · 49933144
      Neta Zmora authored
    • Cifar models: remove explicit parameters initialization · 6913687f
      Neta Zmora authored
      Except for VGG, our parameter initialization code matched the
      default PyTorch initialization (per torch.nn operation), so
      writing the initialization ourselves only added code and
      maintenance burden, and meant we would not benefit from
      improvements made at the PyTorch level (e.g. if FB finds a
      better initialization for nn.Conv2d than today's Kaiming init,
      we would not pick it up).
      The VGG initialization we had was "suspicious", so reverting
      to the default seems reasonable.
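      As an illustration (not from the commit), a minimal sketch of the
      kind of hand-rolled loop this change deletes; PyTorch's
      nn.Conv2d.reset_parameters() already applies Kaiming
      initialization by default, so code like this is redundant:

      import math
      import torch.nn as nn

      def redundant_init(model):
          # Duplicates what nn.Conv2d already does in reset_parameters()
          # (kaiming_uniform_ with a=sqrt(5)), so it can simply be removed.
          for m in model.modules():
              if isinstance(m, nn.Conv2d):
                  nn.init.kaiming_uniform_(m.weight, a=math.sqrt(5))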
  3. Nov 14, 2019
    • Update required PyTorch version check · fbdbe35a
      Guy Jacob authored
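      The commit message carries no detail; a hypothetical sketch of a
      minimum-version guard of this kind (threshold and message are
      assumptions, not Distiller's actual code):

      import torch
      from distutils.version import LooseVersion

      # Hypothetical guard; the real check and threshold may differ.
      if LooseVersion(torch.__version__) < LooseVersion('1.3.0'):
          raise RuntimeError('PyTorch 1.3.0 or later is required')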
    • PyTorch 1.3 Support (#426) · b8b4cf32
      Guy Jacob authored
      * summary_graph.py:
        * Change ONNX op.uniqueName() to op.debugName()
        * Removed scope-naming workaround which isn't needed in PyTorch 1.3
      * Tests:
        * Trace entry naming changed in 1.3; fixed the SummaryGraph
          unit test that relied on the old names
        * Adjusted expected values in full_flow_tests
        * Adjusted tolerance in test_sim_bn_fold
        * Filter some new warnings
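      A minimal sketch of the uniqueName()-to-debugName() rename,
      assuming a traced torchvision model (the fallback keeps pre-1.3
      versions working; this is illustrative, not the commit's code):

      import torch
      import torchvision

      model = torchvision.models.resnet18().eval()
      traced = torch.jit.trace(model, torch.randn(1, 3, 224, 224))
      for node in traced.graph.nodes():
          for out in node.outputs():
              # PyTorch 1.3 renamed Value.uniqueName() to Value.debugName().
              name_fn = getattr(out, 'debugName', None) or getattr(out, 'uniqueName')
              print(node.kind(), name_fn())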
  4. Nov 13, 2019
    • enable load_checkpoint when chkpt_file has an initial component of ~ · 80cab7e6
      Neta Zmora authored
      Prevent exception when loading checkpoints from a home directory
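      The fix amounts to expanding '~' before opening the file; a
      minimal sketch (the 'state_dict' key is an assumption about the
      checkpoint layout, not necessarily Distiller's):

      import os
      import torch

      def load_checkpoint(model, chkpt_file):
          # Expand '~'/'~user' so '~/ckpts/best.pth.tar' resolves to the
          # home directory instead of raising FileNotFoundError.
          chkpt_file = os.path.expanduser(chkpt_file)
          checkpoint = torch.load(chkpt_file, map_location='cpu')
          model.load_state_dict(checkpoint['state_dict'])
          return model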
    • Distiller versioning - fix dual version strings · cadd07eb
      Neta Zmora authored
      Two strings represented the library version: one in distiller.__init__.py
      and one in setup.py.
      This could lead to two different version values.
      The fix: have distiller.__init__.py read the version string from the
      package installation.
      This assumes that we've installed distiller properly, but we've been
      making this assumption for a long time in our code (e.g. how we do
      imports of distiller from the `tests` directory).
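      A sketch of the single-source approach using setuptools'
      pkg_resources (illustrative; the actual code may differ):

      # distiller/__init__.py (sketch)
      import pkg_resources

      try:
          # Read the version recorded at install time, so setup.py is
          # the only place that defines it.
          __version__ = pkg_resources.get_distribution('distiller').version
      except pkg_resources.DistributionNotFound:
          __version__ = 'unknown'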
    • image_classifier.py: fix printout when using early-exit · 81d561e0
      Neta Zmora authored
      When performing early-exit validation, the validation loop prints
      an annoying and redundant log of the iteration number; remove it.
    • image_classifier.py: PTQ stats collection and eval in same run (#346) · fb98377e
      Bar authored
      * Previous implementation:
        * Stats collection required a separate run with `--qe-calibration`.
        * Specifying `--quantize-eval` without `--qe-stats-file` triggered
          dynamic quantization.
        * Running with `--quantize-eval --qe-calibration <num>` only ran
          stats collection and ignored `--quantize-eval`.
      
      * New implementation:
        * Running `--quantize-eval --qe-calibration <num>` will now 
          perform stats collection according to the calibration flag,
          and then quantize the model with the collected stats (and
          run evaluation).
        * Specifying `--quantize-eval` without `--qe-stats-file` will
          trigger the same flow as in the bullet above, as if 
          `--qe-calibration 0.05` was used (i.e. 5% of the test set will
          be used for stats).
        * Added new flag: `--qe-dynamic`. From now on, dynamic
          quantization must be requested explicitly:
          `--quantize-eval --qe-dynamic`
        * As before, can still run `--qe-calibration` without 
          `--quantize-eval` to perform "stand-alone" stats collection
        * The following flags, which all represent different ways to
          control creation of stats or use of existing stats, are now
          mutually exclusive:
          `--qe-calibration`, `--qe-stats-file`, `--qe-dynamic`,
          `--qe-config-file`
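      A minimal argparse sketch of the mutual exclusion described above
      (flag names are from the commit; types, defaults and metavars are
      assumptions, and the real Distiller code may differ):

      import argparse

      parser = argparse.ArgumentParser()
      parser.add_argument('--quantize-eval', action='store_true')
      # The different ways to create or consume stats cannot be combined:
      group = parser.add_mutually_exclusive_group()
      group.add_argument('--qe-calibration', type=float, metavar='PORTION')
      group.add_argument('--qe-stats-file', type=str)
      group.add_argument('--qe-dynamic', action='store_true')
      group.add_argument('--qe-config-file', type=str)

      args = parser.parse_args(['--quantize-eval', '--qe-calibration', '0.05'])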
  5. Nov 11, 2019
    • Pruning with virtual Batch-norm statistics folding (#415) · c849a25f
      Neta Zmora authored
      * pruning: add an option to virtually fold BN into Conv2D for ranking
      
      PruningPolicy can be configured using a new control argument,
      fold_batchnorm: when set to `True`, the weights of BatchNorm
      modules are folded into the weights of Conv2d modules (where
      Conv2d->BN edges exist in the model graph). Each weight filter is
      attenuated using a different pair of (gamma, beta) coefficients,
      so `fold_batchnorm` is relevant for fine-grained and
      filter-ranking pruning methods. We attenuate using the running
      values of the mean and variance, as is done in quantization (a
      sketch of the scaling follows the YAML example below).
      This control argument is only supported for Conv2d modules (i.e.
      other convolution variants and Linear operations are not
      supported).
      e.g.:
      policies:
        - pruner:
            instance_name : low_pruner
            args:
              fold_batchnorm: True
          starting_epoch: 0
          ending_epoch: 30
          frequency: 2
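
      A sketch of the scaling itself, assuming the standard
      quantization-style folding formula (gamma / sqrt(running_var +
      eps) per output filter); the helper below is hypothetical, not
      the actual Distiller code:

      import torch

      def fold_bn_scale(conv_weight, bn):
          # conv_weight: (out_ch, in_ch, kH, kW); bn: the BatchNorm2d
          # that follows the conv. Scale each output filter by
          # gamma / sqrt(running_var + eps), using BN's running
          # statistics, so ranking sees effective post-BN magnitudes.
          scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
          return conv_weight * scale.view(-1, 1, 1, 1)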
      
      * AGP: non-functional refactoring
      
      distiller/pruning/automated_gradual_pruner.py – change `prune_to_target_sparsity`
      to `_set_param_mask_by_sparsity_target`, which is a more appropriate function
      name as we don’t really prune in this function
      
      * Simplify GEMM weights input-channel ranking logic
      
      Ranking weight-matrices by input channels is similar to ranking 4D
      Conv weights by input channels, so there is no need for duplicate logic.
      
      distiller/pruning/ranked_structures_pruner.py
      - change `prune_to_target_sparsity` to `_set_param_mask_by_sparsity_target`,
        which is a more appropriate function name as we don't really prune in
        this function
      - remove the code handling ranking of matrix rows
      
      distiller/norms.py – remove rank_cols.
      
      distiller/thresholding.py – in expand_binary_map treat `channels` group_type
      the same as the `cols` group_type when dealing with 2D weights
      
      * AGP: add example of ranking filters with virtual BN-folding
      
      Also update resnet20 AGP examples