- Mar 29, 2019
-
-
Songyi Blair Han authored
-
- Mar 28, 2019
-
-
Lev Zlotnik authored
* Added distiller.utils.convert_recursively_to and replaced _treetuple2device in SummaryGraph with it.
* Renamed to convert_tensors_recursively_to.
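For orientation, here is a minimal sketch of what such a recursive conversion helper might look like (the exact implementation and signature in distiller.utils may differ):

```python
import torch

def convert_tensors_recursively_to(val, *args, **kwargs):
    """Apply `.to(*args, **kwargs)` to every tensor found in a (possibly
    nested) tuple/list structure, leaving non-tensor values untouched."""
    if isinstance(val, torch.Tensor):
        return val.to(*args, **kwargs)
    if isinstance(val, (tuple, list)):
        return type(val)(convert_tensors_recursively_to(v, *args, **kwargs) for v in val)
    return val

# Example: move a nested structure of tensors to the chosen device.
device = 'cuda' if torch.cuda.is_available() else 'cpu'
nested = (torch.zeros(2), [torch.ones(3), 'not-a-tensor'])
nested_on_device = convert_tensors_recursively_to(nested, device)
```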
-
- Mar 27, 2019
- Mar 26, 2019
-
-
Neta Zmora authored
Line 291 was duplicated, so one instance was removed. There is no functional effect (perhaps a very small performance improvement).
-
- Mar 25, 2019
-
-
Neta Zmora authored
Rewrote the splicing logic with simpler code.
-
Neta Zmora authored
- Fix the invocation of resnet50_earlyexit (missing 'pretrained' parameter).
- Remove all ResNet depths other than 50, to prevent confusion (these are currently not supported).
-
- Mar 23, 2019
-
-
Neta Zmora authored
This schedule demonstrates high-rate element-wise pruning (84.6% sparsity) of ResNet50. Top1 is 75.66 vs. the published Top1 of 76.15, i.e. a drop of about 0.5%.
-
Neta Zmora authored
This is an improved AGP schedule which generates a ResNet50 network that is 80% element-wise sparse, with a statistically insignificant drop in Top1 accuracy (-0.13%).
-
- Mar 21, 2019
-
-
Neta Zmora authored
This is AGP (automatic gradual pruning) for a pruner which chooses filters-to-prune by sampling a Bernoulli probability distribution.
-
- Mar 17, 2019
-
-
Neta Zmora authored
In several places we hit an error state and exited using exit() instead of raising a ValueError; this is now fixed.
-
Neta Zmora authored
Fixed the return value of GroupThresholdMixin.group_threshold_mask so that it returns only the mask in all cases. This code is _not_ under test at the moment, and changes to the pruning code (which also uses the thresholding code) led to this bug. We need to add tests for group-lasso regularization.
-
Neta Zmora authored
Replaced numpy operations with PyTorch operations (so that we can leverage the GPU).
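One illustrative example of this kind of replacement (not a quote from the actual diff): counting non-zero elements with PyTorch instead of numpy keeps the data on the GPU.

```python
import numpy as np
import torch

device = 'cuda' if torch.cuda.is_available() else 'cpu'
w = torch.randn(64, 3, 3, 3, device=device)

# numpy version: forces a copy back to the CPU before counting.
nnz_np = np.count_nonzero(w.cpu().numpy())

# PyTorch version: the count happens on the same device as the tensor.
nnz_torch = int((w != 0).sum())
assert nnz_np == nnz_torch
```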
-
Bar authored
Modify LpRankedStructureParameterPruner to log as the correct class name for both L1 and L2 pruners.
-
Neta Zmora authored
BernoulliFilterPruner: assigns a Bernoulli probability distribution to each of the filters.
RandomLevelStructureParameterPruner: assigns a Uniform probability distribution to the pruning level used by an L1-norm structure pruner.
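A rough sketch (not the actual Distiller pruner code) of how a Bernoulli-sampled filter mask can be built:

```python
import torch

def bernoulli_filter_mask(weights, desired_sparsity):
    """Sample a keep/drop decision per filter: each filter of a 4-D conv
    weight tensor is dropped with probability `desired_sparsity`."""
    num_filters = weights.size(0)
    drop = torch.bernoulli(torch.full((num_filters,), desired_sparsity))
    keep = 1.0 - drop
    # Broadcast the per-filter decision across the filter's channels/kernel.
    return keep.view(-1, 1, 1, 1).expand_as(weights)

weights = torch.randn(64, 3, 3, 3)
mask = bernoulli_filter_mask(weights, desired_sparsity=0.5)
pruned_weights = weights * mask   # zero out the dropped filters
```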
-
Neta Zmora authored
A recent change requires us to return the binary_map from the ranking operation, and this was missing for the row-pruning case.
-
- Mar 14, 2019
-
-
Bar authored
-
- Mar 12, 2019
-
-
Bar authored
"Peformance" --> "Performance"
-
- Mar 11, 2019
-
-
Bar authored
Integrate the Cadene ```pretrainedmodels``` package. This PR integrates a large set of pre-trained PyTorch image-classification and object-detection models which originate from https://github.com/Cadene/pretrained-models.pytorch.

PLEASE NOTE: This PR adds a dependency on the ```pretrainedmodels``` package, and you will need to install it using ```pip3 install pretrainedmodels```. For new users, we have also updated the ```requirements.txt``` file.

Distiller does not currently support the compression of object-detectors (a sample application is required - and the community is invited to send us a PR). Compression of some of these models may not be fully supported by Distiller due to bugs and/or missing features. If you encounter any issues, please report them to us.

Whenever there is contention on the name of a model passed to the ```compress_classifier.py``` sample application, the Cadene models are used at the lowest priority (e.g. Torchvision models are used in favor of Cadene models when the same model is supported by both packages).

This PR also:
* Adds documentation to ```create_model```
* Adds tests for ```create_model```
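A hypothetical sketch of the name-resolution priority described above (Distiller's actual ```create_model``` logic is more involved):

```python
import torchvision.models as torchvision_models
import pretrainedmodels

def create_model_sketch(arch, pretrained=True):
    # Torchvision is consulted first; the Cadene package is the fallback.
    if arch in torchvision_models.__dict__:
        return torchvision_models.__dict__[arch](pretrained=pretrained)
    if arch in pretrainedmodels.model_names:
        return pretrainedmodels.__dict__[arch](pretrained='imagenet' if pretrained else None)
    raise ValueError("Unknown model architecture: {}".format(arch))

model = create_model_sketch('resnet50', pretrained=True)
```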
-
- Mar 10, 2019
-
-
Neta Zmora authored
-
- Mar 06, 2019
-
-
Neta Zmora authored
-
Neta Zmora authored
This is a utility function that returns some statistics about a model's parameters (model_sparsity, params_cnt, params_nnz_cnt). This file is required for the previous commit (and was accidentally left out).
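A minimal sketch of the kind of statistics described above (the function body below is an assumption based on this description, not the exact Distiller implementation):

```python
import torch

def model_params_stats(model):
    """Return overall sparsity (%), total parameter count, and non-zero count."""
    params_cnt = sum(p.numel() for p in model.parameters())
    params_nnz_cnt = sum(int((p != 0).sum()) for p in model.parameters())
    model_sparsity = 100.0 * (1.0 - params_nnz_cnt / params_cnt)
    return model_sparsity, params_cnt, params_nnz_cnt

# Example with a small model:
sparsity, cnt, nnz = model_params_stats(torch.nn.Linear(10, 10))
```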
-
Neta Zmora authored
A recent commit changed the sorting of the best performing training epochs to be based on the sparsity level of the model, then its Top1 and Top5 scores. When we create thinned models, the sparsity remains low (even zero), while the physical size of the network is smaller. This commit changes the sorting criteria to be based on the count of non-zero (NNZ) parameters. This captures both the sparsity and parameter-size objectives:
- When sparsity is high, the number of NNZ params is low (params_nnz_cnt = (1 - sparsity) * params_cnt).
- When we remove structures (thinning), the sparsity may remain constant, but the count of params (params_cnt) is lower, and therefore, once again, params_nnz_cnt is lower.
Therefore, params_nnz_cnt is a good proxy for capturing a sparsity objective and/or a thinning objective.
-
- Mar 05, 2019
-
-
Neta Zmora authored
-
Neta Zmora authored
-
Neta Zmora authored
-
Lev Zlotnik authored
-
Neta Zmora authored
amc-ft-frequency: Sometimes we may want to fine-tune the weights after every 'n' episode steps (action-steps). This new argument controls the frequency of this fine-tuning (FT), i.e. how many action-steps pass between fine-tuning sessions. By default, there is no fine-tuning between steps.
amc-reward-frequency: By default, we only provide a non-zero reward at the end of an episode. This argument allows us to provide rewards at a higher frequency.
This commit also reorders the ResNet layer names, so that layers are processed in near-topological order. This is simply to help interpret the data in the AMC Jupyter notebooks.
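A hypothetical sketch of how these two frequencies could gate an episode loop (the environment/agent API below is illustrative, not Distiller's actual AMC code):

```python
def run_episode(env, agent, amc_ft_frequency=None, amc_reward_frequency=None):
    observation = env.reset()
    for step, layer in enumerate(env.layers_to_prune(), start=1):
        action = agent.act(observation)
        observation = env.prune_layer(layer, action)
        if amc_ft_frequency and step % amc_ft_frequency == 0:
            env.fine_tune()                           # brief FT between action-steps
        if amc_reward_frequency and step % amc_reward_frequency == 0:
            agent.observe(env.intermediate_reward())  # optional mid-episode reward
    agent.observe(env.final_reward())                 # default: reward only at episode end
```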
-
- Mar 03, 2019
-
-
Neta Zmora authored
Based on a commit and ideas from @barrh: https://github.com/NervanaSystems/distiller/pull/150/commits/1623db3cdc3a95ab620e2dc6863cff23a91087bd

The sample application compress_classifier.py logs details about the best performing epoch(s) and stores the best epoch in a checkpoint file named ```best.pth.tar``` by default (if you use the ```--name``` application argument, the checkpoint name will be prefixed by ```best```).

Until this fix, the performance of a model was judged solely on its Top1 accuracy. This can be a problem when performing gradual pruning of a pre-trained model, because many times a model's Top1 accuracy increases with light pruning, and this is registered as the best performing training epoch. However, we are really interested in the best performing trained model _after_ the pruning phase is done. Even during training, we may be interested in the checkpoint of the best performing model with the highest sparsity.

This fix stores a list of the performance results from all the trained epochs so far. This list is sorted using a hierarchical key (sparsity, top1, top5, epoch), so that the list is first sorted by sparsity, then top1, followed by top5 and epoch.

But what if you want to sort using a different metric? For example, when quantizing you may want to score the best performance by the total number of bits used to represent the model parameters and feature-maps. In such a case you may want to replace ```sparsity``` with this new metric. Because this is a sample application, we don't load it with all possible control logic, and anyone can make local changes to this logic. To keep your code separated from the main application logic, we plan to refactor the application code sometime in the next few months.
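An illustrative sketch (not the application's exact code) of sorting epoch results with this hierarchical key; the numbers are made up for the example:

```python
import operator
from collections import namedtuple

PerfResult = namedtuple('PerfResult', ['sparsity', 'top1', 'top5', 'epoch'])

results = [PerfResult(0.0, 76.2, 93.0, 1),
           PerfResult(49.7, 75.9, 92.8, 40),
           PerfResult(80.0, 75.8, 92.7, 90)]

# Highest sparsity wins; ties are broken by top1, then top5, then epoch.
best = max(results, key=operator.attrgetter('sparsity', 'top1', 'top5', 'epoch'))
print(best)  # PerfResult(sparsity=80.0, top1=75.8, top5=92.7, epoch=90)
```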
-
Neta Zmora authored
Release 0.3 broke the exports to PNG and ONNX and this is the fix.
-
Neta Zmora authored
See issue #168. This is not a fix, but it warns the user of wrong MAC results until the issue is fixed.
-
- Mar 01, 2019
-
-
Yuma Hiramatsu authored
-
- Feb 28, 2019
-
-
Neta Zmora authored
-
Neta Zmora authored
-
Neta Zmora authored
-
- Feb 27, 2019
-
- Feb 26, 2019
-
-
Bar authored
Function ```log_execution_env_state``` copies a given configuration file to the logs directory, to save all of the details of an experiment. In some distributed environments a file copy may fail, so we wrap the copy of the configuration file with a try/except block.
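A minimal sketch of such a guarded copy (the function name and the caught exception type are assumptions, not the exact ```log_execution_env_state``` code):

```python
import logging
import shutil

msglogger = logging.getLogger()

def copy_config_to_logdir(config_path, logdir):
    try:
        shutil.copy(config_path, logdir)
    except OSError as e:
        # In some distributed environments the copy can fail; log and continue.
        msglogger.debug('Failed to copy %s to %s: %s', config_path, logdir, e)
```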
-
Lev Zlotnik authored
-