- Oct 23, 2019
Neta Zmora authored
Force loading on the CPU, which always has more memory than a single GPU. This is useful for models that cannot be loaded onto a single GPU.
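Assuming this refers to checkpoint loading, a minimal PyTorch sketch of forcing CPU placement (the file name is a placeholder):

```python
import torch

# Map all tensors to host memory on load; host RAM is typically much
# larger than a single GPU's memory, so oversized models still load.
checkpoint = torch.load('checkpoint.pth.tar', map_location='cpu')
```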
Neta Zmora authored
Neta Zmora authored
As documented in issue #395, some of the command-line examples in the AMC notebooks are incorrect. Also fix some bugs that were introduced with the refactoring of the low-level pruning API.
- Oct 22, 2019
Guy Jacob authored
(since stats collection is now a 2-phase process)
Neta Zmora authored
add citation
- Oct 07, 2019
Guy Jacob authored
* Greedy search script for post-training quantization settings:
  * Iterates over each layer in the model, in order. For each layer, checks a user-defined set of quantization settings and chooses the best one based on validation accuracy
  * Provided a sample that searches for the best activations-clipping mode per layer, on image-classification models
* Proper handling of mixed-quantization settings in post-train quant:
  * By default, the quantization settings for each layer apply only to output quantization
  * Propagate quantization settings for activation tensors through the model during execution
  * For non-quantized inputs to layers that require quantized inputs, fall back to quantizing according to the settings used for the output
  * In addition, provide a mechanism to override input quantization settings via the YAML configuration file
* By default, all modules are now quantized. For module types that don't have a dedicated quantized implementation, "fake" quantization is performed
* Misc. changes:
  * Fuse ReLU/ReLU6 into their predecessor during post-training quantization
  * Fixes to ACIQ clipping in the half-range case

Co-authored-by: Lev Zlotnik <lev.zlotnik@intel.com>
Co-authored-by: Guy Jacob <guy.jacob@intel.com>
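A schematic sketch of the greedy per-layer procedure described above; `candidate_settings`, `apply_setting`, and `evaluate` are hypothetical stand-ins, not Distiller's actual API:

```python
def greedy_ptq_search(model, layers, candidate_settings, apply_setting, evaluate):
    """Pick the best quantization setting per layer, visiting layers in order."""
    chosen = {}
    for layer in layers:
        best_acc, best_cfg = float('-inf'), None
        for cfg in candidate_settings:
            apply_setting(model, layer, cfg)     # quantize this layer with cfg
            acc = evaluate(model)                # validation accuracy
            if acc > best_acc:
                best_acc, best_cfg = acc, cfg
        apply_setting(model, layer, best_cfg)    # commit the winning setting
        chosen[layer] = best_cfg
    return chosen
```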
Neta Zmora authored
Neta Zmora authored
As noted in issue #382, logging when a parameter does not have a mask is unnecessary and may confuse users. Therefore, it is removed.
Neta Zmora authored
`app_cfg` logs the basic execution environment state, and is deemed important in most circumstances.
- Oct 06, 2019
Neta Zmora authored
Some refactoring of the low-level pruning API:

Added distiller/norms.py - for calculating norms of various sub-tensors.

ranked_structures_pruner.py:
- Removed l1_magnitude and l2_magnitude; use distiller.norms.l1_norm instead
- Lots of refactoring
- Replaced LpRankedStructureParameterPruner.ch_binary_map_to_mask with distiller.thresholding.expand_binary_map
- FMReconstructionChannelPruner.rank_and_prune_channels used L2-norm by default and now uses L1-norm (i.e. magnitude_fn=l2_magnitude was replaced with magnitude_fn=distiller.norms.l1_norm)

thresholding.py:
- Delegated lots of the work to the new norms.py
- Removed support for 4D (entire convolution layers), since that has not been maintained for a long time. This may break some old scripts that remove entire layers.
- Added expand_binary_map() explicitly so others can use it. Might need to move it to a different file
- Removed threshold_policy()

utils.py:
- Use distiller.norms.xxx for sparsity stats
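For illustration, per-filter L1 norms of a convolution weight in plain PyTorch, conceptually the kind of sub-tensor norm the new distiller/norms.py centralizes (exact Distiller signatures may differ):

```python
import torch

def filter_l1_norms(weight: torch.Tensor) -> torch.Tensor:
    # weight shape: (out_channels, in_channels, kH, kW);
    # returns one L1 norm per filter (output channel)
    return weight.abs().sum(dim=(1, 2, 3))

w = torch.randn(64, 3, 3, 3)
print(filter_l1_norms(w).shape)  # torch.Size([64])
```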
Bar authored
Hot-fix for an issue that arises with the FileWriter class on TF v2. Allow only TensorFlow v1.x.
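A sketch of the kind of version guard this implies (assuming the fix checks the installed version; the message text is illustrative):

```python
import tensorflow as tf

# tf.summary.FileWriter was removed in TF 2.x, so refuse to run on it.
if not tf.__version__.startswith('1.'):
    raise ImportError('TensorBoard logging here requires TensorFlow 1.x')
```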
Guy Jacob authored
Guy Jacob authored
- Oct 05, 2019
Guy Jacob authored
* Create a float copy such that the actual tensor being learned stays the same
* This way the optimizer doesn't have to be re-created; we just need to add parameter groups if the algorithm requires it (e.g. PACT)
* This also means we don't care about pre-existing parameter groups, as opposed to the previous implementation, which assumed a single existing group
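A minimal sketch of the "just add a parameter group" idea with a stock PyTorch optimizer; `clip_val` stands in for a new learned parameter such as PACT's clipping value:

```python
import torch

model = torch.nn.Linear(10, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

# The FP32 tensors being learned are unchanged, so `opt` stays valid;
# an algorithm that introduces new learned parameters simply appends
# them as another group, possibly with its own hyper-parameters.
clip_val = torch.nn.Parameter(torch.tensor(8.0))
opt.add_param_group({'params': [clip_val], 'lr': 0.01})
```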
Guy Jacob authored
- Sep 27, 2019
Neta Zmora authored
Neta Zmora authored
Neta Zmora authored
Neta Zmora authored
Neta Zmora authored
Move these files to their true location instead of using soft-links. Also add a short README file to the distiller/examples/baseline_networks directory.
- Sep 25, 2019
Neta Zmora authored
The PR was based on an older code-base, and some of that older code was accidentally merged along with the PR.
- Sep 24, 2019
- Sep 23, 2019
Neta Zmora authored
Add a Jupyter notebook showing how to register a user's (external) image-classification model. Contains fixes to the previous model-extension mechanism, and a relaxation of the `args` requirements in apputils/image_classifier.py.

apputils/image_classifier.py:
* When self.logdir is None:
  - use NullLogger
  - skip save_checkpoint
* Return the training log from run_training_loop()
* Don't log if script_dir or output_dir are not set
* Fix params_nnz_cnt in update_training_scores_history()

data_loggers/logger.py:
* Add NullLogger, which does not log
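An illustrative null-object logger in the spirit of the NullLogger described above (the method names are hypothetical, not Distiller's actual interface):

```python
class NullLogger:
    """Swallows all logging calls, so callers need no `if logger:` checks."""

    def log_training_progress(self, *args, **kwargs):
        pass

    def log_weights_sparsity(self, *args, **kwargs):
        pass
```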
- Sep 18, 2019
Guy Jacob authored
Bar authored
Neta Zmora authored
Neta Zmora authored
../../../distiller/apputils/image_classifier.py – remove unused `--extras` command-line argument
Neta Zmora authored
Odds and ends commit
Neta Zmora authored
A bundle of very small, and mostly non-functional, changes to the code. Mostly they are unrelated to each other.

* ../../../distiller/apputils/checkpoint.py – add info to an exception
* ../../../distiller/apputils/image_classifier.py – remove unused `--extras` command-line argument
* ../../../distiller/thinning.py – code refactoring (non-functional), except for adding a new public API: contract_model()
* ../../classifier_compression/compress_classifier.py – use contract_model() when using `--thinnify`
* ../../lottery_ticket/README.md – remove illegal characters in the text
- Sep 10, 2019
Yury Nahshan authored
ACIQ clipping method, as described in:
"Post training 4-bit quantization of convolutional networks for rapid-deployment" (Ron Banner, Yury Nahshan, Daniel Soudry), NeurIPS 2019
https://arxiv.org/abs/1810.05723

Co-authored-by: Yury Nahshan <yury.nahshan@intel.com>
Co-authored-by: Lev Zlotnik <lev.zlotnik@intel.com>
Neta Zmora authored
Added an explicit thank-you mention for Cadene.
- Sep 09, 2019
KunHuang authored
- Sep 08, 2019
Neta Zmora authored
barrh authored
This patch moves the check on remaining training epochs into the training loop, as it is irrelevant for non-training invocations of ClassifierCompressor. This allows resumed checkpoints to be evaluated regardless of the epoch limit.
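Schematically, the change amounts to something like the following (names are illustrative, not Distiller's actual code):

```python
def run(self, training=True):
    if not training:
        # Evaluation of a resumed checkpoint: no epoch-limit check.
        return self.evaluate()
    if self.start_epoch >= self.max_epochs:
        raise ValueError('no training epochs remaining')
    for epoch in range(self.start_epoch, self.max_epochs):
        self.train_one_epoch(epoch)
```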
Lev Zlotnik authored
* The is_simulated_quant_weight_shifted flag was a Python bool, and modifying it during forward() isn't supported when DataParallel is used, as per https://pytorch.org/docs/stable/nn.html#dataparallel-layers-multi-gpu-distributed
* Instead, registered it as an integer tensor buffer
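A minimal sketch of the buffer-based fix (module and buffer names are illustrative): under nn.DataParallel, attribute assignment on a replica is discarded after forward(), but the device[0] replica shares buffer storage with the base module, so an in-place tensor update sticks.

```python
import torch
import torch.nn as nn

class QuantWrapper(nn.Module):
    def __init__(self):
        super().__init__()
        # Tensor buffer instead of a plain Python bool, so the state
        # change made inside forward() survives DataParallel replication.
        self.register_buffer('weight_shifted', torch.zeros(1, dtype=torch.uint8))

    def forward(self, x):
        if self.weight_shifted == 0:
            # ... shift the simulated-quant weights here (omitted) ...
            self.weight_shifted.fill_(1)  # in-place update of the buffer
        return x
```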
- Sep 06, 2019
Neta Zmora authored
Neta Zmora authored
Integrate the code for the DDPG agent from https://github.com/mit-han-lab/amc-release. The instructions for cloning HAN's code and then making changes to fit Distiller were too complicated, so we added the integrated files to distiller/examples/auto_compression/amc/rl_lib/hanlab.
- Sep 05, 2019
Neta Zmora authored
- Sep 04, 2019
103yiran authored