Commits · 4ad16ef00f9ea90c0d7834667bf86b12e795c12e · llvm / distiller

Aug 08, 2019
- Point to GNMT example in docs · 7b5fdefe
  Guy Jacob authored 5 years ago
  
  7b5fdefe
Aug 04, 2019
- Add documentation section on preparing a model for quantization · 7fee2c9d
  Guy Jacob authored 5 years ago
  
  7fee2c9d
Jul 10, 2019

Post-Train Quantization: BN folding and "net-aware quantization" (#313) · 43548deb

Guy Jacob authored 6 years ago

* "Net-aware quantization" - using the term coined in
  https://arxiv.org/abs/1811.09886. (section 3.2.2).
  Refers to considering sequences of modules when quantizing. This 
  isn't exactly layer fusion - we modify activation stats prior to
  setting quantization parameters, to make sure that when a module
  is followed by certain activation functions, only the relevant
  ranges are quantized. We do this for:
    * ReLU - Clip all negative values
    * Tanh / Sigmoid - Clip according to the (approximated) saturation
      values for these functions. We use [-4, 4] for tanh and [-6, 6]
      for sigmoid.

* Perform batch-norm folding before post-training quantization.
  Batch-norm parameters are folded into the parameters of the previous
  layer and the BN layer is replaced with an identity module.

* Both BN folding and "net-aware" are now automatically executed
  in PostTrainLinearQuantizer (details of this change below)

* BN folding enabled by new generic mechanism to "fuse" module
  sequences (at the Python API level)
    * First module in sequence is replaced/modified by a user-provided
      function, rest of moudles replaced with nn.Identity

* Quantizer changes:
  * Optionally create adjacency map during prepare_model
  * Subclasses may enforce adjacency map creation
  * Refatcoring: Replace _prepare_model_impl with pre and post
    override-able "callbacks", so core functionality is always executed

* PostTrainLinearQuantizer Changes:
  * Enforce creation of adjacency map. This means users must now pass a
    dummy input to PostTrainLinearQuantizer.prepare_model
  * Before module replacement - Apply BN folding and stats updates according
    to net-aware quantization

* Updated the language model quantization tutorial to reflect the new
  functionality

* Updated the image classification post-train quantization samples
  (command line and YAML)

* Other changes:
  * Distller LSTM implementation:
    Replace the ModuleList for cells with a plain list. The PyTorch trace
    mechanism doesn't "see" ModuleList objects, it only sees the 
    contained modules. This means that the "scopeName" of these modules
    isn't complete, which makes it impossible to match op names in 
    SummaryGraph to modules in the Python model.
  * ActivationStatsCollector: Ignore nn.Identity modules

Unverified

43548deb

Jul 08, 2019
- Add links to language model quantization notebook in README and docs · 81047f5d
  Guy Jacob authored 6 years ago
  
  81047f5d
Mar 29, 2019
- Fixed a typo in te quantization documentation (#207) · f5987f9a
  Songyi Blair Han authored 6 years ago
  
  f5987f9a
Feb 26, 2019

PyTorch 1.0.0 support + Proper Packaging (Release 0.3) (#144) · 62862a08

Lev Zlotnik authored 6 years ago

Not backward compatible - re-installation is required

* Fixes for PyTorch==1.0.0
* Refactoring folder structure
* Update installation section in docs

Unverified

62862a08

Dec 06, 2018

Documentation refactoring · 178c8c49

Neta Zmora authored 6 years ago

- Moved the Language model and struct pruning tutorials from the Wiki to
the HTML documentation.  Love the ease of Wiki, but GitHub doesn't let
Google crawl these pages, and users can't open PRs on Wiki pages.

- Updated the pruning algorithms documentation

178c8c49

Nov 07, 2018
- Documentation: add github pages documentation for Early Exit · 5681541f
  Neta Zmora authored 6 years ago
  
  5681541f
Sep 03, 2018

Add knowledge distillation flow (#41) · c9794e4a

Guy Jacob authored 6 years ago

* Implemented as a Policy
* Integrated in image classification sample
* Updated docs and README

Unverified

c9794e4a

Apr 30, 2018
- Additional quantization docs + fixes · 7bbfd12b
  Guy Jacob authored 7 years ago
  
  7bbfd12b
Apr 28, 2018
- fix typo: Jupyter spelled as Jupiter · ebb89126
  Neta Zmora authored 7 years ago
  
  ebb89126
Apr 24, 2018
- small documentation touchups · 7fbde765
  Neta Zmora authored 7 years ago
  
  7fbde765
- small documentation touchups · cb79e100
  Neta Zmora authored 7 years ago
  
  cb79e100
- Fix README links · 0ecd205a
  Neta Zmora authored 7 years ago
  
  0ecd205a
- first commit · 6eef69b5
  Neta Zmora authored 7 years ago
  
  6eef69b5