  1. Aug 08, 2019
  2. Aug 04, 2019
  3. Jul 08, 2019
  4. Jun 10, 2019
  5. Apr 14, 2019
  6. Mar 29, 2019
  7. Feb 11, 2019
    •
      Post-train quant based on stats + additional modules quantized (#136) · 28a8ee18
      Guy Jacob authored
      Summary of changes:
      (1) Post-train quantization based on pre-collected statistics
      (2) Quantized concat, element-wise addition / multiplication and embeddings
      (3) Move post-train quantization command line args out of sample code
      (4) Configure post-train quantization from YAML for more fine-grained control
      
      (See PR #136 for a more detailed description of the changes; a rough sketch of the stats-based flow follows below.)
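      As a rough illustration of item (1) above (a hedged sketch, not the project's actual API),
      asymmetric linear quantization of a tensor from pre-collected min/max statistics can
      look like the following; the function name and the layout of the stats dict are
      assumptions made for the example:

          import torch

          def quantize_from_stats(x, stats, num_bits=8):
              # Asymmetric linear quantization of x using pre-collected min/max
              # statistics instead of measuring the range from x on the fly.
              # `stats` is assumed to be a dict like {'min': float, 'max': float}.
              qmin, qmax = 0, 2 ** num_bits - 1
              min_val, max_val = stats['min'], stats['max']
              scale = (max_val - min_val) / (qmax - qmin) if max_val > min_val else 1.0
              zero_point = round(qmin - min_val / scale)
              # Quantize, clamp to the representable range, then de-quantize.
              q = torch.clamp(torch.round(x / scale + zero_point), qmin, qmax)
              return (q - zero_point) * scale

          # Example: quantize activations whose range was profiled offline
          x = torch.randn(4, 16)
          x_q = quantize_from_stats(x, {'min': -3.0, 'max': 3.0}, num_bits=8)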
  8. Dec 06, 2018
    •
      Documentation refactoring · 178c8c49
      Neta Zmora authored
      - Moved the language model and structured pruning tutorials from the Wiki to
      the HTML documentation.  We love the ease of the Wiki, but GitHub doesn't let
      Google crawl Wiki pages, and users can't open PRs against them.
      
      - Updated the pruning algorithms documentation
  9. Dec 04, 2018
    •
      Range-Based Linear Quantization Features (#95) · 907a6f04
      Guy Jacob authored
      * Asymmetric post-training quantization (only symmetric was supported until now)
      * Quantization-aware training for range-based (min-max) symmetric and asymmetric quantization (a rough sketch follows after this list)
      * Per-channel quantization support in both training and post-training
      * Added tests and examples
      * Updated documentation
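      As a rough sketch of the quantization-aware training feature listed above (an
      illustration under common conventions, not the project's actual implementation),
      range-based QAT typically simulates quantization in the forward pass and uses a
      straight-through estimator in the backward pass; the class name and argument
      layout below are assumptions:

          import torch

          class FakeQuantSTE(torch.autograd.Function):
              # Simulated ("fake") quantization with a straight-through estimator:
              # the forward pass rounds values onto the quantization grid, while
              # the backward pass lets gradients flow through unchanged.

              @staticmethod
              def forward(ctx, x, scale, zero_point, qmin, qmax):
                  q = torch.clamp(torch.round(x / scale + zero_point), qmin, qmax)
                  return (q - zero_point) * scale

              @staticmethod
              def backward(ctx, grad_output):
                  # Straight-through: pass the gradient w.r.t. x as-is; the other
                  # arguments get no gradient in this sketch.
                  return grad_output, None, None, None, None

          # During training, weights/activations go through the fake-quant node so the
          # model learns to tolerate the rounding error (symmetric 8-bit example:
          # zero-point 0, integer range [-127, 127]).
          x = torch.randn(4, 8, requires_grad=True)
          y = FakeQuantSTE.apply(x, 0.05, 0.0, -127, 127)
          y.sum().backward()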
  10. Nov 25, 2018
  11. Nov 24, 2018
    •
      Fix activation stats for Linear layers · 22e3ea8b
      Neta Zmora authored
      Thanks to Dan Alistarh for bringing this issue to my attention.
      The activations of Linear layers have shape (batch_size, output_size), while those
      of Convolution layers have shape (batch_size, num_channels, height, width); this
      difference in shape was not handled correctly.
      
      This commit also fixes the sparsity computation for very large activations, as seen
      in VGG16, which could exhaust memory.  One workaround is to use smaller batch sizes,
      but this commit instead counts zeros “manually”, which uses less memory (a rough
      sketch follows below).
      
      Also in this commit:
      - Added a “caveats” section to the documentation.
      - Added more tests.
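      As a rough illustration of the “manual” zero-counting mentioned above (a hedged
      sketch, not the project's actual implementation), sparsity can be accumulated as two
      running counters per batch so that no full activation tensor ever has to be stored;
      the class name and hook wiring below are assumptions made for the example:

          import torch

          class SparsityMeter:
              # Accumulates activation sparsity as running counts of zeros and of
              # total elements, so very large activations (e.g. in VGG16) never
              # need to be kept in memory as a whole.
              def __init__(self):
                  self.zeros = 0
                  self.total = 0

              def update(self, activation):
                  # Works for Linear outputs (batch_size, output_size) and for
                  # Convolution outputs (batch_size, num_channels, height, width).
                  self.zeros += int((activation == 0).sum().item())
                  self.total += activation.numel()

              @property
              def sparsity(self):
                  return self.zeros / self.total if self.total else 0.0

          # Collect stats batch-by-batch via a forward hook
          meter = SparsityMeter()
          model = torch.nn.Sequential(torch.nn.Linear(64, 128), torch.nn.ReLU())
          model.register_forward_hook(lambda mod, inp, out: meter.update(out))
          with torch.no_grad():
              for _ in range(10):
                  model(torch.randn(32, 64))
          print(f"activation sparsity: {meter.sparsity:.2%}")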
  12. Nov 21, 2018
  13. Nov 07, 2018
  14. Sep 03, 2018
  15. Jun 21, 2018
  16. May 22, 2018
  17. Apr 30, 2018
  18. Apr 24, 2018