  Nov 24, 2018
      Fix activation stats for Linear layers · 22e3ea8b
      Neta Zmora authored
      Thanks to Dan Alistarh for bringing this issue to my attention.
      The activations of Linear layers have shape (batch_size, output_size), while those
      of Convolution layers have shape (batch_size, num_channels, height, width);
      this distinction in shape was not handled correctly.
      
      This commit also fixes sparsity computation for very large activations, as seen
      in VGG16, which previously led to memory exhaustion.  One workaround is to use
      smaller batch sizes, but this commit instead counts zeros “manually”, which
      uses less memory (see the sketch below).
      
      Also in this commit:
      - Added a “caveats” section to the documentation.
      - Added more tests.
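      To illustrate the fix, here is a minimal sketch (not the actual Distiller
      code) of computing sparsity by counting zeros directly, which avoids
      materializing large intermediate tensors and handles both activation shapes:

          # Illustrative sketch only -- function names are hypothetical.
          import torch

          def activation_sparsity(act: torch.Tensor) -> float:
              # Works for Linear activations (batch_size, output_size) and Conv
              # activations (batch_size, num_channels, H, W) alike: count zeros
              # over the whole tensor and normalize by the element count.
              return (act == 0).sum().item() / act.numel()

          def channel_sparsity(act: torch.Tensor) -> torch.Tensor:
              # Per-channel sparsity: Conv activations have a channel dim (dim=1);
              # Linear activations do not, so treat each output feature as a "channel".
              if act.dim() == 4:                      # (N, C, H, W)
                  flat = act.transpose(0, 1).reshape(act.size(1), -1)
              else:                                   # (N, F): one "channel" per feature
                  flat = act.t()
              return (flat == 0).float().mean(dim=1)  # fraction of zeros per channel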
  Oct 03, 2018
      documentation: update syntax of launching jupyter notebook · 5902146a
      Neta Zmora authored
      Recent versions of Jupyter notebook use a different syntax for
      launching the server so that it listens on all network interfaces
      (this is useful if you are running the Jupyter server on one machine
      and connecting to it from a browser on a different machine).
      
      So:
      	jupyter-notebook --ip=* --no-browser
      
      is replaced by:
      	jupyter-notebook --ip=0.0.0.0 --no-browser
  Sep 16, 2018
      A temporary fix for issue #36 (#48) · 5d3d6d8d
      Neta Zmora authored
      * A temporary fix for issue 36
      
      The thinning code assumes that the sgraph it uses
      is not data-parallel, because it (currently) accesses the
      layer-name keys using a "normalized" name ("module." is removed).

      The bug is that in thinning.py#L73 we create a data_parallel=True
      model and then hand it to sgraph, while elsewhere the thinning code
      uses "normalized" keys (for example, in thinning.py#L264).

      The temporary fix configures data_parallel=False in thinning.py#L73.
      
      A long-term solution should have SummaryGraph handle both
      parallel and non-parallel models.  This can be done by having
      SummaryGraph convert the layer names it receives through its API to
      their normalized (data_parallel=False) form using normalize_layer_name,
      and de-normalize them again when returning results (see the sketch below).
      
      * Fix the documentation error from issue 36
      * Move some logs to DEBUG level, and show in logging.conf how to enable DEBUG logging.
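      To make the name handling concrete, here is a minimal sketch of the
      normalization described above (illustrative only; the commit mentions
      normalize_layer_name, but this is not necessarily Distiller's implementation):

          # Illustrative sketch -- DataParallel prefixes every sub-module
          # name with "module.", so keys must be normalized before lookup
          # and de-normalized when returning results.
          import torch.nn as nn

          def normalize_layer_name(name: str) -> str:
              # "module.features.0" (data-parallel) -> "features.0"
              prefix = "module."
              return name[len(prefix):] if name.startswith(prefix) else name

          def denormalize_layer_name(model: nn.Module, name: str) -> str:
              # Restore the "module." prefix if the model is wrapped in DataParallel.
              return "module." + name if isinstance(model, nn.DataParallel) else name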
  Jul 22, 2018
      PACT quantizer (#30) · df9a00ce
      Gal Novik authored
      * Added the PACT quantization method (see the sketch below)
      * Moved the logic that modifies the optimizer, due to changes the quantizer makes, into the Quantizer itself
      * Updated documentation and tests
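      For context, here is a minimal sketch of the PACT idea (Choi et al.):
      clip activations to a learnable upper bound alpha, then quantize the
      clipped range linearly, using a straight-through estimator for the
      rounding.  This is illustrative only, not Distiller's PACT implementation:

          import torch
          import torch.nn as nn

          class PACTActivation(nn.Module):
              def __init__(self, num_bits: int = 4, init_alpha: float = 6.0):
                  super().__init__()
                  self.num_bits = num_bits
                  self.alpha = nn.Parameter(torch.tensor(init_alpha))  # learned clip value

              def forward(self, x):
                  # Clip to [0, alpha]: equals x for 0 <= x <= alpha, else 0 or alpha.
                  y = torch.clamp(x, min=0.0) - torch.clamp(x - self.alpha, min=0.0)
                  scale = (2 ** self.num_bits - 1) / self.alpha
                  y_q = torch.round(y * scale) / scale
                  # Straight-through estimator: quantized values in the forward
                  # pass, identity gradient in the backward pass.
                  return y + (y_q - y).detach()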
  Jul 17, 2018
      Quantizer tests, fixes and docs update · 6b166cec
      Guy Jacob authored
      * Add Quantizer unit tests
      * Require 'bits_overrides' to be an OrderedDict to support overlapping
        patterns in a predictable manner (see the example below) + update documentation to reflect this
      * Quantizer class cleanup
        * Use "public" nn.Module APIs instead of protected attributes
        * Call the builtins set/get/delattr instead of the class special methods
          (__***__)
        * Fix issues reported in #24
      * Fix bug in RangeLinearQuantParamLayerWrapper - add explicit override of
        pre_quantized_forward accepting a single input (#15)
      * Add DoReFa test to full_flow_tests
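      A short illustration of why 'bits_overrides' must be ordered: with
      overlapping regex patterns the first match wins, so insertion order
      decides which override applies.  The override-value format shown here
      is an assumption for illustration, not necessarily the exact schema:

          from collections import OrderedDict

          # More specific patterns must come first; 'conv.*' also matches 'conv1',
          # so a plain (unordered) dict could apply the wrong override.
          bits_overrides = OrderedDict([
              ('conv1',  {'acts': None, 'wts': None}),  # first layer: keep full precision
              ('conv.*', {'acts': 8,    'wts': 8}),     # all other conv layers: 8 bits
          ])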