Guy Jacob
authored
* Weights-only PTQ:
  * Allow RangeLinearQuantWrapper to accept num_bits_acts = None, in which case it acts as a simple pass-through during forward.
  * In RangeLinearQuantParamLayerWrapper, if bits_activations is None and num_bits_params > 0, perform quant and de-quant of the parameters instead of just quant.
* Activations-only PTQ:
  * Enable activations-only quantization for conv/linear modules. When PostTrainLinearQuantizer detects # bits != None for activations and # bits == None for weights, a fake-quantization wrapper is used.
* Allow passing 0 in the `--qe-bits-acts` and `--qe-bits-wts` command line arguments to invoke weights-only / activations-only quantization, respectively.
* Minor refactoring for clarity in PostTrainLinearQuantizer's replace_* functions
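The "quant and de-quant" behavior above (often called fake quantization) can be sketched as follows. This is a minimal illustration of symmetric linear quant-dequant of a weight tensor, not Distiller's actual implementation; the function name and the plain-list representation are hypothetical, chosen to keep the example dependency-free.

```python
def linear_quantize_dequantize(values, num_bits):
    """Hypothetical sketch: symmetric linear quantize + de-quantize.

    Values are rounded onto an integer grid with 2^(num_bits-1) - 1
    positive levels, then mapped back to the float domain, so downstream
    computation still runs in floating point (weights-only / fake quant).
    """
    max_abs = max(abs(v) for v in values)
    if max_abs == 0:
        return list(values)
    n_levels = 2 ** (num_bits - 1) - 1   # e.g. 127 for 8 bits
    scale = n_levels / max_abs
    # quantize (round to grid), then immediately de-quantize (divide back)
    return [round(v * scale) / scale for v in values]

weights = [0.7, -1.2, 0.05, 0.0]
fq_weights = linear_quantize_dequantize(weights, num_bits=8)
```

After the round trip, each weight differs from its original by at most half a quantization step, which is why the de-quantized parameters can be used directly by an unmodified floating-point forward pass.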