onnxruntime/tools/ci_build/github/linux
Wang, Mengni fe463d4957
Support SmoothQuant for ORT static quantization (#16288)
### Description

Support SmoothQuant for ORT static quantization via intel neural
compressor

> Note:
Please use neural-compressor==2.2 to try SmoothQuant function.

### Motivation and Context
For large language models (LLMs) with gigantic parameters, the
systematic outliers make quantification of activations difficult. As a
training free post-training quantization (PTQ) solution, SmoothQuant
offline migrates this difficulty from activations to weights with a
mathematically equivalent transformation. Integrating SmoothQuant into
ORT quantization can benefit the accuracy of INT8 LLMs.

---------

Signed-off-by: Mengni Wang <mengni.wang@intel.com>
2023-07-26 18:56:45 -07:00
..
docker Support SmoothQuant for ORT static quantization (#16288) 2023-07-26 18:56:45 -07:00
ort_minimal Update python 3.11 and remove 3.7 for Linux (#15214) 2023-03-27 14:46:30 -07:00
tvm Fix TVM pipelines (#15653) 2023-04-26 09:55:05 +02:00
build_cuda_c_api_package.sh Remove 37 and 50 from CUDA compute archs (#14874) 2023-03-03 12:24:21 -08:00
build_linux_arm64_python_package.sh Fix python pipeline for AzureEP without using root (#16023) 2023-05-22 16:38:47 -07:00
build_yocto.sh
copy_strip_binary.sh Copy missing training header files to the package archive (#16119) 2023-05-30 16:45:40 -07:00
create_package.sh
extract_and_bundle_gpu_package.sh
java_copy_strip_binary.sh
java_linux_final_test.sh Update protobuf-java to version 3.21.7 (#13630) 2022-11-17 15:04:42 -08:00
run_build.sh Fix Linux MultiGPU TensorRT CI (#15697) 2023-04-26 10:01:33 -07:00
run_dockerbuild.sh Fix Linux MultiGPU TensorRT CI (#15697) 2023-04-26 10:01:33 -07:00
run_python_dockerbuild.sh Add compilation cache in 2 Linux CPU pipelines and refactor the Linux build step with cache (#15484) 2023-04-14 23:56:59 +08:00
run_python_tests.sh Unify cuda & trt version on few CIs (#15943) 2023-05-15 09:54:30 -07:00
test_custom_ops_pytorch_export.sh
upload_code_coverage_data.sh
upload_ortsrv_binaries.sh
yocto_build_toolchain.sh