pytorch/caffe2/perfkernels
efiks ea0ec9d71c [tourch] BatchBoxCox - fix numerical issue in vectorized code (#88875)
Summary:
Using fast math in the BatchBoxCox kernel produced different numerical results between the dev and optimized versions, which caused a few internal tests to fail.
For now, this disables the compiler-optimized version and relies on ATen vectors.

Differential Revision: D41211784

Pull Request resolved: https://github.com/pytorch/pytorch/pull/88875
Approved by: https://github.com/hyuen
2022-11-11 21:58:23 +00:00
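
To illustrate the kind of discrepancy the summary describes, here is a minimal sketch and not the kernel's actual code. It assumes the transform documented for the Caffe2 BatchBoxCox operator: ln(x + lambda2) when lambda1 == 0, otherwise ((x + lambda2)^lambda1 - 1) / lambda1. The function names are hypothetical. The second function replaces pow with the algebraically equal exp(lambda1 * log(.)), a stand-in for the reordering a fast-math build is allowed to do. The two forms agree only up to rounding, so tests that compare builds need a tolerance rather than exact equality.

```python
import math


def box_cox(x, lambda1, lambda2):
    """Scalar Box-Cox transform of one value (reference form)."""
    shifted = x + lambda2
    if lambda1 == 0.0:
        return math.log(shifted)
    return (math.pow(shifted, lambda1) - 1.0) / lambda1


def box_cox_exp_log(x, lambda1, lambda2):
    """Same transform, with pow rewritten as exp(lambda1 * log(shifted))."""
    shifted = x + lambda2
    if lambda1 == 0.0:
        return math.log(shifted)
    return (math.exp(lambda1 * math.log(shifted)) - 1.0) / lambda1


def max_relative_difference(values, lambda1, lambda2):
    """Largest relative gap between the two formulations over `values`."""
    worst = 0.0
    for x in values:
        ref = box_cox(x, lambda1, lambda2)
        alt = box_cox_exp_log(x, lambda1, lambda2)
        scale = max(abs(ref), 1e-300)  # guard against division by zero
        worst = max(worst, abs(ref - alt) / scale)
    return worst
```

Calling max_relative_difference([0.5, 1.5, 2.0, 10.0], 0.3, 0.1) gives a tiny value that is not guaranteed to be zero, which is why a comparison such as math.isclose(a, b, rel_tol=1e-9) is a safer check than a == b.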
..
__init__.py
adagrad.cc
adagrad.h
adagrad_avx2.cc
batch_box_cox.cc
batch_box_cox.h
batch_box_cox_avx2.cc [tourch] BatchBoxCox - fix numerical issue in vectorized code (#88875) 2022-11-11 21:58:23 +00:00
CMakeLists.txt
common.h
common_avx.cc
common_avx2.cc
common_avx512.cc
cvtsh_ss_bugfix.h
embedding_lookup.cc
embedding_lookup.h
embedding_lookup_avx2.cc
embedding_lookup_fused_8bit_rowwise_avx2.cc
embedding_lookup_fused_8bit_rowwise_idx_avx2.cc
embedding_lookup_idx.cc
embedding_lookup_idx.h
embedding_lookup_idx_avx2.cc
fused_8bit_rowwise_embedding_lookup.cc
fused_8bit_rowwise_embedding_lookup.h
fused_8bit_rowwise_embedding_lookup_idx.cc
fused_8bit_rowwise_embedding_lookup_idx.h
fused_nbit_rowwise_conversion.cc
fused_nbit_rowwise_conversion.h
hp_emblookup_codegen.py
lstm_unit_cpu-impl.h [caffe2][tourch] Optimize BatchBoxCox (#87585) 2022-11-10 06:11:05 +00:00
lstm_unit_cpu.h
lstm_unit_cpu_avx2.cc
lstm_unit_cpu_common.cc
lstm_unit_cpu_common.h
math.h
math_cpu_avx2.cc
math_cpu_base.cc
typed_axpy.cc
typed_axpy.h
typed_axpy_avx.cc
typed_axpy_avx2.cc
vectorizer.h [caffe2][tourch] Optimize BatchBoxCox (#87585) 2022-11-10 06:11:05 +00:00