pytorch/caffe2/perfkernels
Nikita Shulga a229e78544 [BE] Enforce sign-compare (#96723)
Number of OSS PR were reverted, because new signed-unsigned comparison warnings, which are treated as errors in some internal builds.
Not sure how those selective rules are applied, but this PR removes `-Wno-sign-compare` from PyTorch codebase.

The only tricky part in this PR, as making sure that non-ASCII character detection works for both signed and unsigned chars  here:
6e3d51b08a/torch/csrc/jit/serialization/python_print.cpp (L926)

Exclude several files from sign-compare if flash attention is used, due to the violation in cutlass, to be fixed by https://github.com/NVIDIA/cutlass/pull/869
Do not try to fix sign compare violations in caffe2 codebase
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96723
Approved by: https://github.com/albanD
2023-03-15 06:04:20 +00:00
..
__init__.py
adagrad.cc
adagrad.h
adagrad_avx2.cc
batch_box_cox.cc [BE] Enforce sign-compare (#96723) 2023-03-15 06:04:20 +00:00
batch_box_cox.h
batch_box_cox_avx2.cc [tourch] BatchBoxCox - fix numerical issue in vectorized code (#88875) 2022-11-11 21:58:23 +00:00
CMakeLists.txt
common.h
common_avx.cc
common_avx2.cc
common_avx512.cc
cvtsh_ss_bugfix.h
embedding_lookup.cc
embedding_lookup.h
embedding_lookup_avx2.cc
embedding_lookup_fused_8bit_rowwise_avx2.cc
embedding_lookup_fused_8bit_rowwise_idx_avx2.cc
embedding_lookup_idx.cc add bf16 in fp32 out fast path for embedingbag in caffe2 perfkernel (#89198) 2022-11-30 13:06:13 +00:00
embedding_lookup_idx.h
embedding_lookup_idx_avx2.cc add bf16 in fp32 out fast path for embedingbag in caffe2 perfkernel (#89198) 2022-11-30 13:06:13 +00:00
fused_8bit_rowwise_embedding_lookup.cc
fused_8bit_rowwise_embedding_lookup.h
fused_8bit_rowwise_embedding_lookup_idx.cc
fused_8bit_rowwise_embedding_lookup_idx.h
fused_nbit_rowwise_conversion.cc [BE] Enforce sign-compare (#96723) 2023-03-15 06:04:20 +00:00
fused_nbit_rowwise_conversion.h
hp_emblookup_codegen.py add bf16 in fp32 out fast path for embedingbag in caffe2 perfkernel (#89198) 2022-11-30 13:06:13 +00:00
lstm_unit_cpu-impl.h [caffe2][tourch] Optimize BatchBoxCox (#87585) 2022-11-10 06:11:05 +00:00
lstm_unit_cpu.h
lstm_unit_cpu_avx2.cc
lstm_unit_cpu_common.cc
lstm_unit_cpu_common.h
math.h
math_cpu_avx2.cc
math_cpu_base.cc
typed_axpy.cc
typed_axpy.h
typed_axpy_avx.cc
typed_axpy_avx2.cc
vectorizer.h [caffe2][tourch] Optimize BatchBoxCox (#87585) 2022-11-10 06:11:05 +00:00