onnxruntime/cmake/external
George Nash e695cd304a
Dnnl refactor (#8627)
* dnnl ep rework

    rework DnnlTensor,DnnlNode,DnnlSubgraph to support arbitrary graph topology and tensor data types

    rework GetCapability to claim nodes in graph greedily from node topological ordering and delay creation of DnnlSubgraph until Compile

    rework compile to have DnnlSubgraphPrimitive as the object to handle primitive creation and execution
        instead of thread local primitive pool which duplicates intermediate memory allocated by the EP across threads

    DnnlSubgraphPrimitive provides helpers to handle many common functions for each dnnl primitive builder and become the centralized place to store input, output, intermediate memories, initializer memories and etc
        it provides functions to obtain input memories with automatic reordering/reshaping and moving between engines
        it provides interfaces to add primitive, set output memory for single node and etc

    add CONCURRENT_EXEC compile flag for dnnl library as without it, convolution primitive cannot be created and executed on different threads

    enable unit tests to run on dnnl ep as well if built with dnnl ep

    add dnnl ep support for Matmulinteger

* Add Relu to the DNNL refactor

Signed-off-by: George Nash <george.nash@intel.com>

* Add Convolution op to the DNNL rework

Signed-off-by: George Nash <george.nash@intel.com>

* Add Pooling ops to the DNNL rework

This adds the following ops:
    - AveragePool
    - GlobalAveragePool
    - GlobalMaxPool
    - MaxPool

Note: Pooling with dilation is not yet supported.
Note: GlobalLpPool, LpPool, MaxRoiPool, and MaxUnpool are not supported yet.

Signed-off-by: George Nash <george.nash@intel.com>

* Add Sum op to the DNNL rework

Signed-off-by: George Nash <george.nash@intel.com>

* Add ConvGrad op to the DNNL rework

Signed-off-by: George Nash <george.nash@intel.com>

* Add MaxPoolGrad and AveragePoolGrad ops to DNNL rework

Signed-off-by: George Nash <george.nash@intel.com>

* Added lrn operator to the refactored code

Signed-off by chethan.palangoutu.keshava@intel.com

* Added ReduceMean DNNL op to the refactor code

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Added Softmax DNNL op for the refactored code

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Added BatchNorm DNNL op inference-only for refactored code

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Added Binary Ops to DNNL rework

Signed-off-by: Wang <zhaoyang.wang@intel.com>

* Added ReluGrad to DNNL Rework

Signed-off-by: Wang <zhaoyang.wang@intel.com>

* Update OneDNN tag to v2.3

Signed-off-by: Wang <zhaoyang.wang@intel.com>

* Added support for memory upto dim size 12

this is to fix the CI test cases that contain binary ops of input dim
size > 5

Signed-off-by: Wang <zhaoyang.wang@intel.com>

* Prevent claiming support for float16 and bfloat16 when only float is suppoted

By using The string.find used was causing the code to claiming support
for float16 and bfloat16 when we only supported float. We now explicitly
check the code for the data type or the data type with a 7 letter prefix
basically prefixed with "tensor("

Signed-off-by: George Nash <george.nash@intel.com>

* Disable uint8 mul and div, improve type conversion

Disable mul_uint8 and div_uint8 test cases as they use modulo for
overflow handling while onednn uses saturation

improve ype conversion using enum instead of string comparsion as well
as adding more types

Signed-off-by: Wang <zhaoyang.wang@intel.com>

Co-authored-by: Wang <zhaoyang.wang@intel.com>
Co-authored-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>
2021-08-13 14:15:43 -07:00
..
coremltools@523d5e03d8 Initial version of CoreML EP (#6392) 2021-01-27 10:43:17 -08:00
cub@c3cceac115
cxxopts@3c73d91c0b
date@e7e1482087
dlpack@e1e11e0d55 Post merge update for ORTModule 2021-03-16 20:11:59 -07:00
eigen@1f4c0311cd Update C++ Standard from 14 to 17 (#8041) 2021-06-25 14:08:01 -07:00
emsdk@f44b841547 [wasm] upgrade emsdk version to 2.0.23 (#7893) 2021-06-02 12:26:24 -07:00
FeaturizersLibrary@fd5fe3de50
flatbuffers@6df40a2471 Move flatbuffers to 1.12 release (#5392) 2020-10-07 09:23:03 -07:00
googlebenchmark@7d0d9061d8 add google benchmark as direct dependency (#7762) 2021-05-19 20:12:17 -07:00
googletest@53495a2a7d Update googletest to latest commit to fix build issues with GCC11 (#7984) 2021-06-08 16:06:53 -07:00
json@db78ac1d77 Use GCC 10 in Linux CPU CI pipeline (#7985) 2021-06-08 11:53:29 -07:00
libprotobuf-mutator@7a2ed51a6b
mimalloc@2d54553b7a
mp11@21cace4e57 Op kernel type reduction infrastructure. (#6466) 2021-01-28 07:27:19 -08:00
nsync@436617053d
onnx@1f63dcb7fc Update onnx (#8458) 2021-08-05 09:21:44 -07:00
onnx-tensorrt@4e50dbca66 Update TensorRT parser to the latest (#8712) 2021-08-12 18:10:51 -07:00
onnxruntime-extensions@b708aeaaf4 Update submodule onnxruntime-extensions. (#8282) 2021-07-13 10:21:11 +08:00
optional-lite@4acf4553ba Upgrade optional implementation to https://github.com/martinmoene/optional-lite. (#5563) 2020-11-03 15:27:47 -08:00
protobuf@2dc747c574 Update protobuf to 3.16 (#7616) 2021-05-07 14:09:23 -07:00
pytorch_cpuinfo@5916273f79 Adding pytorch cpuinfo as dependency (#8178) 2021-07-12 14:21:12 -07:00
re2@4244cd1cb4 Update C++ Standard from 14 to 17 (#8041) 2021-06-25 14:08:01 -07:00
SafeInt Revert to using release SafeInt repo now that it supports a build with exceptions disabled. (#5233) 2020-09-22 06:29:28 +10:00
tensorboard@373eb09e4c
tvm@eab844a872
wil@e8c599bca6
dml.cmake Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511) 2021-04-30 00:49:30 -07:00
dnnl.cmake Dnnl refactor (#8627) 2021-08-13 14:15:43 -07:00
eigen.cmake
featurizers.cmake
FindNumPy.cmake
jemalloc.cmake
mimalloc.cmake
onnx_minimal.cmake pull onnx latest commit (#7102) 2021-03-29 11:00:38 -07:00
pybind11.cmake Add static code analyzer to Windows CPU/GPU CI builds and fix the warnings (#7489) 2021-04-29 11:54:57 -07:00
pyxir.cmake
zlib.cmake