* Remove unused NodeArgs
* Handle case where a node arg comes from an initializer listed in initializer_names_to_preserve
* Fix CI failure
* update test
* Fix outer scope node args failure
* Use NodeArg* as the key of the std::set instead of string
* Minor updates
Use the more robust implementation from DML's Algorithms.h.
```
engine\lotus\onnxruntime\core\providers\dml\OperatorAuthorHelper\Common.h(27): warning C4756: overflow in constant arithmetic
```
* add initializer checker for Gather with 1D input
* Check if indices value exists
* Update symbolic_shape_infer.py
* add unit test
* Update symbolic_shape_infer.py
* Update symbolic_shape_infer.py
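
The Gather commits above can be pictured roughly as follows. This is only a sketch of the idea, not the code in symbolic_shape_infer.py: the function name `gather_1d_values` and its signature are hypothetical, and it assumes `axis=0`, a 1-D input whose values are already known (for example from an initializer), and that a missing indices value is represented as `None`.

```python
from typing import List, Optional, Sequence, Union

# A str stands in for a symbolic dimension such as "seq" (an assumption of this sketch).
Value = Union[int, str]


def gather_1d_values(
    data: Optional[Sequence[Value]],
    indices: Optional[Union[int, Sequence[int]]],
) -> Optional[Union[Value, List[Value]]]:
    """Fold Gather(axis=0) over a 1-D input whose values are known.

    Returns None when either the data values or the indices value is
    unavailable, so the caller can fall back to shape-only inference.
    """
    if data is None or indices is None:
        return None
    n = len(data)

    def pick(i: int) -> Value:
        # ONNX Gather accepts indices in [-n, n-1]; reject anything else.
        if not -n <= i < n:
            raise IndexError(f"index {i} out of range for length {n}")
        return data[i]  # Python negative indexing matches Gather here

    if isinstance(indices, int):
        return pick(indices)  # scalar index gives a scalar result
    return [pick(i) for i in indices]
```

For instance, `gather_1d_values([2, "seq", 768], 1)` picks the symbolic dimension `"seq"`, while a call with `indices=None` returns `None` instead of raising.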
- A few more DML operators now support the INT64 data type directly.
- Operators such as Padding and ElementWise_Clip now have new DML structures that support int64 scalar values.
Related work items: #33883294
* implement cuda provider
* define profiler common
* call start after register
* add memcpy event
* add cuda correlation
* format code
* add cupti to test path
* switch to CUpti_ActivityKernel3
* reset cupti path
* fix test case
* fix trt pipeline
* add namespace
* format code
* exclude training from testing
* remove mutex
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not supported
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not supported
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not supported
* handle unused parameters
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not supported
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not supported
* handle unused parameters
* Disable some testcases
* only include opset13 for testing and add a keyword filter set
* rename variable
* add back code which was accidentally commented out in the previous commit
* Adjust model test filter for opset14
* Added code to support SoftmaxGrad
Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>
* Bringing back the opset filters for softmax that I had removed.
This will fix the test failures from the onnx repo.
Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>
Now that DML has int64 support directly, register the related operators for uint64/int64 (rather than the hack in the ORT DML EP with doubled strides).
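
For readers unfamiliar with the doubled-strides workaround, the sketch below shows one plausible reading of it: the 64-bit buffer is viewed as 32-bit words and read with twice the stride, so only the low word of each element is seen. This is an illustration in plain Python, not the DML EP code; the helper name `low_word_view` is hypothetical, and the sketch assumes a platform where the `"i"` format is 4 bytes wide.

```python
import sys
from array import array


def low_word_view(values):
    """View int64 data as int32 words, stepping two words per element."""
    buf = array("q", values)  # contiguous signed 64-bit integers
    words = memoryview(buf).cast("B").cast("i")  # same bytes as 32-bit words
    assert words.itemsize == 4  # assumption: native int is 32 bits
    # The low word comes first on little-endian hosts, second on big-endian.
    first = 0 if sys.byteorder == "little" else 1
    return words[first::2]  # stride is now 8 bytes, one read per int64


view = low_word_view([1, -2, 300])
print(view.tolist(), view.strides)
```

Values that fit in 32 bits round-trip unchanged, but anything larger is silently truncated (for example `2**40 + 5` reads back as `5`), which is the kind of limitation that native int64 support removes.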
## Remaining work
- Not implemented in DML: CumSum, Range, MaxPool/MaxUnpool, TopK, ReduceProd/Sum/SumSquare/L1
- Implemented in DML but need DML EP kernel work: Clip, Pad, Neg, Range, ConstantOfShape
```
te.exe OnnxConformanceTests.dll
Summary: Total=4454, Passed=4147, Failed=0, Blocked=0, Not Run=0, Skipped=307
```
Corresponding PR: https://microsoft.visualstudio.com/WindowsAI/_git/WindowsAI/pullrequest/6486426
Related work items: #28761231, #33883294