With this change, differentiating CUDA EP and ROCm EP is not needed in training script when mem_limit option needs to be set.
Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
use unordered_set instead of unordered_map to keep track of dynamic shape tensors with shape updates
fix: insert input_name in the set of input_names
move trt_profile to TensorrtFuncState and reuse it
* Integrate openvino-ep-2021.3
* operators type
* changed the myriad as it is case sensitive
* logging information for openvino-ep-2021.3
* Unit test fix
* Resize operator added for myriad
* Fixed python tests for CPU and GPU
* data commit for loop tile and gatherelements failure
* adding checks for Where
* fixing gatherelements and loop tests
* disabling instance normalization test for now as there seems to be a
myriad bug, putting loop in ops supported only because all the tests
fail
* gather elements op test taking care of warning message
* condition needs to be an intializers
* Disabled python test for Myriad
* Disable compilation warning for MSVC windows compiler
* softmax_test, threedimaxis0 and 1 test give accuracy mismatch
tensoroptest disables test gives accuracy mismatch
gather test gives accuracy mismatch
* Updated with ov version 2021.3
* Updated with ov version 2021.3
* Updated README
* Disabling python tests for cpu
* Disabling python tests with accuracy mismatch on cpu
* Added fix for Linux CI Pipeline failure
-> Disabled tests that were throwing segfault
Co-authored-by: sfatimar <sahar.fatima@intel/com>
Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com>
Co-authored-by: Aravind <aravindx.gunda@intel.com>
* Updates to some operators to always support int32 and int64 based on testing of Android package build config with a minimal build.
If an operator can be used for shape manipulation (int64) it is frequently used for indices manipulation (int32), so we enable both types for that set of ops.
- e.g. BERT models take indices as input
- Scatter/Gather ops utilize indices
Misc. fix to python bindings to exclude call that fails in a minimal build.
Resize is spec'd to ignore the "roi" tensor in certain modes. For some reason, converters are specifying an arbitrary value for this tensor, even though it's optional.
This makes the graph partitioner skip a check for empty shape dimensions for tensors such as this, which the DML kernel registers as consuming as CPU inputs. Otherwise, the node is not included in DML graph partitions, because the DML graph doesn't handle empty dimensions.
Related work items: #32221164
* Adding TorchEmbedding contrib op
* Update contrib_defs.cc
* Shape fix
* Update shape_inference_test_helper.h
* Fix typo
* Fix test
* Fix for test code
* Merge
* Fix CI
* Fix for CI
* Fix CI no-contrib
Enable type reduction for Scatter/ScatterElements CPU kernels. Some refactoring to reduce binary size.
Add MLTypeCallDispatcher methods.
Minor cleanup for Pad CPU kernel.