* Initial commit
* More ops
* fix missing declarations for ReduceSum and ReduceSumSquare
* Add tests for new ops supporting double
* isable Add_dobule for OpenVINO EP
The NCHWc transform was missing support for the Sum_6 operator from ONNX 1.2. Older models would add unnecessary reorder ops and also would not use the Conv/Add fusion.
* Add string attribute interface for C API.
* Add string attribute interface for C++ API accordingly.
* Update comment to say that string is also valid
* Initial commit
* Add test case
* Revert unintentional change
* Update comments
* Resolve PR feedback
* Craft test casse and add more logs
* Fix build failures
* Fix minor bug in the way modified is updated
* Remove full model inference session test
* Resolve PR comments
* Resolve more PR feedback
* Resolve more PR feedback
* Resolve more PR comments
* Remove logging
* Move GetInitializer() method to memcpy_transformer scope
* Remove some unnecessary blank lines
* Make GetInitializer static
* Validate input shapes.
* Cache some input def metadata
* Make some methods const and check for negative values of dims instead of just -1.
* Fix shape inferencing test.
* Fix testLabelEncoder test
* Fix more tests
* Fix more tests
* Use size_t for loop variable
* sync onnx to get equal op with float support
* doc update
* fix test failure because of updated shape inference logic for roialign.
* filter consum test cases since it's not implemented yet.
This change makes some optimizations on various places. This change consists of a part of PR #1240 (removed the problematic part) and some other trivial fix.
1. reduce unnecessary copy when constructing vector or objects that contains vector as member. use std::move when applicable.
2. use std::vector<std::reference_wrapper<const TensorShape>> instead of std::vector<TensorShape>, when it is only for constant reference usage.
3. calculate key BEFORE (instead of AFTER) acquire lock in SessionState::GetMemoryPatternGroup
other trivial fixes (code should be straightforward and self-explainable).
Description: Describe your changes.
Change the logic to find cublas dll
Motivation and Context
Why is this change required? What problem does it solve?
The name pattern of cublas changed since 10.1. It doesn't include minor version in its name anymore.
If it fixes an open issue, please link to the issue here.
This change implements Conv+Clip activation fusion for FusedConv and NCHWc convolutions. The Clip operation runs in the thread context that is producing the convolution output.
* Minor bug fixes for accelerators
* Added dimensionality checks for each graph input for GPU
* Disabled some tests for MYRAID and GPU
* This change is required for running some of the models on
OpenVINO instead of falling back to default CPU EP
Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
* PR Feedback
Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
* Fix missing bracket
Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
* Use INFO instead of WARNING for an unused graph input.
* Drop severity of unused initializer as well
* Update to output a warning level message if removing an initializer that is never used, and an info level message if removing an initializer that optimization has made redundant.
* Now that we check for a constant initializer in an ancestor graph we also need to be able to retrieve and replace that initializer.
Add helpers to do so.
Update optimizers to use the new helpers.
Fix bug in UnsqueezeElimination where it wasn't checking if the initializer it was replacing was constant.