Address #1155
Add debug helper methods to be able to dump input name and shape information for node inputs, and the data from node outputs.
As the input data comes from graph inputs, initializers or node outputs we don't dump it.
Must be manually enabled by building with '--cmake_extra_defines onnxruntime_DEBUG_NODE_INPUTS_OUTPUTS=ON'
1. Use cudnnRNNForwardInferenceEx for unpacked (padded) layout case, place the sequence_lens data on CPU
2. Fix hard code device ID issue. In cuda kernel, it should get the device id from provider.
Add ability to set the session and run logger severity via SessionOptions and RunOptions
Inherit severity from the next logger up if logger severity isn't specified in SessionOptions or RunOptions
Expose ability to set default logger severity in python bindings.
* Updated CPU GRU to zero Y output between max specified sequence length and max sequence length implied by input shape.
* Updated CPU LSTM to zero Y output between max specified sequence length and max sequence length implied by input shape.
* Disabled LSTMTest.ONNXRuntime_TestLSTMSequenceLengthShorterThanInputSequenceLength for nGraph execution provider and added TODO to investigate failure.
* subgraph with memcpy fix
* Linux compile errors fix
* Linux compile errors fix
* subgraph with memcpy fix
* Linux compile errors fix
* Linux compile errors fix
* memcpy (PR1020) fix implemented
* check graph viewer GetNode for nullptr at other plances
* documents
* Review changes (UseSubgraph simplified)
* static_cast<int> removed
* static_cast<int> removed 2
* fall back to CPU implementation in GetCapability()
* check shape for null. fall back to CPU implementation in GetCapability()
* backend data errors fixed
* PR review changes
* disable Opset10 tests
* removed tests from main.cc of test runner. added a check at GetCapability()
* backend data and Model-Zoo related fixes
* subgraph with memcpy fix
* Linux compile errors fix
* Linux compile errors fix
* subgraph with memcpy fix
* Linux compile errors fix
* memcpy (PR1020) fix implemented
* documents
* Review changes (UseSubgraph simplified)
* static_cast<int> removed
* fall back to CPU implementation in GetCapability()
* check shape for null. fall back to CPU implementation in GetCapability()
* backend data errors fixed
* PR review changes
* disable Opset10 tests
* removed tests from main.cc of test runner. added a check at GetCapability()
* backend data and Model-Zoo related fixes
* patch to run tests and models separatly
* add immutable exclude list
* trigger callback
* switch to set for failed cases
* tweak conditions
* restore space
* cancel judgement
* fix comment
* restore onnx install sequence
* better comment and set const string
* set set const
* revert change on install_onnx.sh
Advance ONNX submodule to 5c51f0dbbe88ee1536f17ee7bd462b2ab3772c52
This commit in ONNX contains a fix to ConstantOfShape test data.
Uncomment ConstantOfShape.
Update test script, make sure exclusions are uniform.
* added the runcoverage powershell script
* updated the run coverage script. added installation to the windows CI for trying
* exclude other parts of win ci
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* fix in the download script
* added the runtestcoverage script to the pipeline
* some typo fix
* formatting
* re-commenting previously commented block
* cleaned up the powershell script
* fix path in pipeline
* fix path in pipeline
* fixed model path
* some fixes
* excluded long running tests
* add the publish job
* uncomment other tasks
* fixed excluded tests
* some format correction
* stopped running the test debug
* try placing the tes-all at the beginning
* try running the failing test only
* edit run_coverage
* some fix
* skip onnx_model_test
* Added memory size log in powershell script
* try running the onnxruntime_test_all.exe separately from codecov
* enable error reporting, and double memory size in powershell
* corrected the set-item
* remove memory resize, since we are already at max 2 GB
* fixed the tvm.dll issue
* added back the onnx tests in codecov. added back the regular test run
* cleanup
* remove * from the the module path
* add junction target resolution for modules dir
* remove junction-resolution
* reduced tests
* added target extraction for the junction paths in build machine
* added the appropriate change in win ci pipeline to call the updated ps script
* fix typo
* added back all the tests that were disabled
* try fixing the source root
* cleanup and enable all tests
* increase timeout for windows CPU CI due to codecoverage
* templatized the code coverage steps. Conitnue on error with any codecoverage step
* change quote marks
This PR introduces a rewrite rule that replaces a Shape node with an initializer when the shape of the input is statically known through shape inference.
* ConstantOfShape CUDA implementation
* Enhance the fallback logic, so the case that Shape -> ... -> ConstantOfShape won't fallback ConstantOfShape to CPU provider
* move shared code to cpu implementation
* do the fill based on sizeof(data_type)
* update method access level
This fixes#1034: Can't Create Model Sessions on Different GPU
The root cause of the bug is that CUDA execution provider uses thread_local to save per-thread-context and allocator, and when two CUDA execution providers are running on the same thread there's a conflict. The fix is to add a std::unordered_map to differentiate EPs in the same thread.
* Update MaxPool & AveragePool to support opset 10
* fix build issue
* still use cudnn for MaxPool if dilation is not set or are default 1.
* fix build issue
* add version filter to failed tests
* exclude test from backend
* exclude shrink from opset 9
* fix compile err
* exclude certain version of constant shape
* enable flatten test
* fix compile err
* comment mvn test
* disable constantofshape test in x86
* disable x86 test
* get model version from imported opset
* test linux x86 case
* disable nonzero opset 10
* make mutex const
* test filter by commit id
* adjust substr offset
* Limit test platform
* remove change impacting TFModleInfo.h
* refactoring
* refactoring
* test x86 pipeline with filter
* add comment
* restrict version extraction on non-win
* restrict version extraction on non-win
* add tag
* exclude case from backend test
* remove dup
* remove dup
* make script runnable
* hard code adsolute path
* refactor log
* fix x86 compile err
* fix x86 compile err
* fix x86 compile err
* sync with latest tensorrt
* switch to regex
* fix cpu pipeline err
* test filter
* disable nonzero from all versions