* Remove Relu if followed by Clip. Update Clip 'min' if necessary.
Add unit test.
* Rename to match behaviour a little better.
* Update to match latest RewriteRule interface
* Add version and latest commit id to ORT Server
* Update cmake
* Change build id to build number
* Use target_compile_definitions instead of add_definitions
More C++ API improvements and cleanup
Add templates to tensor creation
Add run method that allows preallocated outputs
Simplify CreateTensor<T> to multiply by sizeof(T)
Convert io_types code
Optimize away vector copies in Session::Run
* Improve TensorRT GetCapability Accuracy
* Update onnxruntime_providers.cmake
* made changes based on feedback
* update unit tests for TensorRT
* update onnx-tensorrt submodule to v5.0 branch
* remove uncessary comments
* convert int32 to int64 at inferencing output
* add more data types in compute
* change returns in compute
* use StatusCode as return in compute
* CUDA CPU/GPU sync optimization
Even though CUDA device is capable of handling certain ops, it may be better to leave them on CPU especially for dynamic shape computations starting from Shape.
* Fix TensorRT test crash when fused graph may have null node in topological sort
* As we consistently use non-const reference for modifiable arguments that cannot be null, update the conventions to reflect that.
Add a note on qualifying 'auto' to make the intent clearer and it easier to notice accidental copies.
* Address PR comment by adding a statement around disabling copy/assignment/move for new classes until needed.
* onnx_backend_test_series.py update to allow specifying a single test to run.
The python unittest filtering is to a test method not test name so can't be used directly.
* Clarify help message.
First, we don't need this line of code.
Second, it may change path unintentionally. That, you want to use gcc from /usr/lib/ccache/gcc, but cmake pickup it from /usr/bin.
The shape of Loop inputs M and cond, according to below, could either be a scalar(rank 0), or a 1-d tensor(rank 1), and depending on the actual rank it may produce different outputs for subsequent nodes such as Gather. Thus ORT Loop operator cannot hard code these inputs to be rank 1. This PR also include some fixes for test failures caused by updating Conv shape inferences in ONNX(onnx/onnx#1988).
.Input(
0,
"M",
"A maximum trip-count for the loop specified at runtime. Optional."
" Pass empty string to skip.",
"I",
OpSchema::Optional)
.Input(
1,
"cond",
"A boolean termination condition. Optional. Pass empty string to skip.",
"B",
OpSchema::Optional)
...
.TypeConstraint(
"I",
{"tensor(int64)"},
"tensor of int64, which should be a scalar.")
.TypeConstraint(
"B",
{"tensor(bool)"},
"tensor of bool, which should be a scalar.")
* add x86 legs to ci
* minor update
* update platform from x86 to Win32
* remove --use_mklml from x86 build
* add win x64 mklml pipeline
* remove pybind and use_tvm from win x86
* Add build pipelines for --disable_contrib_ops (mac, lnx, win)
* remove --gen-doc generation for x86
* set environment variables during build to disablecontribops=on
* update to match aiinfra pipelines
* update test data url
* update mac pipeline test data
* remove gen_doc from win x64 leg
* update model files for nocontribops
* reset win-ci-pipeline.yml
* remove confidential models
* 3 stale tests were removed from ONNX. Remove them from the list of broken tests that we ignored.
ab52a5d31e
* Also remove some exclude tests that now psas
* Use local ort python package in server model tests
* Create symlink for onnxruntime during test
* Using generated _pb2.py in the build folder
* Generate onnx_ml_pb2.py from ONNX CMakeList.txt
* Update model tests for python package path
* Only use onnx python package from build
* Revert some changes for pb2.py generation
* Refactor C# to handle x86
* update run script
* Add Native win x86 tests
* Add native x86 tests for Linux
* Update linux tests scripts to control which tests are run
* update linux image name for x86 to prevent using cached image
* update to not run unit python unit tests unless pybind is specified
* remove --build_wheel as a core common arg. Python cannot run on x86 build
* update OrtGetNumOfDimensions to OrtGetDimensionsCount in rest of C#
* Change function signature
* Convert compute to use custom op style APIs
* Remove dead CustomOp function
* Use CustomOp API in TensorRT EP
* Switch to new API in ngraph
* fix ngraph due to incompatibility with PR1019
* temp disable onnx backend test test_operator_repeat_dim_overflow_cpu due to failure in ngraph ep
* disable tiny_yolov2 for ngraph
* temporarily disable operator_repeat_dim_overflow for ngraph due to graph resolve failure
* initial version. there's a bug
* allow nodes with multiple outputs to be removed if only one output is used by downstream ops
* move node output checks into their own methods
* add test data file
* address comments by @kkaranasos and @skottmckay
* address more comments by @kkaranasos
* a comment to clarify the position of the mask output