* add option numpy_version to build against the installed numpy version and not 1.15.0 (hardcoded version number), default is still 1.15.0
* add option skip_keras_test to skip keras test even if keras is installed (still enabled by default)
disable unnecessary warnings about ubuntu
* enable option PRIVATE for the compilation of the Python bindings (settings recommended on pybind11 documentation)
* test on debian 9
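
As an illustration of the two options above, here is a minimal sketch of how a build script could expose them. The flag spellings, the helper names, and the use of argparse are assumptions made for this example; they are not taken from the actual build script.

```python
import argparse

# Default named in the notes above; used unless the caller overrides it.
DEFAULT_NUMPY_VERSION = "1.15.0"


def parse_build_args(argv):
    """Parse the two illustrative options (hypothetical flag names)."""
    parser = argparse.ArgumentParser(description="Illustrative build options")
    parser.add_argument(
        "--numpy_version",
        default=DEFAULT_NUMPY_VERSION,
        help="numpy version to build against",
    )
    parser.add_argument(
        "--skip_keras_test",
        action="store_true",
        help="skip the keras test even if keras is installed",
    )
    return parser.parse_args(argv)


def installed_numpy_version():
    """Return the installed numpy version string, or None if numpy is absent."""
    try:
        import numpy
    except ImportError:
        return None
    return numpy.__version__
```

A caller that wants to build against the installed numpy could pass `installed_numpy_version()` as the value of `--numpy_version` when it is not None, and otherwise fall back to the default.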
* Prototype version that demonstrates it can work
* Switched to OrtValue and removed the OrtCustomOpTensor code.
* Support multiple outputs and reading of attributes
* Add custom domain handling to custom ops
* Update documentation
* more wording changes
* added linux packaging template and pipeline
* Update linux-packaging-pipeline.yml for Azure Pipelines
* fix path separator
* update copy command for linux
* fixed linux gpu artifact name, added mac build
* fixed vmImage syntax
* use 1 model at a time for macos
* added onnx test on Mac CI
* some refactor of the pipeline scripts
* try fixing the tensorproto for x86 build
* try __cdecl
* try C-style cast
* use ORTAPICALL
* put the deleter under the namespace
1. Support the new external data extension in ONNX 1.4 onnx/onnx#678
2. Enable onnxruntime_perf_test in Mac Build
3. move path_lib.h from onnx_test_runner source dir to onnxruntime_framework
4. Enable memory planner for string tensors
5. Make memory planner always enabled, to simplify model loading logic
6. Delete some duplicated code between onnxruntime_perf_test and onnx_test_runner
7. Delete win_getopt_mb lib.
8. Remove the dependency on Pathcch lib, which is only available on Windows 8 and newer.
* added packaging pipeline
* Update win-ci-pipeline.yml for Azure Pipelines
* put the c-api header file at root instead of under core/session
* Update win-ci-pipeline.yml for Azure Pipelines
* parameterize the windows build script
* Update win-package-pipeline.yml for Azure Pipelines
* fixed indenting
* fix parameter reference syntax
* try using arch = amd64 for the vcvarsall
* remove duplicate tasks
* use vcvarsall
* some more refactor
* fix typo
* factored out the packaging step into a template
* add x86 build to package pipeline
* use amd64 for vcvars arg
* added gpu pipeline. added msbuild platform param
* fix the msbuild platform
* use amd64 host for x86 build
* use buildarch=x86 for vcvarsall
* remove vcvars from setup steps
* add some logging for PNG lib, and disable fns_candy demo for win32
* set allocator alignment to 32 bit for win32 compiler
* disable parallel execution test for x86
* use 64 bit toolchain for x86 build
* add missing -T flag for toolset
* fix string delimiter in workingdirectory name for package build test step
* fix gpu pipeline
* make io_types test conditional
* use cuda 10 instead of cuda 9.1, similar to the ci build
* try some workaround on the io test
* undo inadvertent local change in build.py, also reenable the io test
* make all test run single threaded
* blacklist few failing tests for x86
* added some log in build.py
* edit build.py to disable parallel test
* add the failed tests into the blacklist for win32
* add tf_pasnet_large to blacklist
* change control flow for build.py onnx tests
* add README, license and TPN to the package
* updated build.py test sequence for parallel executor
* updated onnx test flow as per review comment
* add type checking log in the compare_mlvalue
* fix type cast
* blacklist some failed test as of now
* one more blacklisted test
* minor changes for ValidateInput in inferencesession level api.
* update
* validate real graph inputs and don't validate initializers
* refine the validation logic.
* remove the unnecessary validation test.
* ensure that exactly the expected input feeds are provided by the caller.
* fix tests.
* fix c# test failure.
* fix test case
* don't verify the error message which is hard to maintain.
* fix c# test case
* c# test
* fix test cases.
* test update
* update packaging numpy version to 1.15.0
* update numpy version in linux
* Install numpy 1.15.0
* Finish up numpy requirement after test
* Try fix
* Fix ci script
* Initial commit
* Adding shrink tests
* Fix formatting in shrink_test.cc
* Fix broken build
* More changes
* PR feedback and formatting
* Place files in the right location corresponding to def file location in onnx
* Exclude shrink model test in test_series.py
* Remove shrink from exclusion list in main.cc
* Adding test to exclusion list
* More tests
* Formatting
* PR feedback
* More changes
* PR feedback
* More changes
* Fix broken build
* Fix nit
* Break dependency on SessionState for ExecutionFrame and OpKernelContext so optimizers can execute a node with a minimal setup.
- Create IExecutionFrame
- split out core logic and interface from extended logic used in full Graph execution (that uses allocation plan and memory pattern planner)
- Update NodeIndexInfo to allow construction from a subset of nodes
- split out logic from GraphNodes into a re-usable template so it can be used with a vector of const Node* as well as a vector of unique_ptr<Node>
- Remove SessionState from OpKernelContext
- Misc cleanups
- move AllocPlanPerValue out of SequentialExecutionPlan as it's used in a more generic manner that isn't specific to a sequential execution plan
NOTE: I manually tested the new paths, especially NodeIndexInfo. Optimizers that use the new infrastructure will be added shortly, so these paths will get test coverage as part of those changes.
* Fix linux build issue.
Handle graph with no nodes in NodeIndexInfo.
* Add ability to enable caching to the C API, and update the internals to pass the feed names and MLValue instances in vectors so the order is deterministic (so cache entry matching works as expected).
* Address PR comment and don't use 'bool'
* Remove meaningless C# test around duplicate input.
We _could_ check input names for duplicates (previously we did this via the usage of unordered_map), but the system will gracefully handle the duplicate anyway (it will just use the last value provided for the input name).
Based on that, I don't think the cost of checking for duplicates is worth it.
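
The "last value wins" behaviour described above can be illustrated with a small sketch. This is not the runtime's code; `resolve_feeds` is a hypothetical helper that assumes feeds arrive as two parallel, ordered sequences of names and values.

```python
def resolve_feeds(feed_names, feed_values):
    """Map each input name to its value; a repeated name keeps the last value."""
    if len(feed_names) != len(feed_values):
        raise ValueError("feed_names and feed_values must have the same length")
    resolved = {}
    for name, value in zip(feed_names, feed_values):
        # A later entry with the same name silently overwrites the earlier one,
        # so no separate duplicate check is needed.
        resolved[name] = value
    return resolved
```

With this approach a duplicate name costs nothing extra to handle, which matches the reasoning that an explicit duplicate check is not worth its cost.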
* Fix c-style cast in test_run_options.
* Fix murmurhash when multiple strings are in the input. Also add an attribute to specify the output type, like scikit-learn. Add type inference to decide the output type based on this attribute.
* Address PR comment
* cleaned up the additional header in C-api
* ensure test failure surfaces in the build pipeline
* sanitized runtest.bat
* cleanup unneeded headers
* formatting and typos
* support non-tensor types
* fix compilation issues
* Build without mkldnn for release packages. We'll default to MLAS.
* Remove tvm as well
* Add openmp
* Add check for linux version supporting glibc 2.23 or higher
* Refactor the libc check to SessionOptions
* removed whitespace
* Update SessionOptions.cs
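
A minimal sketch of the kind of glibc version check mentioned above, written in Python for illustration only (the change itself touches SessionOptions.cs). The helper names are hypothetical. The point it shows is that versions should be compared numerically, since a plain string comparison would rank "2.9" above "2.23".

```python
import platform

# Minimum glibc version named in the notes above.
MIN_GLIBC = (2, 23)


def parse_version(text):
    """Turn 'major.minor[.patch]' into a (major, minor) tuple of ints."""
    return tuple(int(part) for part in text.split(".")[:2])


def glibc_supported(version_text, minimum=MIN_GLIBC):
    """Return True if version_text is at least the minimum glibc version."""
    return parse_version(version_text) >= minimum


def check_current_platform():
    """Return True/False for glibc systems, or None if glibc is not detected."""
    lib, version = platform.libc_ver()
    if lib != "glibc" or not version:
        return None
    return glibc_supported(version)
```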
* Various optimizations to reduce the setup and execution cost.
Cache information about the feeds and fetches, and any device copies required to execute the graph so we minimize checking for later calls to ExecuteGraph using the same input/output.
- enable use of caching in Loop and Scan
- make use of caching optional for InferenceSession::Run
- handle calls to Run with different feeds and fetches to support scenarios where there may be a truncated sequence in some calls
Take the feed names and MLValue instances as vectors so the order is deterministic.
Add unit tests
Update onnxruntime_perf_test to enable caching.
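
The caching idea above can be sketched as follows. This is an illustration under stated assumptions, not the actual implementation: `FeedsFetchesCache` and the `build_plan` callback are hypothetical names, and the "plan" stands in for whatever feed/fetch and device-copy information is expensive to compute. Keying on the ordered names is what makes taking the inputs as vectors (deterministic order) useful.

```python
class FeedsFetchesCache:
    """Cache per-call setup work, keyed on the ordered feed and output names."""

    def __init__(self, build_plan):
        # build_plan(feed_names, output_names) does the expensive setup once.
        self._build_plan = build_plan
        self._entries = {}

    def get(self, feed_names, output_names):
        # Tuples preserve order, so the same names in the same order
        # always map to the same cache entry.
        key = (tuple(feed_names), tuple(output_names))
        if key not in self._entries:
            self._entries[key] = self._build_plan(feed_names, output_names)
        return self._entries[key]

    def __len__(self):
        return len(self._entries)
```

A call with a different set of feeds or fetches (for example a truncated sequence) simply produces a different key and its own entry, rather than invalidating the existing one.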
* Couple of tweaks.
Fix shared library unit test failure.
Attempt to work around the MacOS build failure due to a VC++ bug around automatically including reaching-scope values in a lambda.
* Rework order of init in Run so we get nice error messages about invalid feed/output names.
* Refine logic around copying MLValue using execution provider so common code can be used. Simplify the logic due to this change.
Split the paths for executing with/without cached info so we can be more const correct with how FeedsFetchesManager is passed in. This makes it clearer when a shared instance can be used due to it being const.
Cache the FeedsFetchesManager instances in the control flow nodes. They can be re-used across calls to Compute.
* Removed unused local variable to fix some builds.
* Fix build issue by cleaning up some more unused params.
* Check names when using cache entry from SessionState. Add unit test.
* Fix an issue for Reduce Max and Min. Per the cuDNN documentation, only the Max/Min ops require the indices output; requesting indices for the other reduction ops reports an error.
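
A small sketch of the selection logic this fix implies, assuming indices are requested only for the max and min reductions. The function and the op-type strings are hypothetical; the returned strings are the names of the two `cudnnReduceTensorIndices_t` values, used here only as labels and not as real API calls.

```python
# Reductions for which an indices output is requested (assumption for this sketch).
INDEXED_REDUCTIONS = {"ReduceMax", "ReduceMin"}


def reduce_indices_mode(op_type):
    """Pick the indices mode for a reduction op type."""
    if op_type in INDEXED_REDUCTIONS:
        return "CUDNN_REDUCE_TENSOR_FLATTENED_INDICES"
    # Every other reduction must not request indices, otherwise cuDNN reports an error.
    return "CUDNN_REDUCE_TENSOR_NO_INDICES"
```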
* support non-tensor types
* support non-tensor types.
* fix compilation issues
* add test cases
* test cases
* add test cases
* try to fix string test case
* working now
* use allocator (broken)
* string test broken after using allocator
* full working example
* Fix PR comments