onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

Author	SHA1	Message	Date
Edward Chen	dda9f53bed	Build script logging updates (#8618 ) Log build.py command line arguments. Update subprocess logging to format arguments in a way that is easier to copy.	2021-08-05 09:41:17 -07:00
Edward Chen	e09321f4db	Update ORT format model conversion utility to optionally fail fast on model conversion failure. (#8589 )	2021-08-03 11:12:56 -07:00
Rachel Guo	0cf2ed029b	Add python binding for CoreML EP (#8472 ) * add pybind binding for coreml ep * update merged files * address comments * format * remove lines for non-macOS platform Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2021-07-29 10:06:47 -07:00
Vincent Wang	c8d210de29	Decouple Forward and Backward of ATenOp (#8301 ) * atenop for inference * assert if dtype mismatch * atenop config in frontend * fix orttrainer test * gradient def not only for ATenOp * bugfix * fix gradient input shape and type issue * fix after merge master	2021-07-23 16:53:26 +08:00
Edward Chen	c254c3c355	Fix issue with ONNX to ORT format model conversion script when given single model file as input. (#8323 )	2021-07-07 14:08:47 -07:00
Vincent Wang	f0f3012666	Add SoftmaxCrossEntropyLossInternal to Support Dynamic ignore_index Input (#7899 ) * add SoftmaxCrossEntropyLossInternal * bugfix and ut * fix ut * fix ut * support torch1.8.1 * function body for nll_loss_internal	2021-06-09 10:29:46 +08:00
Bowen Bao	a776b57160	Add shape inference to custom symbolic functions (#7937 ) Description: As title. Motivation and Context - PyTorch ONNX exporter heavily depends on ONNX shape inference to export accurate and efficient model. Custom symbolic function exports the op as contrib ops, thus exporter is unable to perform standard onnx shape inference. Models with dynamic shape inputs are affected.	2021-06-08 10:43:06 -07:00
Vincent Wang	71c4f5ddb2	ATenOp Enhancement (#7725 ) * config parser, default argument values * ut * win build * maxpool2d * fix win build * fix build * unfold atenop	2021-06-08 11:01:17 +08:00
Scott McKay	0fbec1b9c1	Update the operator documentation generation (#7787 ) * Update the operator documentation generation - Make layout a little nicer - Update to latest supported operators including training - Fix some links that are broken when the docs content is copied to github-pages - Fix incorrect usage of 'onnx.ai.ml' as the default domain - ML ops are now separated from the real default domain of 'onnx.ai' - Include CPU, CUDA and training kernels - exclude DNNL as it's not an EP we own * There are separate paths for CUDA and CUDNN as they are not guaranteed to be in the same location on a Windows machine. Use the CUDNN path when looking for the CUDNN library. * Enable validation of both contrib ops and operator kernels in build Filter generation so it's deterministic Add ability for CI to publish the md files as build artifacts if they differ so a developer can download and add to their PR to resolve any diffs. Remove workarounds for github-pages as that will now link to the github docs which display correctly	2021-06-02 17:47:40 +10:00
Scott McKay	57782b3463	Add supported operators/types documentation for the ORT Mobile package (#7807 ) * Add ability to generate documentation for the ORT Mobile package using the build configuration as input.	2021-05-26 15:57:40 +10:00
Yulong Wang	077e8c6b40	allow update_version.py to update new npm packages (#7746 ) * update versions for npm packages * remove package-lock.json in list	2021-05-18 16:15:19 -07:00
Vincent Wang	dac24f7d63	Add ATenOp and call aten::embedding and its Backward Op from ORT (#7590 ) * build with libtorch and impl torchembedding * fix op shape infer * local commit * atenfunctionop * call aten operator from online extension * rollback build.py * resolve comments * bugfix * fix build * fix ortmodule test * remove external outputs, resolve comments * resolve comments * export embedding to microsoft::atenop * bugfix	2021-05-13 09:24:27 +08:00
Scott McKay	830d9e54dd	Add script to dump initializer, NodeArg, Node and subgraph info from an ORT format model (#7516 )	2021-05-04 08:34:35 +10:00
Scott McKay	d6df5764d7	Android package infrastructure (#7430 ) * Include ORT format model conversion scripts and infrastructure in ORT python package. - tweak existing script setup so it can be easily run directly and from the ORT python package Add config file and readme for Android minimal build package Update ORT Mobile doco Disable warning if 'all' optimizations are enabled but NCHWc transformer is excluded (device specific optimizations don't apply in this scenario so the warning is moot). * Address PR comments	2021-04-30 14:23:54 +10:00
Yulong Wang	009f342caf	[JS] refactor Javascript/Typescript libraries in ONNX Runtime (#7308 ) * working on re-organizing js code for ortweb * remove dup files * move folder * fix common references * fix common es5 * add webpack to common * split interfact/impl * use cjs for node * add npmignore for common * update sourcemap config for common * update node * adjust folder/path in CI and build * update folder * nit: readme * add bundle for dev * correct nodejs paths * enable ORT_API_MANUAL_INIT * set name for umd library * correct name for commonjs export * add priority into registerBackend() * fix npm ci pwd * update eslintrc * revise code * revert package-lock lockfileVersion 2->1 * update prebuild * resolve comments * update document * revise eslint config * update eslint for typescript rules * revert changes by mistake in backend.ts * add env * resolve comments	2021-04-16 01:33:10 -07:00
Chun-Wei Chen	3ee9b0ec4d	Add detailed assertion error message (#7232 )	2021-04-05 10:05:40 -07:00
Scott McKay	329fd03bb4	Add int32_t as required type to some operators (#7192 ) * Updates to some operators to always support int32 and int64 based on testing of Android package build config with a minimal build. If an operator can be used for shape manipulation (int64) it is frequently used for indices manipulation (int32), so we enable both types for that set of ops. - e.g. BERT models take indices as input - Scatter/Gather ops utilize indices Misc. fix to python bindings to exclude call that fails in a minimal build.	2021-04-01 19:32:34 +10:00
Edward Chen	0ccfe6c86a	Enable type reduction for Scatter/ScatterElements CPU kernels (#7171 ) Enable type reduction for Scatter/ScatterElements CPU kernels. Some refactoring to reduce binary size. Add MLTypeCallDispatcher methods. Minor cleanup for Pad CPU kernel.	2021-03-30 11:02:24 -07:00
Scott McKay	9297527b7a	Enable NHWC transformer when generating ORT format model (#7126 ) * Allow specific optimizers to be disabled. - replace unused ability to specify just the optimizers to run - never used so not needed Allow the disabled list to be specified via the python bindings - expected usage is internal, so using kwargs for that so as not to pollute the documentation with stuff no user is likely to need Update the ORT format model conversion script to disable NCHWc transformer when level is 'all' - currently there aren't any known use cases where we'd want the NCHWc transformations to run as they create a device specific model and aren't used on ARM - the ORT format model is not expected to be generated on the target device (e.g. generate on Windows/Linux/macOS to deploy to Android/iOS) so there's a good chance we'd generate a useless/invalid model - default to 'all' as ARM and MLAS prefer NHWC and the NHWC transformer runs at that level * Add matching changes to optimizer generation in training code	2021-03-29 18:39:48 +10:00
Dmitri Smirnov	2bf54bcaa2	Fix bugs in sparsify script (#7134 ) Fix type and check.	2021-03-25 14:53:52 -07:00
Edward Chen	53392664d3	Enable type reduction for Shrink, Sign, SplitToSequence CPU kernels (#7090 ) Enable type reduction for Shrink, Sign, SplitToSequence CPU kernels. Some other type reduction changes including refactoring to specify element types in a single place.	2021-03-23 09:57:33 -07:00
Dmitri Smirnov	3b58fc7b97	Add types support for Sparse Initializer in Onnxruntime (#7004 ) Add types support for DenseToSparse and SparseToDense conversions Address the case of empty sparse values and indices when the initializer does not contain any NNZ. Add sparsify script.	2021-03-22 10:06:11 -07:00
Edward Chen	4cbb8e166a	Update kernel def hashing (#7019 ) Update the kernel def hashing in ORT format models. The new hashing logic ignores the ordering of type constraint types. This is a backward compatibility breaking change, but we don't guarantee backward compatibility yet.	2021-03-22 09:28:27 -07:00
Edward Chen	aa60a8368f	Update type reduction operator type usage processors set. (#6976 )	2021-03-11 09:22:53 -08:00
Edward Chen	b6c4a7ac54	Support required types when excluding typed registrations (#6871 )	2021-03-08 08:22:07 -08:00
jingyanwangms	f22f04a109	Add comment (#6860 ) Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-02 18:54:25 -08:00
Edward Chen	ee35be0129	Support specifying globally allowed types from build script (#6677 ) Add initial support for constraining operator kernel implementations (which support this type-granularity) to a set of allowed types from scripts.	2021-02-22 14:05:00 -08:00
Scott McKay	02c7873b0e	Update ORT model conversion script to support custom ops (#6701 ) * Add support for custom ops library to the ORT model conversion script Simplify model conversion now that we read ops from the ORT format model. Enable custom ops in the python bindings if custom ops are turned on in a minimal build. * Add test of model conversion involving custom ops.	2021-02-17 12:52:39 +10:00
Chun-Wei Chen	115e16b37b	ort_test_utils: skip creating input if it is an initializer (#6544 )	2021-02-05 17:34:08 -08:00
Scott McKay	c5d2538314	Add more kernels that have typed registrations to the operators we track type usage for. (#6565 )	2021-02-05 15:10:54 +10:00
Scott McKay	c49d1dbc4b	Add type reduction support to Slice and Transpose (#6547 ) * Add type reduction support to Slice and Transpose	2021-02-05 11:08:23 +10:00
Scott McKay	6cb8f8c812	Support disabling a typed kernel registration that uses the output type (#6530 ) * Update infrastructure to support disabling a typed kernel registration that uses output 0 for the type (vs. the normal use case of input 0).	2021-02-03 14:22:32 +10:00
Scott McKay	8d53ef69e5	Add type reduction support to Min, Max and Pow (#6519 ) * Add type reduction support to Min, Max and Pow Update the C++ type reduction infrastructure to allow specifying an opset for the supported types list, as those can change across opset versions. Minor updates to the type usage tracking script * Add 'all opsets' macros and constant	2021-02-03 06:51:35 +10:00
Scott McKay	c84bb9df9f	Add ability to track per operator types in reduced build config. (#6428 ) * Add ability to generate configuration that includes required types for individual operators, to allow build size reduction based on that. - Add python bindings for ORT format models - Add script to update bindings and help info - Add parsing of ORT format models - Add ability to enable type reduction to config generation - Update build.py to only allow operator/type reduction via config - simpler to require config to be generated first - can't mix a type aware (ORT format model only) and non-type aware config as that may result in insufficient types being enabled - Add script to create reduced build config - Update CIs	2021-01-29 07:59:51 +10:00
Edward Chen	042053c55e	Add support for running Android emulator from build.py on Windows. (#6317 )	2021-01-13 19:21:49 -08:00
Scott McKay	30c7fffbab	Expand the documentation on using compiling EPs with a minimal build (#5893 ) * Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them. Tweak conversion script to be noisier about attempted usage of 'all' optimization level. Co-authored-by: manashgoswami <magoswam@microsoft.com>	2020-12-02 09:12:36 +10:00
Scott McKay	f0142da59c	Add NNAPI to providers that can be used via the python bindings. (#5867 ) Update ORT model conversion script - add args for specifying optimization level and whether to use NNAPI - add logic to create a list of required ops and ORT format model that can be used with NNAPI	2020-11-21 09:18:35 +10:00
Edward Chen	bef06dac93	Automatically clean up build docker image cache. (#5843 ) Follow up to #5811 to automate cleanup of the build docker image cache. Added a script and build definition to clean up docker images that haven't been accessed recently.	2020-11-20 11:56:26 -08:00
Edward Chen	71e7c2b423	Cache build docker images in container registry. (#5811 ) This PR adds infrastructure to automatically cache docker images used in CI builds in a container registry. Currently, build images are pulled from a container registry for some builds and built every time for others. The container registry requires maintenance to keep the images up to date and building images every time wastes build agent resources. With this change, a given build image can be looked up in a cache container registry and if present, pulled, and otherwise, built and pushed. The uniqueness of a build image is determined by a hash digest of the dockerfile, docker build context directory, and certain "docker build" options. This digest is part of the image tag in the cache container repository. The cache container registry will need to be cleaned up periodically. This is not automated yet.	2020-11-17 17:02:24 -08:00
Chun-Wei Chen	5bd7241839	Raise output mismatch error in ort_test_dir_utils.py (#5364 )	2020-10-02 16:44:59 -07:00
Ashwini Khade	3f00b8db8f	move all experimental ops to version 1 of ms domain (#5287 ) * move all experimental ops to version 1 of ms domain * deprecate TransposeMatMul in favor of FusedMatMul * update documentation	2020-09-30 14:50:18 -07:00
Scott McKay	1ff3b2d5b8	Add ability to generate multiple test dirs so that different input mixes can be tested. (#5310 )	2020-09-29 12:55:15 +10:00
Changming Sun	17f1178c2e	Downgrade GCC (#5269 ) Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2020-09-24 21:14:54 -07:00
Guoyu Wang	78a29aebbc	[ORT Mobile] ORT Minimal E2E CI (#5200 ) * Modify the ort minimal CI to ort minimal e2e ci	2020-09-19 18:43:22 +10:00
Scott McKay	c46a480306	Update conversion script and process to simplify creating ORT format models and a minimal build (#5217 ) * Update conversion script and process to simplify creating ORT format models and a minimal build.	2020-09-18 18:49:54 +10:00
Scott McKay	796ddeb2cb	Remove serialization of outer scope value info in ORT format model (#5077 ) * Remove serialization of outer scope node arg info in ORT format model. We don't currently need it in a minimal build as only SessionState calls Graph::IsConstantInitializer and it doesn't search outer scope. If we do need it in the future the information can be calculated at runtime (small binary size cost to do so). Motivation: ORT format model was 32% bigger for a BERT model with multiple levels of subgraph and a lot of nodes due to this. Size is about 5% larger than the original ONNX model with the change. ORT format has type/shape info for all nodes, and this model has 2000 nodes so this seems reasonable. Added example code to dump ORT format model to json. Fixed misc bug in python test script around handling float and non-float expected output.	2020-09-08 17:43:42 +10:00
Scott McKay	b5c2932ae8	Last major set of ORT format model changes (#5056 ) * Add minimal build option to build.py Group some of the build settings so binary size reduction options are all together Make some cmake variable naming more consistent Replace usage of std::hash with murmurhash3 for kernel. std::hash is implementation dependent so can't be used. Add initial doco and ONNX to ORT model conversion script Misc cleanups of minimal build breaks.	2020-09-05 07:59:01 +10:00
Bowen Bao	73456f10cd	Fix contrib ops unregister to match pytorch behavior (#5052 )	2020-09-03 16:32:42 -07:00
Bowen Bao	22ba266bd6	Add flag to _internal_use to control export of contrib ops in ort trainer (#4968 )	2020-09-03 09:11:47 -07:00
Scott McKay	28445c88f9	Changes to enable saving and loading an ORT format model (#4995 ) * Changes to enable saving and loading an ORT format model via the public APIs. Cleanup session.py to try and make slightly more understandable. More refactoring is needed here. Couple of bug fixes * Fix bug in handling NodeArg serialization for optional inputs which has a name and no type info. * Address PR comments - tweak SessionOptions config to avoid double lookup - merge duplicated functionality in python binding around registering an EP with optional options Fix a couple of build issues. * Update C API to be consistent with python API - only load model in InferenceSession ctor if required - support loading ORT model in minimal build * Fix nodejs test. We get an invalid path error from LoadInterOp first now * Another attempt at fixing nodejs test. Error message depends on whether ENABLE_LANGUAGE_INTEROP_OPS is defined. Make the output consistent. The interop implementation looks suspicious given it appears to be internal code that is going via the public api. TBD if that should be fixed. * Fix couple of build issues. * Disable test temporarily so PR can be checked in. Will fix in separate PR that adds final pieces for minimal build as the test is required there. * Give up on nodejs test and make the match simpler. Fix init call in TrainingSession python to not pass through sess. it wasn't being used in Session anyway so passing it through just adds confusion. * Fix call to Session.__init__ in TrainingSession. Session now initializes Session._sess to None to make it clearer where the 'ownership' of that member is, and that needs to happen before TrainingSession sets it.	2020-09-03 09:10:48 -07:00


72 commits