onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-16 21:00:14 +00:00

Author	SHA1	Message	Date
Vincent Wang	91d42e9d85	Tool to Convert ONNX Model to TFEvents (#14160 ) A tool to convert ONNX model to tfevents so that we can use tensorboard to open it for visualization. This is especially useful for debugging when the ONNX model is too large to open by Netron. usage: onnx2tfevents.py [-h] [--logdir LOGDIR] [--model MODEL]	2023-01-28 15:09:15 +08:00
Edward Chen	3b382ea7e1	Free OrtStatus in ASSERT_ORT_STATUS_OK, make run_android_emulator.py work with newer JDK version (#14369 ) - Free OrtStatus in ASSERT_ORT_STATUS_OK in model_tests.cc - Make run_android_emulator.py work with newer JDK version	2023-01-20 09:27:47 -08:00
liqun Fu	1be36913cc	to work with onnx 1.13 rc, implement ver 18 reduce and optioanl ops, … (#13765 )	2023-01-09 10:26:16 -08:00
Edward Chen	2fa18ea77e	[React Native CI] Record more info to debug E2E test (#13329 ) Record more info from the React Native CI E2E test. In particular, log the view hierarchy when exiting the test and dump logs from Android emulator to the build output.	2022-10-18 17:21:28 -07:00
Scott McKay	4d8510611b	Update find_optimizer_opset_version_updates_required.py to use the ONNX headers to determine the latest opset. (#12484 ) Description: Use the onnx headers to find the latest opset for each operator. This allows the script to detect optimizers with `graph_utils::IsSupportedOptypeVersionAndDomain` calls that need updating when run during the update of the onnx commit id. Without this change issues are not detected until a new kernel is registered. Motivation and Context Detect optimizers that need updates as part of the ONNX update process.	2022-09-29 16:55:22 +10:00
RandySheriffH	a83a9ed6b0	Remove miscellaneous nuphar configs (#13070 ) Remove a handful of nuphar related configurations after deprecation. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-26 13:41:28 -07:00
Edward Chen	454f77cd94	Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791 ) # Motivation Currently, ORT minimal builds use kernel def hashes to map from nodes to kernels to execute when loading the model. As the kernel def hashes must be known ahead of time, this works for statically registered kernels. This works well for the CPU EP. For this approach to work, the kernel def hashes must also be known at ORT format model conversion time, which means the EP with statically registered kernels must also be enabled then. This is not an issue for the always-available CPU EP. However, we do not want to require that any EP which statically registers kernels is always available too. Consequently, we explore another approach to match nodes to kernels that does not rely on kernel def hashes. An added benefit of this is the possibility of moving away from kernel def hashes completely, which would eliminate the maintenance burden of keeping the hashes stable. # Approach In a full build, ORT uses some information from the ONNX op schema to match a node to a kernel. We want to avoid including the ONNX op schema in a minimal build to reduce binary size. Essentially, we take the necessary information from the ONNX op schema and make it available in a minimal build. We decouple the ONNX op schema from the kernel matching logic. The kernel matching logic instead relies on per-op information which can either be obtained from the ONNX op schema or another source. This per-op information must be available in a minimal build when there are no ONNX op schemas. We put it in the ORT format model. Existing uses of kernel def hashes to look up kernels are replaced with the updated kernel matching logic. We no longer store kernel def hashes in the ORT format model’s session state and runtime optimization representations. We no longer keep the logic to generate and ensure stability of kernel def hashes.	2022-09-20 14:24:59 -07:00
Edward Chen	31a1403e06	Add --output_dir option to convert_onnx_models_to_ort.py. (#12844 ) Add --output_dir option to convert_onnx_models_to_ort.py. Allows one to optionally specify an output directory for the converted model files.	2022-09-12 15:36:03 -07:00
Scott McKay	706e03c63d	Add azp run helper (#12832 ) * Add helper to add azp run comments to a PR.	2022-09-07 11:48:31 +10:00
Yulong Wang	c144acc534	Replace 'master' branch ref to 'main' in the code (#12547 )	2022-08-22 10:48:12 -07:00
Scott McKay	a3de1bbf7d	Update script to find optimizers that potentially need supported opset updates (#12330 ) * Update to handle multiline declarations for the kernels which are typical these days. * Update to new path for the cpu contrib_op kernel registrations. * Update tools/python/find_optimizer_opset_version_updates_required.py Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com>	2022-08-04 07:37:27 +10:00
Justin Chu	d64769c38e	Set black's target version (#11370 ) Description: Set black's target version to be py37 - py310 Motivation and Context Black by default targets its format for py3.10. Since our project supports python 3.7, we need to target version to all the python versions supported. Re-ran black. 13 files reformatted.	2022-04-27 14:52:19 -07:00
Justin Chu	fdce4fa6af	Format all python files under onnxruntime with black and isort (#11324 ) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316	2022-04-26 09:35:16 -07:00
Yulong Wang	5ee8e2e491	[js] use NPM and yarn to upgrade package version (#11059 )	2022-04-19 12:28:13 -07:00
Edward Chen	269be2fe63	Remove unnecessary option from convert_onnx_models_to_ort.py, fix old instructions. (#11088 ) Remove unnecessary --nnapi_partitioning_stop_ops option from convert_onnx_models_to_ort.py, fix old instructions.	2022-04-11 11:19:21 -07:00
Edward Chen	9371401746	Move node EP assignment for ORT format into SessionState::FinalizeSessionState() (#10944 ) Follow up to #10904. - Move node EP assignment for ORT format into SessionState::FinalizeSessionState(). - Add unit test for #10904. - Make convert_onnx_models_to_ort.py optimization level configurable via environment variable.	2022-03-28 10:37:22 -07:00
Scott McKay	91722e2bc4	Fix typos (#10935 )	2022-03-20 08:27:35 +10:00
Scott McKay	5cbacec854	Maintain aspect ratio by doing resize + crop in image_to_pb tool (#10887 )	2022-03-19 07:08:45 +10:00
Scott McKay	f385c73058	Fix a couple of issues with the python package tools (#10858 ) * Tweaks to the model utils * Add handling for a dim_value of -1 when replacing the entire input shape. This occurs in models exported from PaddlePaddle * make pytorch helpers accessible in package * make QDQ helpers accessible in package	2022-03-15 15:52:12 +10:00
Edward Chen	e53422c6d0	Update convert_onnx_models_to_ort.py to support runtime optimizations. (#10765 ) Add runtime optimization support to ONNX -> ORT format conversion script. Replace `--optimization_level`, `--use_nnapi`, and `--use_coreml` with a new `--optimization_style` option.	2022-03-14 16:50:41 -07:00
Scott McKay	6072c6b65e	Simplify QLinearConv registration so type reduction works with it. (#10747 ) * Simplify QLinearConv registration so type reduction works with it. * Update QLinearMatMul registration to be a standard typed registration	2022-03-04 14:06:04 +10:00
Rachel Guo	a9dc50ba8b	Add option to force QDQIsInt8Allowed to return true when exporting to ORT format (#10719 ) * wip * save * minor update * fix * fix * Revert "fix" This reverts commit `a76f364b2d`. * revert * revert * revert submodule removal * address pr comments * minor fix * address cr comments * fix format Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2022-03-02 23:26:14 -08:00
Scott McKay	4d3cd2f685	Add helper for optimizing a QDQ format model for usage with ORT. (#10595 ) * Add initial helper for optimizing a QDQ format model for usage with ORT. If a DQ node has multiple consumers it will end up in multiple QDQ node units. This is complicated to handle as each qdq unit could end up being handled by different execution providers. By duplicating the DQ node we simplify this logic. Generally the duplicate nodes will disappear when the qdq node unit is converted to a single node with a quantized operator. If there are qdq node units that are not able to be converted to use a quantized operator the ORT cleanup (pending) to drop remaining Q->DQ pairs between fp32 nodes can remove any remaining DQ nodes. * Fix pep8 warning Co-authored-by: Guoyu Wang <wanggy@outlook.com>	2022-02-21 09:26:19 +10:00
Scott McKay	2ca9566994	Add range of helpers for making usage of ORT Mobile easier. (#10458 ) * Add range of helpers for making usage of ORT Mobile easier.	2022-02-18 07:35:25 +10:00
Scott McKay	6545e24b60	Update mobile prebuilt package ops to add support for opset 14 and 15 (#9717 ) * Update required operators for prebuilt package to add opsets 14 and 15. Add helper script to check if the prebuilt package will support the model and if not why not. * Add support for multiple opsets being specified on a single line in the required operators config. This makes it easier to update the pre-built package config. It's also required for validation tools to work as they only have a single opset from the model and not per-operator opsets. If we only list the incremental ops we could merge in the ops from the previous opset, but that wouldn't give a way to drop an operator from being supported. Left the info on which ops changed though so we have a better feel for the cost of supporting each opset.	2021-11-18 10:44:39 +10:00
Guoyu Wang	5ad6dbb314	Remove experimental from ORT format namespace (#9729 ) * schema change * cc channges * remove temp debug code * Adding fbs namespace to session_state_flatbuffers_utils.h * Add fbs namepsace to all ort format utils	2021-11-11 19:46:30 -08:00
Viswanath Boga	9d84811fb6	fixing pypi pipeline for release (#9716 ) * fixing pypi pipeline for release * updated the script and correct python version * updated the version correctly with script changes * Remove 1.9.1	2021-11-10 17:33:51 -08:00
Edward Chen	011cb8fd48	Fix Where op type reduction processing (#9033 ) * Update type reduction script to track Where Op's second input type. * Clean up op_kernel_type_control.h includes. * Use more maintainable include.	2021-09-13 08:37:58 -07:00
Gary Miguel	47435311f4	Include pytorch_export_contrib_ops in inference builds (#8878 ) * Include pytorch_export_contrib_ops in inference builds Rename / move it from tools/python/register_custom_ops_pytorch_exporter to onnxruntime/python/tools/pytorch_export_contrib_ops. Rationale for inclusion in inference builds: This code is potentially useful for anyone using ORT, not just training. Rationale for new name: "Contrib op" is the nomenclature used within ORT to refer to the set of ops that are not in the standard op set but are included by default with ORT. This is more specific than "custom op", which is what the PyTorch exporter uses to refer to any non-standard op. Step 1 of addressing #8818. After this is merged I will update the docs. * Enable test_pytorch_export_contrib_ops.py in CI Fixes AB#1342330	2021-09-02 14:26:58 -07:00
Scott McKay	858989293d	Reduce binary size of strided copy used by Concat (#8913 ) * Change the strided copy to switch on data size not data type. Move to header so we can reduce on the enabled types. Setup type reduction for Concat now that it's using this implementation.	2021-09-02 08:19:20 +10:00
Dmitri Smirnov	f3083f4bf3	Support of sparse initializers with smaller indices data type (#8834 ) Support of sparse initializers with smaller indices data type to save space. Make the script more efficient by selecting indices data type and checking resulting sparse bytes Exclude new code from SPARSE_TENSORS	2021-08-27 14:02:48 -07:00
XiyinOSS	19b82b438b	GridSample OP implementation for CPU and CUDA (#8551 ) * GridSample OP implementation for CPU and CUDA Description: This change contains implementation for torch grid_sample OP. Cuda implementation contains contribution from Muscle Wu. * Use interpolation for out-of-bound points in zero padding mode Out-of-bound points in zeros padding mode changed from constant 0 to interpolation of surrounding pixels. This aligns with Pytorch implementation. A bug in CUDA batch offset calculation is fixed. Custom op exporter type is added. * Fix nearest bug in CPU * Update per CI build finding and review comments * Force float to avoid potential integer T issue * Style update * PR update * Remove c++17 feature from cuda code	2021-08-20 12:37:38 -07:00
Edward Chen	94c3e2048b	[convert_onnx_models_to_ort.py] Add option to specify NNAPI EP partitioning stop ops. (#8668 ) Add option to specify NNAPI EP partitioning stop ops from the ORT format model conversion script.	2021-08-19 13:02:28 -07:00
Rachel Guo	78759059f1	[CoreML EP]Make coreml ep build on non-macOS platform (#8677 ) * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * clean * remove unused defs * correct typo * remove onnxruntime_coreml_proto * cr comments * enablie nnapi/coreml in minimal build * enable nnapi/coreml in one build * refine dependencies * fix nnapi build failure and remove onnxruntime_coreml_proto dependencies in unit tests cmake files * small fix * fix * fix build * revert * fix build Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2021-08-18 09:35:32 -07:00
Edward Chen	dda9f53bed	Build script logging updates (#8618 ) Log build.py command line arguments. Update subprocess logging to format arguments in way that is easier to copy.	2021-08-05 09:41:17 -07:00
Edward Chen	e09321f4db	Update ORT format model conversion utility to optionally fail fast on model conversion failure. (#8589 )	2021-08-03 11:12:56 -07:00
Rachel Guo	0cf2ed029b	Add python binding for CoreML EP (#8472 ) * add pybind binding for coreml ep * update merged files * address comments * format * remove lines for non-macOS platform Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2021-07-29 10:06:47 -07:00
Vincent Wang	c8d210de29	Decouple Forward and Backward of ATenOp (#8301 ) * atenop for inference * assert if dtype mismatch * atenop config in frontend * fix orttrainer test * gradient def not only for ATenOp * bugfix * fix gradient input shape and type issue * fix after merge master	2021-07-23 16:53:26 +08:00
Edward Chen	c254c3c355	Fix issue with ONNX to ORT format model conversion script when given single model file as input. (#8323 )	2021-07-07 14:08:47 -07:00
Vincent Wang	f0f3012666	Add SoftmaxCrossEntropyLossInternal to Support Dynamic ignore_index Input (#7899 ) * add SoftmaxCrossEntropyLossInternal * bugfix and ut * fix ut * fix ut * support torch1.8.1 * function body for nll_loss_internal	2021-06-09 10:29:46 +08:00
Bowen Bao	a776b57160	Add shape inference to custom symbolic functions (#7937 ) Description: As title. Motivation and Context - PyTorch ONNX exporter heavily depends on ONNX shape inference to export accurate and efficient model. Custom symbolic function exports the op as contrib ops, thus exporter is unable to perform standard onnx shape inference. Models with dynamic shape inputs are affected.	2021-06-08 10:43:06 -07:00
Vincent Wang	71c4f5ddb2	ATenOp Enhancement (#7725 ) * config parser, default argument values * ut * win build * maxpool2d * fix win build * fix build * unfold atenop	2021-06-08 11:01:17 +08:00
Scott McKay	0fbec1b9c1	Update the operator documentation generation (#7787 ) * Update the operator documentation generation - Make layout a little nicer - Update to latest supported operators including training - Fix some links that are broken when the docs content is copied to github-pages - Fix incorrect usage of 'onnx.ai.ml' as the default domain - ML ops are now separated from the real default domain of 'onnx.ai' - Include CPU, CUDA and training kernels - exclude DNNL as it's not an EP we own * There are separate paths for CUDA and CUDNN as they are not guaranteed to be in the same location on a Windows machine. Use the CUDNN path when looking for the CUDNN library. * Enable validation of both contrib ops and operator kernels in build Filter generation so it's deterministic Add ability for CI to publish the md files as build artifacts if they differ so a developer can download and add to their PR to resolve any diffs. Remove workarounds for github-pages as that will now link to the github docs which display correctly	2021-06-02 17:47:40 +10:00
Scott McKay	57782b3463	Add supported operators/types documentation for the ORT Mobile package (#7807 ) * Add ability to generate documentation for the ORT Mobile package using the build configuration as input.	2021-05-26 15:57:40 +10:00
Yulong Wang	077e8c6b40	allow update_version.py to update new npm packages (#7746 ) * update versions for npm packages * remove package-lock.json in list	2021-05-18 16:15:19 -07:00
Vincent Wang	dac24f7d63	Add ATenOp and call aten::embedding and its Backward Op from ORT (#7590 ) * build with libtorch and impl torchembedding * fix op shape infer * local commit * atenfunctionop * call aten operator from online extension * rollback build.py * resolve comments * bugfix * fix build * fix ortmodule test * remove external outputs, resolve comments * resolve comments * export embedding to microsoft::atenop * bugfix	2021-05-13 09:24:27 +08:00
Scott McKay	830d9e54dd	Add script to dump initializer, NodeArg, Node and subgraph info from an ORT format model (#7516 )	2021-05-04 08:34:35 +10:00
Scott McKay	d6df5764d7	Android package infrastructure (#7430 ) * Include ORT format model conversion scripts and infrastructure in ORT python package. - tweak existing script setup so it can be easily run directly and from the ORT python package Add config file and readme for Android minimal build package Update ORT Mobile doco Disable warning if 'all' optimizations are enabled but NCHWc transformer is excluded (device specific optimizations don't apply in this scenario so the warning is moot). * Address PR comments	2021-04-30 14:23:54 +10:00
Yulong Wang	009f342caf	[JS] refactor Javascript/Typescript libraries in ONNX Runtime (#7308 ) * working on re-organizing js code for ortweb * remove dup files * move folder * fix common references * fix common es5 * add webpack to common * split interfact/impl * use cjs for node * add npmignore for common * update sourcemap config for common * update node * adjust folder/path in CI and build * update folder * nit: readme * add bundle for dev * correct nodejs paths * enable ORT_API_MANUAL_INIT * set name for umd library * correct name for commonjs export * add priority into registerBackend() * fix npm ci pwd * update eslintrc * revise code * revert package-lock lockfileVersion 2->1 * update prebuild * resolve comments * update document * revise eslint config * update eslint for typescript rules * revert changes by mistake in backend.ts * add env * resolve comments	2021-04-16 01:33:10 -07:00
Chun-Wei Chen	3ee9b0ec4d	Add detailed assertion error message (#7232 )	2021-04-05 10:05:40 -07:00
Scott McKay	329fd03bb4	Add int32_t as required type to some operators (#7192 ) * Updates to some operators to always support int32 and int64 based on testing of Android package build config with a minimal build. If an operator can be used for shape manipulation (int64) it is frequently used for indices manipulation (int32), so we enable both types for that set of ops. - e.g. BERT models take indices as input - Scatter/Gather ops utilize indices Misc. fix to python bindings to exclude call that fails in a minimal build.	2021-04-01 19:32:34 +10:00
Edward Chen	0ccfe6c86a	Enable type reduction for Scatter/ScatterElements CPU kernels (#7171 ) Enable type reduction for Scatter/ScatterElements CPU kernels. Some refactoring to reduce binary size. Add MLTypeCallDispatcher methods. Minor cleanup for Pad CPU kernel.	2021-03-30 11:02:24 -07:00
Scott McKay	9297527b7a	Enable NHWC transformer when generating ORT format model (#7126 ) * Allow specific optimizers to be disabled. - replace unused ability to specify just the optimizers to run - never used so not needed Allow the disabled list to be specified via the python bindings - expected usage is internal, so using kwargs for that so as not to pollute the documentation with stuff no user is likely to need Update the ORT format model conversion script to disable NCHWc transformer when level is 'all' - currently there aren't any known use cases where we'd want the NCHWc transformations to run as they create a device specific model and aren't used on ARM - the ORT format model is not expected to be generated on the target device (e.g. generate on Windows/Linux/macOS to deploy to Android/iOS so there's a good chance we'd generate a useless/invalid model - default to 'all' as ARM and MLAS prefer NHWC and the NHWC transformer runs at that level * Add matching changes to optimizer generation in training code	2021-03-29 18:39:48 +10:00
Dmitri Smirnov	2bf54bcaa2	Fix bugs in sparsify script (#7134 ) Fix type and check.	2021-03-25 14:53:52 -07:00
Edward Chen	53392664d3	Enable type reduction for Shrink, Sign, SplitToSequence CPU kernels (#7090 ) Enable type reduction for Shrink, Sign, SplitToSequence CPU kernels. Some other type reduction changes including refactoring to specify element types in a single place.	2021-03-23 09:57:33 -07:00
Dmitri Smirnov	3b58fc7b97	Add types support for Sparse Initializer in Onnxruntime (#7004 ) Add types support for DenseToSparse and SparseToDense conversions Address the case of empty sparse values and indicies when the initializer does not contain any NNZ. Add sparsify script.	2021-03-22 10:06:11 -07:00
Edward Chen	4cbb8e166a	Update kernel def hashing (#7019 ) Update the kernel def hashing in ORT format models. The new hashing logic ignores the ordering of type constraint types. This is a backward compatibility breaking change, but we don't guarantee backward compatibility yet.	2021-03-22 09:28:27 -07:00
Edward Chen	aa60a8368f	Update type reduction operator type usage processors set. (#6976 )	2021-03-11 09:22:53 -08:00
Edward Chen	b6c4a7ac54	Support required types when excluding typed registrations (#6871 )	2021-03-08 08:22:07 -08:00
jingyanwangms	f22f04a109	Add comment (#6860 ) Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-02 18:54:25 -08:00
Edward Chen	ee35be0129	Support specifying globally allowed types from build script (#6677 ) Add initial support for constraining operator kernel implementations (which support this type-granularity) to a set of allowed types from scripts.	2021-02-22 14:05:00 -08:00
Scott McKay	02c7873b0e	Update ORT model conversion script to support custom ops (#6701 ) * Add support for custom ops library to the ORT model conversion script Simplify model conversion now that we read ops from the ORT format model. Enable custom ops in the python bindings if custom ops are turned on in a minimal build. * Add test of model conversion involving custom ops.	2021-02-17 12:52:39 +10:00
Chun-Wei Chen	115e16b37b	ort_test_utils: skip creating input if it is an initializer (#6544 )	2021-02-05 17:34:08 -08:00
Scott McKay	c5d2538314	Add more kernels that have typed registrations to the operators we track type usage for. (#6565 )	2021-02-05 15:10:54 +10:00
Scott McKay	c49d1dbc4b	Add type reduction support to Slice and Transpose (#6547 ) * Add type reduction support to Slice and Transpose	2021-02-05 11:08:23 +10:00
Scott McKay	6cb8f8c812	Support disabling a typed kernel registration that uses the output type (#6530 ) * Update infrastructure to support disabling a typed kernel registration that uses output 0 for the type (vs. the normal use case of input 0).	2021-02-03 14:22:32 +10:00
Scott McKay	8d53ef69e5	Add type reduction support to Min, Max and Pow (#6519 ) * Add type reduction support to Min, Max and Pow Update the C++ type reduction infrastructure to allow specifying an opset for the supported types list, as those can change across opset versions. Minor updates to the type usage tracking script * Add 'all opsets' macros and constant	2021-02-03 06:51:35 +10:00
Scott McKay	c84bb9df9f	Add ability to track per operator types in reduced build config. (#6428 ) * Add ability to generate configuration that includes required types for individual operators, to allow build size reduction based on that. - Add python bindings for ORT format models - Add script to update bindings and help info - Add parsing of ORT format models - Add ability to enable type reduction to config generation - Update build.py to only allow operator/type reduction via config - simpler to require config to be generated first - can't mix a type aware (ORT format model only) and non-type aware config as that may result in insufficient types being enabled - Add script to create reduced build config - Update CIs	2021-01-29 07:59:51 +10:00
Edward Chen	042053c55e	Add support for running Android emulator from build.py on Windows. (#6317 )	2021-01-13 19:21:49 -08:00
Scott McKay	30c7fffbab	Expand the documentation on using compiling EPs with a minimal build (#5893 ) * Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them. Tweak conversion script to be noisier about attempted usage of 'all' optimization level. Co-authored-by: manashgoswami <magoswam@microsoft.com>	2020-12-02 09:12:36 +10:00
Scott McKay	f0142da59c	Add NNAPI to providers that can be used via the python bindings. (#5867 ) Update ORT model conversion script - add args for specifying optimization level and whether to use NNAPI - add logic to create a list of required ops and ORT format model that can be used with NNAPI	2020-11-21 09:18:35 +10:00
Edward Chen	bef06dac93	Automatically clean up build docker image cache. (#5843 ) Follow up to #5811 to automate cleanup of the build docker image cache. Added a script and build definition to clean up docker images that haven't been accessed recently.	2020-11-20 11:56:26 -08:00
Edward Chen	71e7c2b423	Cache build docker images in container registry. (#5811 ) This PR adds infrastructure to automatically cache docker images used in CI builds in a container registry. Currently, build images are pulled from a container registry for some builds and built every time for others. The container registry requires maintenance to keep the images up to date and building images every time wastes build agent resources. With this change, a given build image can be looked up in a cache container registry and if present, pulled, and otherwise, built and pushed. The uniqueness of a build image is determined by a hash digest of the dockerfile, docker build context directory, and certain "docker build" options. This digest is part of the image tag in the cache container repository. The cache container registry will need to be cleaned up periodically. This is not automated yet.	2020-11-17 17:02:24 -08:00
Chun-Wei Chen	5bd7241839	Raise output mismatch error in ort_test_dir_utils.py (#5364 )	2020-10-02 16:44:59 -07:00
Ashwini Khade	3f00b8db8f	move all experimental ops to version 1 of ms domain (#5287 ) * move all experimental ops to version 1 of ms domain * deprecate TransposeMatMul in favor of FusedMatMul * update documentation	2020-09-30 14:50:18 -07:00
Scott McKay	1ff3b2d5b8	Add ability to generate multiple test dirs so that different input mixes can be tested. (#5310 )	2020-09-29 12:55:15 +10:00
Changming Sun	17f1178c2e	Downgrade GCC (#5269 ) Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2020-09-24 21:14:54 -07:00
Guoyu Wang	78a29aebbc	[ORT Mobile] ORT Minimal E2E CI (#5200 ) * Modify the ort minimal CI to ort minimal e2e ci	2020-09-19 18:43:22 +10:00
Scott McKay	c46a480306	Update conversion script and process to simplify creating ORT format models and a minimal build (#5217 ) * Update conversion script and process to simplify creating ORT format models and a minimal build.	2020-09-18 18:49:54 +10:00
Scott McKay	796ddeb2cb	Remove serialization of outer scope value info in ORT format model (#5077 ) * Remove serialization of outer scope node arg info in ORT format model. We don't currently need it in a minimal build as only SessionState calls Graph::IsConstantInitializer and it doesn't search outer scope. If we do need it in the future the information can be calculated at runtime (small binary size cost to do so). Motivation: ORT format model was 32% bigger for a BERT model with multiple levels of subgraph and a lot of nodes due to this. Size is about 5% larger of the original ONNX model with the change. ORT format has type/shape info for all nodes, and this model has 2000 nodes so this seems reasonable. Added example code to dump ORT format model to json. Fixed misc bug in python test script around handling float and non-float expected output.	2020-09-08 17:43:42 +10:00
Scott McKay	b5c2932ae8	Last major set of ORT format model changes (#5056 ) * Add minimal build option to build.py Group some of the build settings so binary size reduction options are all together Make some cmake variable naming more consistent Replace usage of std::hash with murmurhash3 for kernel. std::hash is implementation dependent so can't be used. Add initial doco and ONNX to ORT model conversion script Misc cleanups of minimal build breaks.	2020-09-05 07:59:01 +10:00
Bowen Bao	73456f10cd	Fix contrib ops unregister to match pytorch behavior (#5052 )	2020-09-03 16:32:42 -07:00
Bowen Bao	22ba266bd6	Add flag to _internal_use to control export of contrib ops in ort trainer (#4968 )	2020-09-03 09:11:47 -07:00
Scott McKay	28445c88f9	Changes to enable saving and loading an ORT format model (#4995 ) * Changes to enable saving and loading an ORT format model via the public APIs. Cleanup session.py to try and make slightly more understandable. More refactoring is needed here. Couple of bug fixes * Fix bug in handling NodeArg serialization for optional inputs which has a name and no type info. * Address PR comments - tweak SessionOptions config to avoid double lookup - merge duplicated functionality in python binding around registering an EP with optional options Fix a couple of build issues. * Update C API to be consistent with python API - only load model in InferenceSession ctor if required - support loading ORT model in minimal build * Fix nodejs test. We get an invalid path error from LoadInterOp first now * Another attempt at fixing nodejs test. Error message depends on whether ENABLE_LANGUAGE_INTEROP_OPS is defined. Make the output consistent. The interop implementation looks suspicious given it appears to be internal code that is going via the public api. TBD if that should be fixed. * Fix couple of build issues. * Disable test temporarily so PR can be checked in. Will fix in separate PR that adds final pieces for minimal build as the test is required there. * Give up on nodejs test and make the match simpler. Fix init call in TrainingSession python to not pass through sess. it wasn't being used in Session anyway so passing it through just adds confusion. * Fix call to Session.__init__ in TrainingSession. Session now initializes Session._sess to None to make it clearer where the 'ownership' of that member is, and that needs to happen before TrainingSession sets it.	2020-09-03 09:10:48 -07:00
Nat Kershaw (MSFT)	8a03b6e5c7	Render Operator documentation as compliant markdown (#3658 )	2020-09-02 15:07:50 -07:00
Ksenija Stanojevic	ea37a4d89b	Add Trilu custom op (#4537 ) Co-authored-by: neginraoof <neginmr@utexas.edu>	2020-08-17 14:42:26 -07:00
gwang-msft	db475c4f35	Add option for onnx_test_runner can pause after launch, make create_test_dir work on non-windows os (#4618 ) * minor fix for test dir util * add pause option for onnx_test_runner * add flush std to show pause prompt text Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-07-29 11:47:01 -07:00
Negin Raoof	37cbe8551d	Adding export registration and tests for custom ops (#4248 )	2020-06-25 22:29:02 -07:00
edgchen1	63bf587623	Use azcopy to download test data (#4221 ) Use azcopy from download_e2e_test_data.py, add helper function for downloading azcopy. Update download_test_data.py to use helper function.	2020-06-16 10:14:34 -07:00
Yulong Wang	2b3ce1b090	add script to support update nodejs binding version (#4164 )	2020-06-09 13:12:55 -07:00
Wenbing Li	ee35320974	The fixings for python scripts in ONNXRuntime (#4135 ) * The fixings for python scripts in ONNXRuntime * update according the comments	2020-06-08 10:27:32 -07:00
Scott McKay	5e0928a777	Enable running PEP8 on python scripts using flake8 (#3928 ) * Enable running PEP8 checks via flake8 as part of the build if flake8 is installed. Update scripts in \tools and \onnxruntime\python. Excluding \onnxruntime\python\tools which needs a lot more work to be PEP8 compliant. Also excluding orttraining\tools for the same reason. Install flake8 as part of the static_analysis build task in the Win-CPU CI so the checks are run in one CI build. Update coding standards doc.	2020-05-15 07:15:06 +10:00
Scott McKay	0cdc1459c6	Add a few more things to the helper python scripts. (#3842 ) * Add a few more things to the helper python scripts. Add documentation with usage examples. * Address PR comments	2020-05-12 09:59:40 +10:00
Scott McKay	42cf971ca2	Add a couple of utility scripts to tools/python (#3621 ) * Add a helper script to more easily create a test directory for use with onnx_test_runner or onnxruntime_perf_test. Add example script that can be used as a base for performance testing a model with a variety of input sizes. Add __init__.py so files in this directory can be imported in other scripts. * Fix some flake8 warnings. Add example of specifying attribute for op. * Add ability for test dir creation to fill in all missing input data with random values. Add example of using test dir creation this way	2020-05-02 17:35:43 +10:00
Hariharan Seshadri	b4457ecb7a	Fix `gen_doc` build option and refresh documentation (#3545 ) * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit * Initial commit * More changes * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit * Initial commit * More changes * Add md files * Doc changes * Update * revert cmake changes * Update * Doc change * Update * Update	2020-04-17 14:41:04 -07:00
Yufeng Li	baa86f181f	Handle the case that initializers are in graph input (#3449 ) warn that initializers are in graph input provide a tool to move initializer out of graph input Motivation and Context ONNX model from IR_VERSION 4 only treats initializers that appear in graph input as non-constant. This may fail some of the graph optimizations, like const folding, operator fusion and etc. Warn the case and provide a tool.	2020-04-14 09:06:04 -07:00
Tiago Koji Castro Shibata	cff266e1b9	Fix cgmanifest.json generating script (#2770 ) * Fix protobuf submodule name * Workaround pygit2 bug	2020-01-14 14:59:07 -08:00
Scott McKay	7a5de9c958	Add a python script with a number of helper actions for creating/editing/dumping onnx test runner format pb files (#2294 ) * Add a python script with a number of helper actions for creating/editing/dumping onnx test running format pb files.	2019-11-01 06:39:14 +10:00
Scott McKay	6699c19010	Add script to find if optimizer updates are required due to onnx operator updates. (#1957 ) * Add script to find calls to graph_utils::IsSupportedOptypeVersionAndDomain where the latest supported version is prior to the latest defined version.	2019-10-21 11:38:30 -07:00
shahasad	0c5d2c998b	Generate documentation from the registered operator kernels (#1395 ) - Added python script for generating markdown doc from the registered opkernels. - Made some conditional changes in the pybind to expose necessary python API - Added some missing type-constraints in the op kernel registrations	2019-08-14 18:12:24 -07:00

1 2 3 4

156 commits