onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-01 03:45:06 +00:00

Author	SHA1	Message	Date
Justin Chu	a36caba073	Bump ruff in CI (#15533 ) ### Description Bump ruff version in CI and fixed new lint errors. - This change enables the flake8-implicit-str-concat rules which helps detect unintended string concatenations: https://beta.ruff.rs/docs/rules/#flake8-implicit-str-concat-isc - Update gitignore to include common python files that we want to exclude. ### Motivation and Context Code quality	2023-04-17 10:11:44 -07:00
Justin Chu	938e2136c6	Enable pylint and numpy rules (#15218 ) ### Description Enable pylint and numpy rules ### Motivation and Context Modernize numpy usage and enable more quality checks	2023-03-27 20:37:53 -07:00
Justin Chu	d834ec895a	Adopt linrtunner as the linting tool - take 2 (#15085 ) ### Description `lintrunner` is a linter runner successfully used by pytorch, onnx and onnx-script. It provides a uniform experience running linters locally and in CI. It supports all major dev systems: Windows, Linux and MacOs. The checks are enforced by the `Python format` workflow. This PR adopts `lintrunner` to onnxruntime and fixed ~2000 flake8 errors in Python code. `lintrunner` now runs all required python lints including `ruff`(replacing `flake8`), `black` and `isort`. Future lints like `clang-format` can be added. Most errors are auto-fixed by `ruff` and the fixes should be considered robust. Lints that are more complicated to fix are applied `# noqa` for now and should be fixed in follow up PRs. ### Notable changes 1. This PR removed some suboptimal patterns: - `not xxx in` -> `xxx not in` membership checks - bare excepts (`except:` -> `except Exception`) - unused imports The follow up PR will remove: - `import *` - mutable values as default in function definitions (`def func(a=[])`) - more unused imports - unused local variables 2. Use `ruff` to replace `flake8`. `ruff` is much (40x) faster than flake8 and is more robust. We are using it successfully in onnx and onnx-script. It also supports auto-fixing many flake8 errors. 3. Removed the legacy flake8 ci flow and updated docs. 4. The added workflow supports SARIF code scanning reports on github, example snapshot: ![image](https://user-images.githubusercontent.com/11205048/212598953-d60ce8a9-f242-4fa8-8674-8696b704604a.png) 5. Removed `onnxruntime-python-checks-ci-pipeline` as redundant ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Unified linting experience in CI and local. Replacing https://github.com/microsoft/onnxruntime/pull/14306 --------- Signed-off-by: Justin Chu <justinchu@microsoft.com>	2023-03-24 15:29:03 -07:00
Adrian Lizarraga	db9c677b63	[EP Perf Dashboard] Add TensorRT 8.5.1.1 dockerfile (#13843 ) ### Description - Adds a dockerfile for Ubuntu with TensorRT 8.5.1.1. - Adds option to run EP Perf pipeline with TensorRT 8.5 ### Motivation and Context Necessary to benchmark models with TensorRT 8.5	2022-12-09 14:33:52 -08:00
stevenlix	ce0025d3f2	Fallback Pow op in layer norm to FP32 in TRT to avoid overflow (#13639 ) Accuracy loss is observed when transformer models such as BERT, DeBERTa, ViT are running in TRT FP16 mode. The cause is that overflow happens at Pow op in layer norm. This PR provides the option to force Pow to run in TRT FP32 precision if overflow occurs. Co-authored-by: Ubuntu <azureuser@orteplinuxdev.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>	2022-11-29 13:37:31 -08:00
Adrian Lizarraga	281f199754	[EP-Perf-Dashboard] Reduce script excessive output (#13562 ) ### Description Properly cleans up all temporary resources created while running benchmarks. Details: - Dump all temporary artifacts (TRT engines, TRT profiles, inference profiles, fp16 models) into a temp directory in `/tmp/`. Each model/EP combination has its own temp directory that is deleted after validation and benchmarking. - Allow running both validation and benchmarking in one invocation of the benchmark.py script. This is necessary to allow the benchmarking step to reuse artifacts (e.g., TRT engines) created during validation. Before this PR, we ran validation on all model/EP combinations before running benchmarks on all combinations again. This required us to keep all temporary artifacts for all model/EP combinations throughout the entire run (expensive). - Create individual functions for validation and benchmarking (split-up large function that did it all) ### Motivation and Context The EP Perf pipeline failed to run because the script generated too much output and the VM ran out of disk space.	2022-11-08 16:17:29 -08:00
Adrian Lizarraga	418304743d	[EP-Perf-Dashboard] Update table schemas (#13327 ) Updates EP perf benchmarking scripts to upload new data with an improved table schema. In order to preserve compatibility with the current benchmarking pipeline, we still upload data that uses the old schema as well. These changes are required in order to improve data filtering capabilities and general UX in dashboards that visualize this data. Details: - EP names no longer hardcoded as columns for tables that store inference latency, session creation times, memory usage, and model/EP status. - Add explicit branch, commit ID, and commit date columns to all tables - Improvements to the docker image building scripts (simplify docker image build; support installing binary TensorRT packages) - Remove use of deprecated DataFrame.append in favor of pandas.concat.	2022-10-19 16:15:05 -07:00
Tony Xia	962fee5fe5	Fix typo enviroment => environment (#13195 )	2022-10-03 17:02:26 -07:00
Adrian Lizarraga	39e20686a0	[EP Perf Dashboard] Fix incorrect calls to trtexec with fp16 inputs (#13018 )	2022-09-21 10:31:45 -07:00
Dmitri Smirnov	bc2df1bf95	Remove previously deprecated API (#12935 ) Remove previously deprecated API Format JS code, address review comments NPM Formatting	2022-09-14 10:58:03 -07:00
Yulong Wang	c144acc534	Replace 'master' branch ref to 'main' in the code (#12547 )	2022-08-22 10:48:12 -07:00
Adrian Lizarraga	ad4abbd75e	[EP-Perf-Dashboard] Add support for TensorRT 8.4 to EP Perf Dashboard (#11876 ) Co-authored-by: George Wu <jywu@microsoft.com>	2022-06-17 09:16:51 -07:00
stevenlix	bd65acd08d	Share execution context memory between TensorRT subgraphs (#11859 ) * share trt context memory * update parser to 8.4-EA * update parser to 8.4-GA * add context memory sharing enable option * update parser to 8.2-GA * fix format issue * reverse orders * fix format * fix format * fix issues	2022-06-16 22:42:40 -07:00
Adrian Lizarraga	aef53e2b0d	Support uploading EP perf data to a configurable database. (#11819 )	2022-06-13 14:06:50 -07:00
Adrian Lizarraga	e45197fa8c	[trt-ep-perf] Fix upload time of EP perf data (#11531 ) Fix the post.py script to use the actual "upload time" in ISO format instead of the day/month/year of the commit date.	2022-05-18 15:36:21 -07:00
Adrian Lizarraga	48efeca66c	[trt-ep-perf] Fix bug that suppresses latency gain reporting (#11321 ) Fix bug that prevents EP perf script from reporting latency gain for TensortRT/CUDA	2022-05-17 14:00:52 -07:00
Olivia Jain	49d7050b88	Create Checkout Submodules Script (#11344 ) * move all logic for ubuntu dockerfiles * pass in trt version * update trt 8.0 file * downgrade protobuf * uncomment * and * change to 8.0 * update dockerfiles * checkout protobuf based on version * adding last dockerfile: : * checkout 3.10 protobuf * fix checkout version * update to 8.2 * keep only one submodule sync * cleanup * Delete Dockerfile.custom-trt-perf * create checkout submodules script * properly compare decimals in bin/sh * combine build ort paths * deprecate TRT 7.2 * only checkout protobuf if we checkout older onnx-tensorrt * only pull nvidia container if true, update image * downgrade protobuf only if we checkout onnx-trt * Update linux-gpu-tensorrt-daily-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-daily-perf-pipeline.yml for Azure Pipelines * Add quotes to avoid path splitting * address shellcheck * use shellcheck suggestions	2022-04-29 13:04:26 -07:00
Adrian Lizarraga	11e681f46a	[ep-perf] Fix the order of onnx and onnxruntime imports. (#11383 ) Fix the order of onnx and onnxruntime imports. Importing onnx before onnxruntime causes a dependency issue in the tensorrt containers that prevents onnxruntime_pybind11_state.so from finding the system libstdc++. This is a workaround to get the EP Perf pipeline working until we can investigate the issue more closely. Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com>	2022-04-28 10:32:39 -07:00
mindest	c8270c2940	Add ATen export and gradient for torch.max/min (#11275 ) * add aten export for max, max.dim * rewrite grad of max (no dim); add cases for min * update UT cases * mod sym shape infer * resolve comments: shape infer, add comments, etc. * add test for torch.max of two tensors * resolve peng's comments: keepdim; test case * correct python format * fix recently introduced lint error	2022-04-28 17:30:33 +08:00
Olivia Jain	88ff48910d	Perf Bug Fix: Keep First Run for Random Input (#11366 )	2022-04-27 09:17:49 -07:00
Justin Chu	fdce4fa6af	Format all python files under onnxruntime with black and isort (#11324 ) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316	2022-04-26 09:35:16 -07:00
Adrian Lizarraga	f069951835	[trt-perf-test] Pass TensorRT/CUDA EP options via dictionary argument (#11231 ) * Enable users to pass a dictionary of TensorRT and CUDA EP options to the EP perf benchmark.py script. * Post specified EP options to database.	2022-04-22 11:22:25 -07:00
Olivia Jain	ae243c2bb5	Pull Nightly Wheel File and Cleanup Perf (#11164 ) * delete unused files * only use one dockerfile, otherwise install * Update pipeline file * get other changes * minimal packages * update pull nightly variable * try logical boolean * test boolean * have build ort as boolean * case senstive * use the current head not the previous commit * add helpful note	2022-04-11 11:41:11 -07:00
Olivia Jain	872ed91d8a	Perf FasterRCNN + MaskRCNN (#11102 ) * add faster mask * fix paths	2022-04-04 13:23:25 -07:00
Olivia Jain	de384805cd	Custom parameters (#10964 ) * get inputs independently for trtexec * track one process only * remove engine and profile files * change time to commit time * add runtime option for io binding * move to commit date * fixes * add option for graph optimization * cleanup docker script * note second time creation * allow for parameters to be configured from pipeline at runtime * uncomment * include optional arguments at runtime * post second session creation * update cmake version * Revert "update cmake version" This reverts commit 09a1364eae68610724c8e90eeea777b7ee03f74b. * Move data format import	2022-03-23 09:47:24 -07:00
Olivia Jain	12eb660415	Compare TRT vs ORT-TRT Accurately (#10565 ) * get inputs independently for trtexec * track one process only * remove engine and profile files * change time to commit time * add runtime option for io binding * move to commit date * fixes * add option for graph optimization * cleanup docker script * include remaining changes * choose graph optimization option * add space in option	2022-03-04 10:14:18 -08:00
Olivia Jain	a1d9a71b8b	Improve Perf System (#10404 ) * move table names to one location * remove session metadata * reload trt inputs * fix posting names * Update linux-gpu-tensorrt-daily-perf-pipeline.yml for Azure Pipelines * remove comments * Split up anubis job and perf run * add trt environ variables * No embedded links	2022-02-01 16:01:34 -08:00
Olivia Jain	eee627fde9	Track Session Creation Time (#10281 ) * add back previous changes lost in merge * post session to dashboard * post session creation time to dashboard * fix trt 8 functionality: * add component governance * Remove hardcoded values * Update linux-gpu-tensorrt-daily-perf-pipeline.yml for Azure Pipelines * cleanup errors * post results only once * checkout 8.0 GA * try build 8.0 without building shared lib * add back build_shared_lib, not the problem * add upload_time to table * use identifier to post * Shorten to TRT x.x * shorten commit hash using rev_parse * use shortened commit hash * use nvidia's default TRT_VERSION	2022-01-21 13:20:53 -08:00
Olivia Jain	4048ed326c	Update EP Perf Pipeline (#10149 ) * migrate to 1ES Hosted Pool * migrate to Kusto database * refactor and organize ep names with ORT prefix * standardize TRT benchmarking with save/load engine, input binding, and workspace * Add TRT 8.2 to ep perf pipeline * update model_list.json with full onnx zoo * add anubis credentials * add anubis credentials * clarify trt variables * get system info from docker image * remove unwanted commenting	2022-01-11 16:12:32 -08:00
Olivia Jain	6fbd0a8233	Change cmake_cuda_architectures to double quotes (#8990 )	2021-09-08 09:41:52 -07:00
Olivia Jain	a0c9408f0d	Make TRT Version Configurable (#8864 ) * copy changes from trt_and_mem * second edits * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * change to cuda 11.4 * build with cuda 11.4 * Update Dockerfile.ubuntu_cuda11_1_tensorrt7_2 * add cmake extra defines * cmake architectures * fix cmake arch * Delete ubuntu-18.04.Dockerfile * Rename Dockerfile.ubuntu_cuda11_1_tensorrt7_2 to Dockerfile.ubuntu_cuda11_4_tensorrt7_2 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * removing previous ort args * rename to cuda 11.4 * remove cuda 10_2 * delete trt 7.1 * remove 7.1 * Passing in cuda architecture to reduce build time * always add submodule sync due to recursive cloning * fix run command * add and * take away unused arms and share python installation script * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * Update Dockerfile.tensorrt * cleanup file * install python directly on dockerfile - move to scripts in future * Update Dockerfile.custom-trt-perf * adding cuda 11.1 for missing Libnvrtc.so.11.1 * Delete install_python.sh	2021-09-03 13:32:27 -07:00
Olivia Jain	33c0b3e94b	Perf test fixes (#8863 ) * fix anubis wheel upload and symbolic shape infer location * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * fix symbolic path * use master and call mem_test after build * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * use installed symbolic shape infer TODO: check upon error * catch symbolic shape errors	2021-08-31 10:03:47 -07:00
Olivia Jain	9cefd1303b	Integrate Anubis (#8603 ) * copy over changes * Update build_image.sh * allow for configurable trt * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * reflect previous changes * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * model_list.json * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * checkout trt 7.1 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update post.py * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update model_list.json * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update model_list.json * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update start_job.ps1 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update run_mem_test_docker.sh * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Separate anubis files * revert to old pipeline * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * build off master Dockerfile * Delete Dockerfile.custom-trt-perf * Delete install_common_deps.sh * uncomment * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * pass in trt container version * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update post.py * remove sudo * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * add back build number * allow python 3.8 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * python 3.8 fix trtexec * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * remove prev py38 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * add perf dependencies * Update start_job.ps1	2021-08-16 13:20:28 -07:00
Edward Chen	20f006c580	Remove flake8 check from CMake build. (#8662 )	2021-08-09 14:10:36 -07:00
Ryan Hill	cc9f793b48	Move one function from cuda_provider_factory.h (#8407 )	2021-07-19 17:55:59 -07:00
Chi Lo	31f291f0af	Add TRT EP memory leak test into trt perf script (#8155 ) * Add memory check for TRT perf * Revise test app * Add memory check for TRT perf * Revise test app * add test cases * Modify script and add pipeline YAML * remove redundant code * temporarily change * Change YAML * revise test app * fix minor bug * code refactor * small fix * temporarily change for test * prepare result log * rm container when it exits * code refactor	2021-07-13 09:39:08 -07:00
Olivia Jain	b2247ece25	Make Perf Test Configurable (#7836 ) - Allow anyone to kick off a perf test here. Customize: branch, eps, model selection, cuda version. - Only run shape inference when required. - Kill errored out memory processes. - Remove warmup run. - Clean up script. - Standalone_TRT is it's own "EP" vs as an additional run with TRT EP	2021-06-18 11:11:19 -07:00
Olivia Jain	29172d8f54	Setup EP Dashboard (#7321 ) * setting up dashboard * posting to ort dashboard * creating separate docker file * including common deps * tracking latency over time	2021-05-11 10:33:39 -07:00
Olivia Jain	fb40602ea2	Mem trt (#6868 ) * adding trt comparison and memory consumption * creating separate docker file	2021-04-05 22:16:12 -07:00
Olivia Jain	db05d53b94	Setup perf in docker and add features (#6582 ) * setup scripts to run in docker * percent threshold for accuracy * branch testing	2021-02-25 09:31:03 -08:00
Olivia Jain	56ab2166e8	Delete float16.py (#6336 ) No longer needed. Also doesn't pass policheck.	2021-01-13 13:41:06 -08:00
Olivia Jain	c8de3f355a	Refactor EP Perf Tool (#6202 ) * merge master, keep postprocess status commit * download float16.py everytime * using variables to reference eps * adding ACL EP to ep perf tool * accuracy with absolute tolerance configurable * add acl to dict + remove commented line	2021-01-04 08:50:41 -08:00
Olivia Jain	234e94b4e1	Add Status.csv to EP Perf Tool (#6167 ) * merge master, keep postprocess status commit * download float16.py everytime * removing hardcoded values	2020-12-21 20:23:19 -08:00
Olivia Jain	3738ca7e10	Improve perf testing (#5760 ) * build off a specific commit and archive wheel file * rename to fp32, prefix results w/ commit, add CPU col * rename 99th to 90 percentile * get symbolic_shape from master each time * add install archive wheel, parallel build * shortening hash	2020-11-20 16:03:09 -08:00
Chi Lo	92292de135	Tensorrt perf tool (#5436 ) * Add YAML file for pipeline * Modify typo * Add working directory * Modify and test * Modfiy and test * Modify and test * Modify and test * Modify * Modify * Modify * Modify * Make sure to copy all the result files * Add clearn up * Modify * Modify agent pool name * Upload only specific artifacts * Modify * Integrated CI Pipeline for running TRT perf as well as added the “large amount of models” into perf model target * Fix bug * Fix bug * Add reading the information regarding previously known failing models and then skip testing them during benchmark/validation * Modify the script file for CI * Replace print with logger.info * Fix bug * Fix bug * Refine the code * Modify the script so that it can capture script segmentation fault while running ORT * Fix bug * fix bug * fix bug * Add debug info * fix bug * Refine perf code * Refine the code * fix bug * Code refactoring * change many-models path * remove metadata after validation/benchmark are done * Update README.md * Fix bug so that metadata doesn't hold stale value * Remove hardcode and update README * Add arguments to the script to make it run correctly * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Fix bug so that metadata doesn't hold stale value * Fix small bug of finding test dataset directory for FP16 test data, as well as modification of some output information * use -i random for perf test of TRT changes Co-authored-by: Olivia Jain <oljain@microsoft.com>	2020-11-06 12:27:42 -08:00
KeDengMS	ce3b67e0cd	[Python] Move symbolic_shape_infer from nuphar to tools (#5162 ) * [Python] Move symbolic shape inference from nuphar to tools * Fix PEP8 ERROR	2020-09-18 09:31:06 -07:00
Chi Lo	9f526f45ac	TensorRT Perf Tool (#4900 ) * Initialize tensorrt perf script * Add bert-squad dependencies * Modified code to make ort inference with CUDA/Tensorrt * Add get CUDA/TRT version * uncomment bert-squad * Add BERT-SQUAD inputs.json * Add FastRCNN * Make preprocess/validation in to common functions * Add MaskRCNN and SSD and consolidate the code * Add dependencies for MaskRCNN * following modifications are made: - create common fetch function to get inputs/outputs of model from ONNX model zoo. - create common validation function to compare inference outputs with reference outputs from ONNX model zoo. - move run/repeat time to argument list. (still working on other arguments, like fp16 or fp32, latency percentile). - generate table in csv file to show the latency comparison (TRT vs CUDA) side by side. * Add approache to analyze profling file and also update model related settings * Add models * Add most of models from ONNX model zoo * Add model input name and print all the model names at the end of run * Add system info * Add TRT fp16 support * Refine the code * Handle TRT fall back and modify the way to get input data * Refine code * Modify code * Add more precise approach to measure inference * Add io-binding * Add YoLoV4 * Refine the code * Refine the code * Add models * Add yolov4 notebook for jetson device * Update notebook * Update notebook * Add CVS models * Add missing model * Add support of float16 * Add new way to get trt version * Add "validate" and "benchmark" mode * Add randomly generated input * Refine perf script * Refine the code. * Add README * Refine the code * Update README.md * Refine code * Update README.md * Remove all the model related python and instead using model_list.json as models configuration. Refine the benchmark.py * Refine the code Co-authored-by: Chi Lo <lochi@microsoft.com>	2020-09-15 10:06:01 -07:00

47 commits