onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-12 17:57:38 +00:00

Author	SHA1	Message	Date
George Wu	71ce0b8e0a	remove 64bit warning message from python validation. (#2727 )	2019-12-23 17:06:55 -08:00
Andrey	abda478083	[server] Add support for model_name and model_version as cli parameter (#2708 )	2019-12-23 12:52:09 -08:00
Ashwini Khade	cd6ff42442	Add schema for new Qops (#2611 ) * Add schema for new Qops * adding shape inference + qlinearaveragepool * plus review comments * plus review comments * updates per review comments * plus review comments	2019-12-23 10:19:28 -08:00
KeDengMS	71940c0915	Update Nuphar tutorial notebook (#2721 ) 1. Reflect int8 GEMV improvements for multi-threading from #2696 2. Add notes on multi-threading control using OpenMP 3. Add samples of running multi-isa AOT, and show int8 GEMM differences between AVX and AVX2 4. Add rnn_benchmark example to resolve #1993	2019-12-22 22:42:03 -08:00
Jeff	bbe32f0990	Extend DML kernels (#2641 ) * Additional DML operators * Check unsupported attributes and inputs * Address PR comments * Add kernel capability function used for partitioning, and re-enable stride-based int64 support based on value range * Fix test failures * Build fix * PR comments	2019-12-20 23:28:15 -08:00
KeDengMS	4b900dc585	Simplify cache implementation and avoid static variables that may carry over between models	2019-12-20 21:04:17 -08:00
Changming Sun	da03ed4473	Tiny fix to codegen	2019-12-20 21:04:17 -08:00
Changming Sun	b42cb61904	Packaging pipeline changes for VS 2019 (#2711 )	2019-12-20 19:53:51 -08:00
Ori Levari	f0046a1531	switch back order of session_state_ and execution_providers_ in InferenceSession.h (#2714 )	2019-12-20 17:26:51 -08:00
Xavier Dupré	7c0235c15a	Propagate documentation modification from rel-1.0.0 (#2713 )	2019-12-21 00:25:45 +01:00
Ori Levari	967ffc03cc	a subset of the models in testdata were missing an ir_version. update this ir_version to 3. (#2702 )	2019-12-20 13:56:11 -08:00
Yufeng Li	72286d91e7	fix the CUDNN_BN_MIN_EPSILON difference issue between cudnn7.3 and cudnn7.6 (#2681 )	2019-12-20 13:49:03 -08:00
jignparm	64112db346	Fix C# handling of unicode strings (#2697 ) * Fix C# handling of unicode strings * more tests * check for handle before freeing * variable reuse efficiency * refactor and cleanup utf8 to utf16 conversion block	2019-12-19 21:02:54 -08:00
Dmitri Smirnov	233bdd268b	Advance Featurizers commits and fix up ORT linking issues. (#2705 ) Update featurizers. Fix up constraint issue. Pass static VCRT library option down to Featurizers CMAKE. Make build Featurizers OFF by default. Rename registration call.	2019-12-19 20:27:36 -08:00
KeDengMS	9b14d8fd67	Update rnn_benchmark to reset OMP threads (#2698 )	2019-12-19 14:10:39 -08:00
Faith Xu	bb7f43ee91	Documentation update: build instructions (#2636 ) * Spacing fix for code block * Update instructions Include java, acl, and nn api instructions on build page * Update build instructions to link to build.md * typo * Update build instructions to link to build.md * Include other minor build.md page updates * Update CUDA version * Fix dockerfile links	2019-12-19 13:40:34 -08:00
Hariharan Seshadri	b3d0b114fe	Support Round op in the CUDA EP (#2601 ) * Support Round op for the CUDA EP * Update version * Fix build * Fix opset version * Update * PR comments * Fix build * Nit	2019-12-19 11:36:50 -08:00
baowenlei	715e365723	Enable int8 GEMV tensorization (#2696 ) Enable int8 GEMV tensorization and improve performance.	2019-12-18 21:02:52 -08:00
Hariharan Seshadri	971bc439b5	Support CumSum op in the CUDA EP (#2647 ) * Initial commit * Initial commit * Updates * Fix build * Updates * PR feedback * Minor optimization * Update * Update	2019-12-18 16:49:59 -08:00
KeDengMS	9017e93701	[NupharEP] fix for Windows build and VS 2019 (#2694 )	2019-12-18 16:16:46 -08:00
Changming Sun	89d6bfaa94	VS 2019 build pipeline changes (#2693 ) 1. Move Win GPU pipeline to VS2019 2. Move C API pipeline to VS 2019 3. Move nuget mklml pipeline to VS 2019 4. Move windows no contrib ops pipeline to VS 2019	2019-12-18 15:34:58 -08:00
Yang Chen	0d5504877b	fixed "unreachable code" warnings for Windows build (#2682 ) When NUPHAR_USE_MKL and NUPHAR_USE_AVX2 were not defined, we got "unreachable code" warnings on Windows, which were turned into errors and broke the build.	2019-12-18 12:51:19 -08:00
Changming Sun	504ba6f2ca	Remove a warning	2019-12-18 11:13:34 -08:00
Dmitri Smirnov	ce7a180f21	Import more featurizers with tests (#2685 ) Advance commit to 4df80d5865a9d4e97f6d0b9304d4316115a04d9e Add generated code for the commit before editing. Import more featurizers. Rename Automl ops domain to mlfeaturizers. Rename conditional compilation macro. Move and rename files getting rid of automl Rename --use_automl build switch to --use_featurizers Rename CMake option accordingly. Rename automl CMake targets. Adjust CI and packaging pipeline switches. Rename namespace automl to featurizers.	2019-12-17 22:17:40 -08:00
KeDengMS	c767e264c5	[NupharEP] update tutorial with GPT-2 (#2677 )	2019-12-16 17:57:34 -08:00
Jeff	b38b36a941	Increase max_num_graph_transformation_steps from 5 to 10 (#2633 )	2019-12-16 14:17:37 -08:00
Jeff	9dddabf237	Miscellaneous DML EP fixes (#2634 ) * Misc DML EP fixes * Fix white space * Build fix	2019-12-16 14:16:10 -08:00
Tracy Sharpe	c907881852	MLAS: optimize QuantizeLinear (#2660 ) This adds MLAS support to quantize a buffer using vector instructions for x86/x64/ARM64. This helper is now used for the QuantizeLinear and DynamicQuantizeLinear ops.	2019-12-16 10:56:23 -08:00
Hector Li	47503ec7a6	Initiate the build scripts for ARM ACL (#2652 ) 1. Add scripts to build Yocto image & toolchain 2. Update docker build scripts to support Onnxruntime build with ARM ACL 19.02/19.05	2019-12-16 09:44:19 -08:00
Yang Chen	f7412899a1	added cache version for nuphar JIT binaries (#2646 ) * added cache version for nuphar JIT binaries Previously, when the user mistakenly loaded a JIT binary generated from a Nuphar version different from the one currently in use, she would get mysterious runtime failures, because we didn't perform any version check on JIT binaries. This change adds cache versions to the Nuphar runtime and JIT binaries. The Nuphar runtime will issue a verbose message that informs the user of version-mismatch errors. * address CR feedback * include NUPHAR_CACHE_VERSION in python wheel	2019-12-14 22:46:30 -08:00
Dmitri Smirnov	7c87070b24	Import Featurizers (#2643 ) Import FeaturizerLibrary as ExternalProject which is optional and is not registered as git submodule.	2019-12-13 16:07:12 -08:00
Changming Sun	a46a28b7d8	Windows CI changes(#2650 )	2019-12-13 12:23:49 -08:00
Tianlei Wu	18bdde34dc	Fix memory exception in Layer Norm Fusion (#2644 )	2019-12-13 11:13:55 -08:00
Zhang Lei	1996129ddf	Improve performance of resize() in Nearest mode (#2626 ) Special treatment for 2D, check same size as input image. And in 2d kernel, template use_extrapolation.	2019-12-13 09:43:13 -08:00
Hector Li	6e68007d2d	fix build error for ARM (#2648 )	2019-12-13 09:21:03 -08:00
Zhang Lei	fff1ed9bfc	Optimize cuda scatter() on 2D compatible. (#2628 ) * Optimize cuda scatter() on 2D compatible. * Add some comments.	2019-12-12 16:38:06 -08:00
Zhang Lei	e31be23c42	Cuda pad optimize when no padding is needed. (#2625 ) * Shortcut cuda Pad() when no padding is needed.	2019-12-12 16:36:46 -08:00
Zhang Lei	8631b70c73	Improve cuda expand() operator's performance. (#2624 )	2019-12-12 16:36:06 -08:00
Jeff	ac08b58867	Add shape inference to ConvTransposeWithDynamicPads schema (#2632 )	2019-12-12 14:46:06 -08:00
shahasad	4dbf9442cc	removed unnecessary batch file and fix path (#2640 )	2019-12-12 14:21:02 -08:00
liuziyue	c7cd336917	epsilon attribute for layernormalization fusion (#2639 )	2019-12-12 10:58:02 -08:00
Yufeng Li	d6f33dceb1	fix float16 comparison in initializer (#2629 )	2019-12-11 19:03:57 -08:00
Tianlei Wu	c04647b952	Improve Embed Layer Norm Fusion for SQuAD with static input shape (#2621 )	2019-12-11 15:25:35 -08:00
Zhang Lei	6859d92e2b	Make sure fenced tensor could not reuse other tensor. (#2561 ) Fix random error caused by this.	2019-12-11 12:04:54 -08:00
Hariharan Seshadri	b2d65b447f	Remove unnecessary parameter in some places in GatherElements implementation (#2612 ) * Remove unnecessary parameter in some places * Update * Update	2019-12-11 11:15:12 -08:00
Eric Cousineau	8729784635	Allow providers to be set for InferenceSession at construction (#2606 )	2019-12-11 10:24:43 -08:00
liuziyue	1ee250def8	EmbedLayerNormalization Fusion For Dynamic Squad Model Opset 10 (#2613 ) Support subgraph of SQuAD model exported from pytorch with dynamic input axes	2019-12-10 23:40:55 -08:00
Yang Chen	2ca9733cee	Dump subgraph ID and fused graph ID (#2607 ) * Dump subgraph ID and fused graph ID Dump subgraph ID and fused graph ID for better debugging * Remove local static fused_count added a field global_fused_count_ to NupharExecutionProvider class	2019-12-10 19:56:39 -08:00
KeDengMS	45babd6c00	symbolic shape inference: fix warnings in GPT-2 model (#2608 ) And revise nuphar perf test on BERT squad	2019-12-10 16:05:00 -08:00
Tianlei Wu	bc89eccb21	Support opset 11 subgraph of Squad model in Embed Layer Normalization (#2605 ) Support opset 11 Squad model that is exported from PyTorch nightly. The embed layer uses Range op which is missed in the transformer.	2019-12-10 15:22:19 -08:00

1732 commits