onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-17 21:10:43 +00:00

Author	SHA1	Message	Date
Ashwini Khade	ce49cfa67c	add support for configurable build dir when building nuget packages (#5352 ) * add support for configurable build dir when building nuget packages * rename vars	2020-10-02 09:31:35 -07:00
Changming Sun	f265834c2c	Exclude GPT2_LM_HEAD from OpenVino's model test list (#5356 ) GPT2_LM_HEAD is a new ONNX model zoo model that OpenVino doesn't support. Error message:1: [ONNXRuntimeError] : 6 : RUNTIME_EXCEPTION : Non-zero status code returned while running OpenVINO-EP-subgraph_1162 node. Name:'OpenVINOExecutionProvider_OpenVINO-EP-subgraph_1162_1' Status Message: _Map_base::at	2020-10-01 21:49:45 -07:00
Sunghoon	1612934f72	Allow protobuf format of input data for performance test (#5323 ) * Allow protobuf format of input data like onnxruntime_perf_tool * Add OnnxML.cs to fix build failure	2020-10-01 21:40:29 -07:00
Yufeng Li	e8b9aa1f29	fix quantization of EmbeddingLayerNorm (#5321 )	2020-10-01 20:08:43 -07:00
KeDengMS	7495dc167a	Symbolic shape inference: fix a bug in auto_merge when broadcasting (#5349 ) The bug happens when merging following shapes: input0: [1, 1, 'Min(1024, input1_dynamic_axes_3)', 'Min(1024, input1_dynamic_axes_3)'] input1: ['input1_dynamic_axes_1*input1_dynamic_axes_2', 12, 'input1_dynamic_axes_3', 'input1_dynamic_axes_3'] input2: [] The fix is to avoid broadcasting merge on input2	2020-10-01 15:24:00 -07:00
Ye Wang	caed6c264c	Add tf2pytorch wrapper in transformers tool (#5316 ) * init checkin * format * refactor * review comments	2020-10-01 13:58:58 -07:00
edgchen1	d62873a331	Docker image release build updates (#5326 ) - Update docker image release build to use build commit. - Use valid default in component governance detection step. - Use smaller docker build context.	2020-10-01 12:25:31 -07:00
liqunfu	fe50213491	Liqun/bert pretrain2 (#5327 ) * bert single node multi GPU pretrain w/o checkpoint Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-10-01 11:01:26 -07:00
Brian Martin	1cad3e322e	typo in contributing.md (#5340 ) there's a missing space between two words.	2020-10-01 10:23:08 -07:00
Guoyu Wang	2098d621a6	Make some string optional for save to/load from flatbuffers (#5331 ) * Update how to save and load string using flatbuffers and ort_format_only_test * Add some comments * Address PR comments	2020-10-01 09:24:37 -07:00
Hariharan Seshadri	383b1e207c	Fix bug in the Resize operator kernels (#5303 )	2020-09-30 15:33:33 -07:00
Ashwini Khade	3f00b8db8f	move all experimental ops to version 1 of ms domain (#5287 ) * move all experimental ops to version 1 of ms domain * deprecate TransposeMatMul in favor of FusedMatMul * update documentation	2020-09-30 14:50:18 -07:00
edgchen1	2c32309e2c	Update dockerfiles/README.md onnxruntime-training image tags. (#5333 )	2020-09-30 14:35:38 -07:00
Sherlock	37445d1198	Update Bert Perf Script (#5339 ) Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-30 14:30:20 -07:00
Changming Sun	8d4740b39c	Add some log for the GetFileLength function (#5330 )	2020-09-30 10:39:42 -07:00
Faith Xu	cb57c100e6	Doc updates for 1.5 (#5302 ) * Fix Windows AI version * Update text to extend telemetry coverage Includes all official binaries * Update text about EP pluggability * Update CUDA/cuDNN versions * Add link to reduce operator kernel page * Update roadmap * Add preview for migraphx * Move Rockchip under IoT/Edge * Update text to include ORT for Mobile doc link	2020-09-30 09:53:33 -07:00
Tim Harris	69dbaaa015	Add additional test cases to check for leaks in thread pool creation / destruction (#5311 ) Add additional test cases such as ThreadPoolTest.TestPoolCreation_10Iter to create and destroy thread pools to watch for any memory leaks. Running under Valgrind, these tests should show all of the data allocated being deallocated again. Two recent issues #5176 and #5292 indicated memory leaks. The test cases help identify whether or not any of the data structures used in the thread pool are being leaked. Currently, on WSL, the only data not being de-allocated in these tests are a small number of nsync waiter objects. This behavior is as expected (the waiter objects should be held on a free list in the nsync library).	2020-09-30 11:26:02 +01:00
Ye Wang	1a12f510fc	Support T5 benchmarking in transformers tool (#5133 ) * init checkin * review comments * modify according to transformers release	2020-09-29 22:58:28 -07:00
Sherlock	9ec1ed42a8	Enable BiasDropoutFusion for CUDA EP only (#5324 ) Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-29 14:00:15 -07:00
Wenbing Li	ed102e9d88	Add iOS test pipeline and a sample app. (#5298 ) * Add iOS test pipeline and a sample app. * clean up the unused code. * clean up. * revert the unknown change * disable the shared library for iOS. * add open source notice text. * ignore the skipped test. * extract the common ortenv setup	2020-09-29 13:53:11 -07:00
Tracy Sharpe	f07059ccc0	Add weight prepacking to LSTM kernel (#5305 )	2020-09-29 13:33:38 -07:00
Sherlock	11c194ce29	Minor fix for ComputeBroadcastBackwardAxesDynamic; Fix for GradientGraphBuilder logging (#5313 ) Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-29 09:49:05 -07:00
liqunfu	24d8b1bf42	to skip an unstable test to unblock release (#5314 ) Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-28 22:30:11 -07:00
Hariharan Seshadri	cb83097632	Cosmetic change in non tensor tests (#5317 )	2020-09-28 21:23:30 -07:00
Scott McKay	1ff3b2d5b8	Add ability to generate multiple test dirs so that different input mixes can be tested. (#5310 )	2020-09-29 12:55:15 +10:00
Vincent Wang	eae2473dc1	Scale Op for ReduceMeanGrad. (#5191 ) * Scale Op for ReduceMeanGrad * fix Windows build error * resolve PR comments. Co-authored-by: Vincent Wang <weicwang@microsoft.com>	2020-09-29 09:30:49 +08:00
Vincent Wang	506060dc37	Remove Useless Cast from Contiguous Cast Nodes (#5204 ) * remove useless cast * move the optimization to cast transformer * bugfix * resolve comments * fix comment Co-authored-by: Vincent Wang <weicwang@microsoft.com>	2020-09-29 09:18:52 +08:00
Changming Sun	d45d68fdd4	Fix a memory leak in our testing code (#5312 )	2020-09-28 16:00:57 -07:00
Scott McKay	3693f91218	Update doc to be explicit about backwards compatibility. (#5309 )	2020-09-29 07:34:49 +10:00
ytaous	b18a8bc74f	Transpose kernel fix for illegal memory access error (#5294 ) * transpose fix * minor update per comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-09-28 13:59:50 -07:00
Changming Sun	1a04b8f8b7	Add valgrind support to our cmake files (#5296 )	2020-09-28 09:31:08 -07:00
Guoyu Wang	fec890a09a	fix build break (#5306 )	2020-09-28 00:10:48 -07:00
RRRachelllll555	507f5bf5f6	Update test calibrate script (#5185 ) * update test_calibrate according to latest calibrate.py * fix datasize bug in e2e example Co-authored-by: t-yguo <t-yguo@microsoft.com>	2020-09-27 21:59:56 -07:00
Tang, Cheng	d9ecc0cebf	add bert loss legacy back (#5224 )	2020-09-27 13:41:16 -07:00
George Wu	16d35266ab	add install targets for ep shared libs (#5286 )	2020-09-25 07:10:43 -07:00
Guoyu Wang	3a3f26f38e	Move ort flatbuffers helper functions and value info r/w functions into separated lib (#5276 ) * Move fbs include from header to cc * add initial cmake for flatbuffers * Move most flatbuffers util to ort_flatbuffers * move code around * fix * move test/perf runner to use flatbuffer directly instead of model * minor update * Fix build break * Clean up includes and forward decl * Fix training CI build breaks * Addressed PR comment, replaced some include with forward decls * Remove ORT_MUST_USE_RESULT temporarily	2020-09-25 05:36:29 -07:00
Changming Sun	17f1178c2e	Downgrade GCC (#5269 ) Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2020-09-24 21:14:54 -07:00
Sherlock	b03fb82ab7	Transformer layer-wise Recompute (#4526 ) * Build Recomputation Graph * Make topological sort to run FW nodes first * Pattern match start and end of transformer layer * Topological sort with Priority * Add logger to Gradient Graph Builder * Use Logger * Introduce Execution Order	2020-09-24 19:56:32 -07:00
Faith Xu	b6e71200eb	Add additional tutorial links (#5272 )	2020-09-24 17:27:58 -07:00
Dmitri Smirnov	89742411ec	Insert telemetry template into GPU build, add telemetry build switches. (#5278 )	2020-09-24 17:13:09 -07:00
Thiago Crepaldi	ebeeff22dd	Update PyTorch TransformerModel sample (#5275 )	2020-09-24 16:28:07 -07:00
Ryan Lai	71b52ad5de	Fix inbox telemetry (#5265 ) * ifdef to check if redist or not * Fix redist telemetry Co-authored-by: Ryan Lai <ryalai96@gamil.com>	2020-09-24 14:58:07 -07:00
Scott McKay	b49ff6151e	Workaround issue with VS2017 compiler. (#5279 ) The definitions for some Eigen classes don't get pulled in leading to errors. Split out the broadcast function creation logic from the functions using std::enable_if to workaround that.	2020-09-25 06:50:14 +10:00
KeDengMS	5a71819be6	Symbolic shape inference: fix a case for concat (#5277 ) * Symbolic shape inference: fix a case when concat requires merge multiple dims * Fix a bug triggered in newer version of sympy * Fix a bug in output data type guessing	2020-09-24 08:16:47 -07:00
Josh Bradley	4ed31ca214	Combine custom logger global threadpools (#4857 ) * add custom logger and global threadpools to C and C++ API * code cleanup and formatting * reformat code * tidy up some more code formatting * remove comment * fix API break from merging from master * renamed API function to CreateEnvWithCustomLoggerAndGlobalThreadPools * rename log variable and apply clang-format	2020-09-24 00:50:26 -07:00
Dwayne Robinson	6ad39819c2	Update DirectML Nuget to 1.3.0 (#5274 ) Update to 1.3.0	2020-09-23 22:53:02 -07:00
Dwayne Robinson	a4cb00b91e	Merge pull request #5273 from microsoft/user/dwayner/CmakeLinkerOptFlags Linker opt flags - fix conflicting CMake linker flags which contradict those needed by the Windows inbox universal CRT	2020-09-23 20:08:21 -07:00
edgchen1	6d5b93b805	Synchronize training dependency versions between Docker image and Python wheel. (#5261 ) Synchronize training dependency versions between Docker image and wheel, update docs, refactor build scripts.	2020-09-23 19:03:42 -07:00
Justin Stoecker	56862f4022	Add way to disable additional linker opt flags	2020-09-23 12:56:40 -07:00
Ashwini Khade	16220f3848	Add FusedMatMul contrib op (#5213 ) * bug fix transformer * fuse cpu kernel for transposescalematmul and matmul * fuse transpose_scale_matmul cpu kernel with matmul * fix test * Add FusedMatMul Contrib Op * fix test * fix typo * plus more updates per review	2020-09-23 12:17:50 -07:00


3484 commits