onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

Author	SHA1	Message	Date
edgchen1	2cb8cb816f	Disable or update flaky tests, improve test random seed accessibility. (#3495 ) - Add output of test random seed - Allow setting of test random seed with environment variable - Disable / relax tolerance for flaky tests	2020-04-17 15:57:32 -07:00
edgchen1	de543c0308	Add SafeInt include to WinML targets (#3558 ) Fixing Windows builds on the ort_training branch in preparation for the merge to master. SafeInt (included via onnxruntime/core/common/safeint.h) was recently made a dependency of onnxruntime/core/framework/bfc_arena.h. That requires consumers of bfc_arena to compile with the SafeInt include directory.	2020-04-17 09:54:01 -07:00
manashgoswami	9fc2b6482b	Ort training README (#3404 ) Added README for ORT Training	2020-04-16 14:51:33 -07:00
M. Zeeshan Siddiqui	6c1ccb659f	SoftmaxCrossEntropyLoss-12 forward and backward kernel implementation. (#3465 ) * Update ONNX submodule commit to the latest. * build break. * SoftmaxCrossEntropyLoss: Forward and backward kernel implementation. * Revert "build break." This reverts commit 847cb50d294efbe6c09fa760e7cacf25bfb6146d. * Add more tests and misc clean up. * revert unintended changes. * PR feedback. * cleanup. * PR feedback.	2020-04-16 12:27:07 -07:00
edgchen1	0ec90f7019	Put safeint_interface include directory into onnxruntime_common interface include directories to simplify usage by other targets. (#3546 )	2020-04-16 10:34:32 -07:00
edgchen1	2f16172e69	Address PR comments and clean up. (#3536 ) Address PR comments and clean up. - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408549886 - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408551151	2020-04-15 15:51:52 -07:00
Jesse Benson	2536e80602	Rename API to onnxruntime.set_seed(<seed>)	2020-04-15 09:44:48 -07:00
Jesse Benson	644bc05830	Add Python API to set random seed: onnxruntime.seed(<seed>)	2020-04-15 09:44:48 -07:00
pengwa	2c7c45076b	MaxBatchSize E2E Test (#3454 ) * max batch size e2e test * update test data snapshot	2020-04-15 09:50:44 +08:00
edgchen1	4fa88a0a23	Remove cast to OpKernelContextInternal to get threadpool and directly use OpKernelContext. (#3523 )	2020-04-14 14:30:26 -07:00
Tixxx	06b63975c0	Fix fp16 type mismatch when graph output is an fp32-only node (#3411 ) * verify output node before changing its type in mixed precision mode	2020-04-14 09:35:19 -07:00
edgchen1	ba7225f986	Update Graph SetInputs and SetOutputs for training (#3446 ) Fix training modification of Graph SetInputs() and SetOutputs(). Originally there were distinct code paths in Graph based on whether the graph was loaded from a GraphProto or created from scratch. The training modifications made that distinction a bit ambiguous - i.e., even though the Graph is loaded from a GraphProto for training, sometimes we rely on the other code path, e.g., to deduce the graph inputs after modifying it. Consequently, there was some odd behavior when using SetInputs(). For correctness, this change separates the cases where the graph is loaded from a GraphProto and where it is created from scratch.	2020-04-13 19:10:44 -07:00
M. Zeeshan Siddiqui	5d99f179b9	Merge pull request #3486 from microsoft/sedymche/merge_master_ort_training Merge from master into ort_training	2020-04-13 10:55:36 -07:00
Sergii Dymchenko	bf3df41424	Put back SubmoduleCheckoutMode parameter into mac-ci.yml.	2020-04-12 21:49:38 -07:00
Sergii Dymchenko	b670cdc11d	Remove usage of DeviceProp (which is removed in ort_training) from cudnn_common.cc.	2020-04-12 21:40:54 -07:00
Sergii Dymchenko	7b2fc196fe	Get cudnn_common.cc from master.	2020-04-12 21:10:36 -07:00
Sergii Dymchenko	571a6d52f5	Add to list of failing backend tests from master.	2020-04-12 19:16:53 -07:00
edgchen1	cffdff6702	Publish unit test results from Linux and Mac builds (#3480 ) * Added publish test results step to Linux and Mac builds. * Fix test result file pattern.	2020-04-10 14:51:56 -07:00
Tixxx	f5ba9c922d	fix internal loss scale (#3483 ) * Changed internal loss scale to 1-D * added test Co-authored-by: root <root@525204a066204ea794f942530b05ae7f000000.axlncovkyjne5caro2tmz3zryb.xx.internal.cloudapp.net>	2020-04-10 14:13:48 -07:00
Sergii Dymchenko	4b5f66ad17	Put dropout_default, dropout_random, celu back in the list of broken tests.	2020-04-10 13:27:00 -07:00
edgchen1	20c7dd9f5c	Remove orttraining/docker directory. (#3476 ) The docker images are not publicly available yet. Addressing PR comment: https://github.com/microsoft/onnxruntime/pull/3174#discussion_r390761308	2020-04-10 09:41:22 -07:00
Vincent Wang	03996c7c08	Fixes for Where, ConcatGrad and ReduceSumGrad (#3415 ) * Fixes for Expand, Where, ConcatGrad ReduceSumGrad. * Roll back expand, fix, add tests for reduce grad. * Roll back CPU Expand change. * Fix after merge. Co-authored-by: Vincent Wang <weicwang@microsoft.com>	2020-04-10 19:35:32 +08:00
Sergii Dymchenko	bb2f427990	Remove FastGelu from activations.	2020-04-09 19:17:54 -07:00
Sergii Dymchenko	507d2bb9b9	Get onnxruntime/core/providers/cuda/math/matmul_integer.cc from ort_training.	2020-04-09 18:09:46 -07:00
Sergii Dymchenko	1b465ba820	Get onnxruntime/core/providers/cuda/cu from ort_training.	2020-04-09 18:03:17 -07:00
Sergii Dymchenko	c5176087bf	Get onnxruntime/contrib_ops/cuda/bert/fast_gelu.cc from ort_training.	2020-04-09 17:55:52 -07:00
Sergii Dymchenko	6bbc80951d	Get onnxruntime/core/providers/cuda/tensor/slice.h from ort_training.	2020-04-09 17:03:58 -07:00
Sergii Dymchenko	0e4080f1d6	Get cuda_common.h from master.	2020-04-09 16:56:52 -07:00
Sergii Dymchenko	84773c61c6	Rename ONNX OPTIONAL to OPTIONAL_VALUE.	2020-04-09 16:22:30 -07:00
liqunfu	e7297e6c9d	create pipeline for ci frontend tests (#3422 ) create pipeline for nightly python front-end e2e tests	2020-04-09 15:31:22 -07:00
Sergii Dymchenko	eaa3f652df	Fix dynamicslice.cc after merge.	2020-04-09 15:17:21 -07:00
Sergii Dymchenko	8ea0e596ec	Fix onnxruntime_unittests.cmake after merge.	2020-04-09 13:14:15 -07:00
Sergii Dymchenko	6ba7c99e50	Merge branch 'master' into ort_training	2020-04-09 12:42:04 -07:00
ytaous	a08f16471a	Address comments around bfc arena (#3460 ) * rename setting * todo comments * fix build Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-04-08 19:35:32 -07:00
liqunfu	a298556600	raise rtol to unblock CI (#3457 ) raise rtol to avoid expected CI test failure in onnxruntime_test_ort_trainer.py	2020-04-08 17:17:44 -07:00
ytaous	f73008483a	safeint for region bytes in bfc arena and code clean up (#3447 ) * PR comments * remove build issue workaround * SafeInt for region bytes * fix build * fix build Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-04-08 13:54:42 -07:00
liqunfu	1ddfe1249b	frontend test to use random seed (#3209 ) frontend test to use random seed	2020-04-08 10:03:07 -07:00
Yufeng Li	4d71958ccf	Use IMMA for int8 matmul to leverage Turing Tensor Core (#3413 ) Use IMMA for int8 matmul to leverage Turing Tensor Core Format files under onnxruntime/core/providers/cuda	2020-04-07 15:22:04 -07:00
Tracy Sharpe	de60a14c16	Fix output range for int8_t QuantizeLinear op (#3445 )	2020-04-07 15:01:20 -07:00
Yulong Wang	aabf47b107	Fix Split CUDA implementation for zero sized input (#2942 ) * Fix Split CUDA implementation for zero sized input * resolve comments * add case * test case update: split into 2 tensors	2020-04-07 14:44:20 -07:00
Scott McKay	48e96ea65f	Reduce binary size of Slice implementation (#3238 ) * Make the Slice implementation based on type sizes and reduce templatized code to a minimum. * Remove using 'dynamic' as a template param to Slice as well.	2020-04-08 07:19:29 +10:00
ytaous	b35468289a	View Op - new unit tests and add support for tensor memcpy by offset/size (#3439 ) * view ops UTs * update per comments * PR comments - code clean up * code clean up per comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-04-07 13:07:11 -07:00
Dmitri Smirnov	53b9d52fc6	Rework TensorToTensorProto. Do not put string data to raw_data. Eliminate redundant argument. (#3438 ) Rework TensorToTensorProto. Eliminate redundant argument. Do not put string data into raw_data.	2020-04-07 11:42:10 -07:00
Andrews548	43d6c464fc	Fix ACL EP pooling build breakage (#3429 ) The commit `06fc9506fd` which refactored cpu Pool class broke ACL EP build. Also worked on the commit `a4fe60c4d3` as it also affects the new class. Move the declaration of the new MaxPoolV8 cpu class in the header file. Implement MaxPool 8-11 in ACL EP. Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-04-07 07:03:52 -07:00
Tianlei Wu	4bdb5cc8e2	Add CPU implementation for FastGelu operator (#3398 ) * Add CPU implementation for FastGelu operator * Update optimization script to fuse Gelu or FastGelu according to whether Erf or Tanh is used in the graph. * Merge BiasGelu and FastGelu into one class * Enable FastGelu Fusion optimizer for CPU Execution Provider.	2020-04-07 00:19:30 -07:00
Changming Sun	9e65298d7a	Re-enable tests (#3437 ) Re-enable some tests that were recently fixed.	2020-04-06 20:13:34 -07:00
Thiago Crepaldi	15e32b44fd	Merge pull request #3383 Merge from master into ort_training	2020-04-06 19:05:01 -07:00
Tianlei Wu	8ab09186b7	Bert Optimization Script Improvements (#3387 ) Add opt_level option for graph optimization level in bert perf test. Support BERT models that output each layer, where SkipLayerNormalization has more than 4 children. Check weight and bias are 1D for layer norm fusion. Add a dummy class Gpt2OnnxModel for further changes of GPT2 model.	2020-04-06 16:55:40 -07:00
Edward Chen	95707d22a5	Disable gradient clipping for E2E test.	2020-04-06 23:07:28 +00:00
Dmitri Smirnov	c8f5e6e632	Implement Min/Max/Clip(12) (#3410 ) Implement Max/Min for opset 12. Add Clip(12) CPU impl. Implement Clip(12) for CPU and CUDA; add tests	2020-04-06 14:24:59 -07:00

2136 commits