onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-01 03:45:06 +00:00

Author	SHA1	Message	Date
Sherlock	8a450d523f	Check gradient correctness in the UTs (#6803) * Check gradient correctness in the UTs Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-02-25 13:31:07 -08:00
baijumeswani	fa8a9015bd	Mount hf model cache and use cache for loading hf models (#6810)	2021-02-25 13:30:14 -08:00
Sergii Dymchenko	99ffffbe6a	Remove backward workaround from test. (#6811)	2021-02-25 13:23:46 -08:00
ashbhandare	b05403d877	Clear iobinding outputs (#6774)	2021-02-25 11:50:43 -08:00
Sherlock	8e200e13fe	Rewrite ORTModule background task coordination (#6700) * Introduce OrtTasks to replace EventPool * return run_id to frontend * pass run_id to backward * OrtTasks support multiple bg_events * make message_queue a member of orttask * Replace MessageQueue with std::promise * Move status_promise into Task * Move terminate flag into Task * Reenable previously disabled UTs * Add unit tests * Replace condition variables with std::promise * Move to CreateBackgroundTask in the main thread * return status and output in forward_future * use throw for terminating background thread * cleanup tasks at destructor * reenable test_mixed_nnmodule_ortmodules_training * add mutex for ORTTasks functions * add mutex for bg_threads * delay tests before start * add ut for multi-task common backbone Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-02-24 18:00:25 -08:00
baijumeswani	7ce4075bbd	Support nested sequence and mapping types in ORTModule (#6791)	2021-02-24 15:45:56 -08:00
Thiago Crepaldi	aa5cd37ac8	Refactor device handling and basic support for PyTorch Lightning (#6758)	2021-02-24 14:12:55 -08:00
baijumeswani	65ba51d93e	Re-enable test and increase timeout (#6785)	2021-02-23 18:51:06 -08:00
Thiago Crepaldi	563218dcda	Update torchtext usage for pytorch transformer sample (#6767) * Update torchtext usage for pytorch transformer sample * Temporarily disable tests to unblock repo (failures are being worked on already) * Update loss numbers for ORTTrainer UTs	2021-02-23 14:06:35 -08:00
Sergii Dymchenko	58f3aca95d	Support keyword arguments for ORTModule (#6539) * Support keyword arguments for ORTModule. * Add backward workaround to the test. * Specify test name directly without -k. * Handle unused inputs removed by ONNX exporter.	2021-02-19 13:40:44 -08:00
M. Zeeshan Siddiqui	1a2f1bd23a	Enable external CUDA allocator in ORTModule. (#6745) * Enable external CUDA allocator in ORTModule. * Fix assert after unification of allocators. * Update no grad memory test. * update comments. * fix provider options array when not sharing allocator.	2021-02-18 20:01:13 -08:00
ytaous	39d182f7fc	ORTModule - FastGeluFusion/fp16 fix and minor LayerNormFusion cleanup (#6734) * fastgelu fix * assert cast outputs Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-02-18 13:17:16 -08:00
Thiago Crepaldi	fb3f1f5cc1	Enable custom ops on ORTModule (#6740)	2021-02-18 09:08:10 -08:00
Sherlock	b7b5612159	Merge pull request #6742 from microsoft/mzs/sync-from-master Sync from master	2021-02-18 00:12:38 -08:00
M. Zeeshan Siddiqui	40dda452cf	Merge branch 'master' of https://github.com/microsoft/onnxruntime into mzs/sync-from-master	2021-02-18 03:03:01 +00:00
M. Zeeshan Siddiqui	e44ac6524f	Plug n Allocate with external CUDA allocator via PyBind. (#6679)	2021-02-17 18:59:38 -08:00
liqunfu	dd8ef4409a	Liqun/migrate perf test (#6733) move ort training perf tests to azure devops	2021-02-17 17:48:47 -08:00
liqunfu	2c5e603bad	Liqun/nuphar nuget (#6656) create nuphar nuget with correct name	2021-02-17 16:13:07 -08:00
Thiago Crepaldi	21f9e32c60	Merge pull request #6714 from microsoft/thiagofc/merge-from-master Merge master into thiagofc/ortmodule-api	2021-02-17 16:08:53 -08:00
Ramakrishnan Sivakumar	a5bef6886b	Threading support for Hybrid core architecture (#6728)	2021-02-17 15:35:07 -08:00
Guoyu Wang	6810d98ea3	Update links to gh-pages for ORT minimal documents (#6721) * Fix broken link in ort minimal docs * Update link of build.md to gh-pages	2021-02-17 14:34:50 -08:00
Justin Stoecker	af4e5c0c6e	Minor WinML model test skip name change	2021-02-17 14:27:58 -08:00
Maajid khan	b41e9b5d4c	[OpenVINO-EP] Fixes OpenVINO-EP build on Windows (#6726) * Fixes OpenVINO-EP Windows build. The OpenVINO EP build is broken on Windows because wchar_t is UTF-16 on Windows but UTF-32 on other platforms such as Linux and macOS, so wide Unicode strings have to be converted to UTF-8 strings on Windows. This commit fixes the issue.	2021-02-17 13:49:03 -08:00
Thiago Crepaldi	9d4b730e46	Fix merge leftover	2021-02-17 11:58:06 -08:00
M. Zeeshan Siddiqui	9853ef84f8	Reduce binary size, limit asynchronous/background thread stuff to training only.	2021-02-17 11:51:09 -08:00
M. Zeeshan Siddiqui	5b7e7aaa45	Move event_pool and message_queue to core.	2021-02-17 11:50:56 -08:00
M. Zeeshan Siddiqui	eecce31a8b	Fix build, cleanup.	2021-02-17 11:50:41 -08:00
Thiago Crepaldi	3184c47ad1	Merge branch 'master' into thiagofc/merge-from-master	2021-02-17 11:49:52 -08:00
Yulong Wang	9a9202a218	[Node.js binding] update dependency typedoc (#6720)	2021-02-17 10:22:05 -08:00
Changming Sun	0be5475de6	Update packaging pipelines (#6664)	2021-02-17 09:53:36 -08:00
Changming Sun	46c06f6ac7	Change Windows GPU CI pipeline to CUDA11 (#6616)	2021-02-17 09:44:44 -08:00
Changming Sun	eefeacd828	Skip running gpt2 model in C# x86 (#6722)	2021-02-17 09:37:16 -08:00
Derek Murray	b8d5fa812c	Fix typo in README.md (#6713) Fixes #6710.	2021-02-17 09:29:30 -08:00
Wei-Sheng Chin	9e67b88c83	Use local rank as GPU ID (#6719)	2021-02-17 22:42:54 +08:00
RandySheriffH	9043df8b66	Deprecate OMP from nuget pipeline (release:1.7) (#6671) * deprecate OMP from nuget * remove omp build * remove * add openmp build * add variants * rename package * move GPU to no omp pipeline * reset path * switch to abs path * reset path * add cpu package * remove obsolete name * set package name Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-02-17 00:03:44 -08:00
Suffian Khan	105883f4b8	remove longformer_global_impl.cu from hipify (#6716)	2021-02-16 22:26:18 -08:00
Hariharan Seshadri	aa2622efb2	Support multiple dynamic inputs in custom ops (#6666)	2021-02-16 20:54:30 -08:00
baijumeswani	01dfa8e125	Support non tuple return values from torch.nn.module (#6660) * Support dictionary, namedtuples and Hugging Face ModelOutput type for model return values	2021-02-16 20:48:32 -08:00
Scott McKay	02c7873b0e	Update ORT model conversion script to support custom ops (#6701) * Add support for custom ops library to the ORT model conversion script Simplify model conversion now that we read ops from the ORT format model. Enable custom ops in the python bindings if custom ops are turned on in a minimal build. * Add test of model conversion involving custom ops.	2021-02-17 12:52:39 +10:00
Thiago Crepaldi	7f33671ade	Handle multiple devices scenarios (#6672) * Handle multiple devices scenarios	2021-02-16 18:22:30 -08:00
Thiago Crepaldi	7ee5baa60d	Remove monkey patch for PyTorch Nightly + ORTTrainer (#6659)	2021-02-16 17:24:50 -08:00
Tianlei Wu	9b446d5f7e	Longformer Attention CUDA kernel memory Improvements (#6646) * Integrate memory improvements from NVidia * compute max_global_num before buffer allocation * update conversion script to support transformers 4.0 * update benchmark script for creating dummy inputs for different batch_size * Use a wrapper of cuda event to avoid memory leak	2021-02-16 14:54:48 -08:00
Edward Chen	b09a370218	Address warning in data_types_internal.h (#6704) Address "unreachable code" warning in data_types_internal.h.	2021-02-16 12:41:48 -08:00
RandySheriffH	c36ee4bd40	Rename Python packaging pipelines (#6682) * rename pipelines * resync and rename * resync master * rename package id * remove OrtPackageId which is for nuget Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-02-16 11:43:03 -08:00
RandySheriffH	497eef8d3d	remove omp (#6675) Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-02-16 11:42:32 -08:00
Changming Sun	d48a4c0a54	Add CG step to nuget GPU pipeline (#6678)	2021-02-16 09:46:20 -08:00
Changming Sun	9a01174037	Disable some unit tests for training (#6699)	2021-02-16 09:45:59 -08:00
Scott McKay	33279250b5	Update a couple of usages of args.minimal_build to check for not specified vs empty list correctly. (#6688)	2021-02-16 14:46:51 +10:00
Tracy Sharpe	37b83acd76	MLAS: add uint8_t NHWC max pooling (#6684) Add support to transform graphs containing uint8 MaxPool to a custom operator that supports NHWC format that can be more easily vectorized.	2021-02-15 10:05:29 -08:00
Edward Chen	a35b30e237	Change BuildKernelDefConstraintsFunctorFromTypeList struct to BuildKernelDefConstraintsFromTypeList function. (#6674)	2021-02-15 09:16:07 +10:00

4331 commits