onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

Author	SHA1	Message	Date
Thiago Crepaldi	335edaa2c4	Merge pull request #6973 from microsoft/thiagofc/merge-ortmodule-into-master Introduce ORTModule training API to ONNX Runtime	2021-03-17 10:30:06 -07:00
Chen Fu	03885af5a0	Adding prepacking to QLinearMatMul (#6980 ) Reuse the same prepacking logic in mat mul integer, to enable prepacking weight for QLinearMatMul. Currently only prepacking 2D matrix weights	2021-03-17 09:28:24 -07:00
Tracy Sharpe	90642e7eac	MLAS: more code cleanup (#7036 ) Change int32_t->ptrdiff_t when interacting with the threadpool. Migrate more code from MlasMaskMoveAvx->MlasMaskMoveTableAvx. Update more code to use FUNCTION_ENTRY macro.	2021-03-17 09:22:55 -07:00
jeyblu	8e0970a020	dnnl format tag fix (#6943 )	2021-03-17 00:24:12 -07:00
Guoyu Wang	0f9383e583	[NNAPI EP] Add support of QlinearAveragePool (#6915 ) * [NNAPI EP] Add support of QlinearAveragePool * Merge master and modify UT and fix some code issues after running UT	2021-03-17 00:08:54 -07:00
Ryan Hill	a0fdabd23f	Rename all of the ONNX_NAMESPACE types for shared providers to be back in the ONNX_NAMESPACE with their original names. (#7034 )	2021-03-16 21:18:50 -07:00
Tianlei Wu	73d085ccdd	add slow test (#7035 )	2021-03-16 20:49:51 -07:00
Thiago Crepaldi	3348b8485f	Post merge update for ORTModule Changes include: * Revert Event Pool changes * Add copyright and revert unrelated changes * Add DLPack as submodule and remove to_dlpack and from_dlpack from public API * Update golden numbers for DHP Parallel tests * Update ORTTrainer unit test numbers * Rollback to DLPack v0.3 * Disable flaky test * Update third party notices and CG manifest file * Minor refactoring of ORTValue API	2021-03-16 20:11:59 -07:00
Changming Sun	ed2d441a2e	Update ORT server build pipeline (#7030 ) 1. Migrated it to Ed's new docker build script 2. Use python 3.6 instead, because it is the default one in ubuntu 18.04 3. Move the "pip install" command to the docker image build stage(instead of when running the image)	2021-03-16 18:02:09 -07:00
stevenlix	2e38bf5e23	add TensorRT configuration to OrtProviderOptions (#6979 ) * add TensorRT configurations in provider options * Update ort_test_session.cc * Update tensorrt_execution_provider.cc * Update onnxruntime_pybind_state.cc * Update main.cc	2021-03-16 17:16:28 -07:00
Ori Levari	783acb144f	Ignored return value SDL bug fix (#6451 )	2021-03-16 15:00:08 -07:00
Changming Sun	2361cb99b6	Remove CentOS CI pipeline (#6997 )	2021-03-16 10:55:03 -07:00
Tiago Koji Castro Shibata	975e4efb8a	Package ARM artifacts (#6805 )	2021-03-16 10:12:48 -07:00
Hariharan Seshadri	3f0e50f14d	Cleanup in RoiAlign (#7012 )	2021-03-16 07:21:57 -07:00
Jinsong Ji	087d96200d	HIP_CLANG_FLAGS replaces HIP_HCC_FLAGS for ROCm later than 4.0 (#6955 ) * HIP_CLANG_FLAGS replaces HIP_HCC_FLAGS for ROCm later than 4.0 HIP_HCC_FLAGS was deprecated in ROCm4.x	2021-03-15 23:28:00 -07:00
Tracy Sharpe	5480f8dd1d	MLAS: misc cleanup (#7013 ) Miscellaneous changes to synchronize the style used over time: Remove unneeded PFN types in favor of FN*. Switch more functions over to using the common FUNCTION_ENTRY macro. Switch logistic/tanh kernels over to the style used in TransKernelFma3.asm.	2021-03-15 18:24:18 -07:00
Ye Wang	4e670f7ab1	Support larger hidden size in Attention Cuda kernel (#7002 ) * Support larger hidden size in Attention Cuda kernel * Update attention_transpose.cu * review comments * fix typo and add check in quantization * update readme	2021-03-15 15:46:10 -07:00
Hariharan Seshadri	27ac88201a	Support a CPU kernel for Celu (#6995 )	2021-03-14 20:37:40 -07:00
Nat Kershaw (MSFT)	d0cca35308	Add README for docs (#6626 ) * Add README for docs * Add section on contributing to docs to CONTRIBUTING.md	2021-03-12 15:14:40 -08:00
Edward Chen	e5e922ec1e	Fix some warning option override warnings from dependencies. (#6983 )	2021-03-12 11:37:15 -08:00
sfatimar	4c9ccb0f1a	[OpenVino] getcapability design (#6863 ) * get capability design refactor Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com>	2021-03-12 11:18:33 -08:00
Changming Sun	4161758058	Remove openmp related packaging pipeline (#6991 ) 1. Remove openmp related packaging pipelines and build jobs. 2. Set continueOnError to true for the TSAUpload tasks. Their service is unstable recently. 3. Update Ubuntu 16 docker images to Ubuntu 18, in prepare for getting C++17 support 4. Cherry-pick the changes in 1.7.1 to the master: updating CFLAGS/CXXFLAGS to strip out debug symbols	2021-03-12 10:02:59 -08:00
Shucai Xiao	c588d5d13a	Add rocm execution provider to prover_list (#6306 ) * code changes to add rocm ep to ep_list	2021-03-12 07:51:08 -08:00
Alberto Magni	031587814b	Add support to save onnx graph with external initializers file. (#6911 ) Add functionality to the Graph class to be dumped to protobuf using an external binary file for the float initializers. This change is meant to avoid hitting the 2GB protobuf limit when dumping large graphs. This limit was particularly easy to exceed when dumping graphs after auto-diff. The use of the external file is limited to initializers larger than a user-specified threshold. This gives the possibility to users to include in the onnx file shape constants used by Reshape and Transpose used by Shape Inference.	2021-03-12 09:15:25 +00:00
Hariharan Seshadri	12b5ab3bab	Update CUDA custom op unit tests to account for recent ORT change (#6971 )	2021-03-11 22:22:45 -08:00
Xavier Dupré	694389a85d	Automate generation of python documentation (#6909 ) Co-authored-by: xavier dupré <xavier.dupre@gmail.com>	2021-03-11 19:02:45 -08:00
baijumeswani	f7df2f805b	Resolve HTTP Error 503: Service Unavailable for MNIST dataset	2021-03-11 13:53:54 -08:00
Edward Chen	aa60a8368f	Update type reduction operator type usage processors set. (#6976 )	2021-03-11 09:22:53 -08:00
Ye Wang	b57a85d863	Support symbolic shape infer in transformers tool (#6899 ) * fusion support runtime edge shape checking * trim ctor * add test * fix * Update test_shape_infer_helper.py * use torch input size as dynamic axis hints * check dir * update * support longformerattention * update and add support for bert ops * trim * review comments * review comments	2021-03-10 21:37:12 -08:00
Edward Chen	f4796e1953	Enable type reduction for Range, ReverseSequence, ScatterND, Split, and Unique CPU kernels. (#6963 )	2021-03-10 16:20:25 -08:00
Chen Fu	4a4488baae	Release buffers for prepacked tensors (#6820 ) Unsolved problems: 1. One test failure was caused by a bug in Cudnn rnn kernels, when they can allocate a buffer and partially initialize it, the garbage data near tail of the buffer caused problem in some of the hardware. To attack this problem in a broader sense, should we add code in our allocators, and during a memory fuzzing test, fill an allocated buffer with garbage before returning to the caller? 2. Prepacking is used more widely than we know. For instance, Cudnn rnn kernels also cache their weights. They mix several weight tensors together into a single buffer, and never touch the original weight tensor anymore. This is the same idea with pre-pack, but they didn't override the virtual function, and they never tried to release those weight tensors, leading to memory waste. It also seems to me that there are some other kernels have similar behavior. Wonder how much memory we can save if we try to cleanup those too. 3. Turning off memory pattern planning does increase memory fragmentation, leading to out of memory error in some training test cases. Perhaps we can revisit the idea of pushing kernels-creation stage earlier, and then during initializer deserialization, we only avoid tracing those that will be prepacked.	2021-03-10 14:07:20 -08:00
Guoyu Wang	2f307dd223	Fix possible fd leak in NNAPI (#6966 )	2021-03-10 11:20:08 -08:00
Thiago Crepaldi	89d450697b	Introduce ORTModule training API to ONNX Runtime	2021-03-10 10:48:10 -08:00
Ori Levari	9f84819f32	Update onnxruntime_perf_test.exe to accept free dimension overrides (#6962 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2021-03-10 10:45:19 -08:00
David Medine	f723ff2285	fixed type to experimental session constructor (#6950 ) * fixed type to experimental session constructor Co-authored-by: David Medine <david.medine@brainproducts.com>	2021-03-10 10:18:27 -08:00
Tianlei Wu	4884eee642	Attention fusion detect num_heads and hidden_size automatically (#6920 )	2021-03-10 10:17:00 -08:00
Sergii Dymchenko	ce403eea98	Add *args support for ORTModule inputs (#6883 )	2021-03-10 10:15:23 -08:00
Zhang Lei	acfe7ac4ce	Implement QLinearAveragePool with unit tests. (#6896 ) Implement QLinearAveragePool with unit tests.	2021-03-10 10:02:01 -08:00
Weixing Zhang	1e13e2666e	Support ROCM EP for ORTModule (#6967 ) 1. Disable external allocator for ROCM EP since it is not supported yet. 2. For AMD GPU, the EP name is ROCMExecutionProvider	2021-03-10 10:00:35 -08:00
Tracy Sharpe	a8b897f710	MLAS: quantized GEMM update (#6916 ) Various updates to the int8_t GEMMs: 1) Add ARM64 udot kernel to take advantage of dot product instructions available in newer cores. Some models run 4x faster than the stock implementation we used before. 2) Refactor the x64 kernels to share common code for AVX2(u8u8/u8s8/avxvnni) vs AVX512(u8u8/u8s8/avx512vnni) to reduce binary size. 3) Extend kernels to support per-column zero points for matrix B. This is not currently wired to an operator.	2021-03-10 09:54:43 -08:00
Edward Chen	bc319bd7aa	Fix warning from setting multiple MSVC warning level options. (#6917 ) Fix warning from setting multiple MSVC warning level options. Replace an existing /Wn flag instead of always appending a new one.	2021-03-10 09:27:54 -08:00
Vincent Wang	8468099f93	Use DLPack for Graph Inputs and External Outputs of YieldOp (#6968 )	2021-03-10 09:13:45 -08:00
Vincent Wang	3f579facbc	Relax atol for some ORTModule UTs (#6969 )	2021-03-10 08:59:56 -08:00
Edward Chen	d5ed3e7fba	Enable type reduction in EyeLike, Mod, random.cc CPU kernels. (#6960 ) * Update EyeLike CPU kernel. * Update Mod CPU kernel. * Update Multinomial CPU kernel. * Slight improvement to Pad CPU kernel binary size. * Update RandomNormal[Like], RandomUniform[Like] CPU kernels.	2021-03-10 15:32:56 +10:00
Tianlei Wu	89916fdb05	fix stream sync issue (#6954 )	2021-03-09 20:57:18 -08:00
Wei-Sheng Chin	bdaea1d9ae	Update baseline due to loss scale fix (#6948 )	2021-03-10 09:46:15 +08:00
Raduan Al-Shedivat	743a93faf3	Fix broken link in server usage and remove absolute path from dockerfiles readme (#6926 )	2021-03-09 11:54:21 -08:00
Weixing Zhang	534adbb065	Support ORTModule on ROCm EP (#6945 )	2021-03-09 10:10:57 -08:00
ytaous	3b2847b2d8	Add UT correctness and address comments for previous symbolic shape PR (#6930 ) * address comments * disable assert * testing relaxed tolerance * testing relaxed tolerance * testing relaxed tolerance * per comments * modify UT * remove imports * remove prints Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-09 10:10:18 -08:00
George Nash	ba51774a1f	Add GPU support for DNNL endpoint (#6741 ) * Added code for Relugrad with GPU support. Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com> * Add GPU support for DNNL ConvGrad Signed-off-by: George Nash <george.nash@intel.com> * Add GPU support for DNNL MaxPoolGrad Updates to MaxPool for training with GPU Update oneDNN to version 1.8.1 Signed-off-by: George Nash <george.nash@intel.com> * Fixed issues found durring code review - error in code comment - using auto when the direct type would have been better - removed ternary operators that were returning bool values Signed-off-by: George Nash <george.nash@intel.com> Co-authored-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>	2021-03-09 09:40:42 -08:00

1 2 3 4 5 ...

4470 commits