* remove memory copy between CUDA and TRT
* add info to RegisterExecutionProvider input
* use new IDeviceAllocator for trt allocator
* remove SetDefaultInputsMemoryType from TRT EP
* remove onnx-tensorrt 5.0
* add submodule onnx-tensorrt branch 5.1
* remove redundancy
* Update transformer_memcpy.cc
* Update tensorrt_execution_provider.cc
* switch to TensorRT 5.1.5.0
* update python binding
* disable failed test case on TensorRT
* Update activation_op_test.cc
* upgrade to TensorRT container 19.06
* update according to feedback
* add comments
* remove tensorrt allocator and use cuda(gpu) allocator
* update onnx-tensorrt submodule
* change ci build cuda directory name
* Update cuda for python wheels
* Update cuda for python wheels
* Update cuda for python wheels
* Update azure-pipelines-py-packaging.yml
* Update to cuda 10
* Only test win gpu
* Update cuda for python wheels
* Use manylinux2010 image to build linux python wheels
Allow wheels built to truly be compliant with a manylinux policy
* Finer control over when Python tests are run
* add --build_wheel to linux pipeline, instead of run_build.sh
* add --build_wheel to all ci configurations
* update per review comments
* Adding versioned dlls to tar/zip packages
* fix syntax error
* fix version name of dylib
* minor fix in the target
* update pattern for versioned dylib files
* add variables for version number and git commit hash
* fix typo
* fix typo
* some logging
* some logging
* some logging
* some logging
* some logging
* some logging
* some logging
* some logging
* some more edits to see generic scripts can print
* working
* fixing windows git hash
* try quoted echo
* fix git rev-parse
* echo without quotes
* removed commit hash from artifact filename, added long commit hash as a file inside
* added the missing commit id parameter
* fix windows pipeline
* keep only win 64, others disabled
* remove disabling conditions
* cross compile x86 linux
* fix comments
* install multilib for ubuntu cross compile
* remove tailing slash
* fix -fPIC relocations for x86 target too
* add asm make flag
* fix x86 compile err
* test x86 with zlib and png
* Disable zlib from x86
* install x86 python header
* remove cross-compiling changes
* test 32bit ubuntu
* add x86 ubuntu docker file
* add x86 as arch parametr for docker build
* config pipeline
* avoid dotnet install
* install cmake
* skip dep install
* use latest ubuntu
* install latest cmake
* install x86 deps
* configure cmake
* install ninja
* correct ninja dir
* apt get re2c
* install onnx
* set processor x86
* disable warning
* skip test
* disable test
* disable test
* find lib
* fix typo
* restore test
* disable backend model test
* disable test
* fix test err
* stop installing onnx
* disable onnx test on x86
* restore yml
* mergef with master yml
* cancel needless config setting
* enable x86 flag
* restore all onnx tests
* fix yml typo
* install onnx
* add back x86 flag
* disable cases
* disable case
* disable cases
* add macro to disable cases
* fix typo
* print platform
* remove condition
* use CUDA 9.1 for both linux and windows
* added powershell scripts for cuda props setup/cleanup
* fix yml syntax
* set path to cuda9.1 bin
* correct label
* ad --cuda_version
* added some log to browse the directory
* disabled jobs other than win gpu to save some resource while testing
* add msvc_toolset=14.11
* added more logs
* log the props file
* remove setting vcvarsall
* try some modificationi n build.py
* fix typo
* let the config Step modify envoronment
* set some more env vars manually
* try reordering vcvars after cuda props copying
* use single script for build and test
* single line script
* remove extra quote
* cleanup trial changes
* added linux packaging template and pipeline
* Update linux-packaging-pipeline.yml for Azure Pipelines
* fix path seperator
* update copy command for linux
* fixed linux gpu artifact name, added mac build
* fixed linux gpu artifact name, added mac build
* fixed vmImage syntax
* use 1 model at a time for macos
* added onnx test on Mac CI
* some refactor of the pipeline scripts
* try fixing the tensorproto for x86 build
* try __cdecl
* try C-style cast
* use ORTAPICALL
* put the deleter under the namespace