onnxruntime/onnxruntime/core/codegen
KeDengMS 60208463a9
[NupharEP] Enable parallel schedule (#2505)
* [NupharEP] Enable parallel schedule
* Update TVM with the fix to TVM threadpool to use OpenMP if possible
* Add parallel schedule when trying to vectorize
With this change, BERT squad perf on a 4-core (8 HT) CPU goes from 187ms to 150ms

* Address CR, docs and cmake update

* Doc fix

* Fix mkl

* Fix TVM windows build when using mklml
2019-11-28 08:35:56 -08:00
..
common Correctly handle implicit inputs for fused nodes (#2390) 2019-11-21 10:27:09 -08:00
mti [NupharEP] Multiple optimizations (#2380) 2019-11-14 10:40:33 -08:00
passes [NupharEP] Enable parallel schedule (#2505) 2019-11-28 08:35:56 -08:00