onnxruntime/onnxruntime/test/shared_lib
Chi Lo c964da7ea2
FasterTransformer model wrapper using custom op (#15013)
### Description
<!-- Describe your changes. -->
We are introducing the FasterTransfomer model-level integration using
ORT [custom op runtime
wrapper](https://github.com/microsoft/onnxruntime/pull/13427).
In order to make the FT wrapper/integration work, two things need to be
done:

- New API `KernelInfoGetConstantInput_tensor`. (Done in this PR)
During custom op kernel initialization, it needs to get the model
weights (saved as node's constant inputs) ready for FT's weights
instantiation. What's why we need to add this new API to make kernel
info capable of getting constant inputs.

- Custom op and custom op kernel to wrap FT model. (Will provide in
onnxruntime extensions or inference examples)
During custom op kernel initialization, it can fetch attributes from
kernel info to determine which kind of FT model instance create. During
custom op kernel compute/inference, it can get input/output from kernel
context and then assign input/output buffers for model instance to run.
2023-03-20 09:05:30 -07:00
..
cuda_ops.cu
custom_op_utils.cc Add logging APIs for custom operators (#14416) 2023-03-17 15:05:28 -07:00
custom_op_utils.h Changes to support standalone custom ops in a minimal build. (#14497) 2023-03-01 11:22:54 +10:00
fns_candy_style_transfer.c
onnx_protobuf.h
test_allocator.cc
test_fixture.h
test_inference.cc FasterTransformer model wrapper using custom op (#15013) 2023-03-20 09:05:30 -07:00
test_io_types.cc Fix type mismatch when ORT_ENABLE_STREAM is off (#14324) 2023-01-18 13:45:00 +10:00
test_model_loading.cc Exclude affinity-setting logic from minimal build (#13967) 2022-12-15 14:43:42 -08:00
test_nontensor_types.cc
test_ort_format_models.cc Changes to support standalone custom ops in a minimal build. (#14497) 2023-03-01 11:22:54 +10:00
test_run_options.cc
test_session_options.cc [CUDA] Update fused MHA to support flash attention and causal mask (#13953) 2022-12-31 10:33:54 -08:00
utils.cc
utils.h