mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-16 21:00:14 +00:00
169 lines
5.3 KiB
Markdown
169 lines
5.3 KiB
Markdown
# C# Sample: ResNet50 v2
|
|
|
|
The sample walks through how to run a pretrained ResNet50 v2 ONNX model using the Onnx Runtime C# API.
|
|
|
|
The source code for this sample is available [here](Program.cs).
|
|
|
|
## Prerequisites
|
|
|
|
To run this sample, you'll need the following things:
|
|
|
|
1. Install [.NET Core 3.1](https://dotnet.microsoft.com/download/dotnet-core/3.1) or higher for you OS (Mac, Windows or Linux).
|
|
2. Download the [ResNet50 v2](https://github.com/onnx/models/blob/master/vision/classification/resnet/model/resnet50-v2-7.onnx) ONNX model to your local system.
|
|
3. Download [this picture of a dog](dog.jpeg) to test the model. You can also use any image you like.
|
|
|
|
## Getting Started
|
|
|
|
Now we have everything set up, we can start adding code to run the model on the image. We'll do this in the main method of the program for simplicity.
|
|
|
|
### Read paths
|
|
|
|
Firstly, let's read the path to the model and path to the image we want to test in through program arguments:
|
|
|
|
```cs
|
|
string modelFilePath = args[0];
|
|
string imageFilePath = args[1];
|
|
```
|
|
|
|
### Read image
|
|
|
|
Next, we will read the image in using the cross-platform image library [ImageSharp](https://www.nuget.org/packages/SixLabors.ImageSharp):
|
|
|
|
```cs
|
|
using Image<Rgb24> image = Image.Load<Rgb24>(imageFilePath, out IImageFormat format);
|
|
```
|
|
|
|
Note, we're specifically reading the `Rgb24` type so we can efficiently preprocess the image in a later step.
|
|
|
|
### Resize image
|
|
|
|
Next, we will resize the image to the appropriate size that the model is expecting; 224 pixels by 224 pixels:
|
|
|
|
```cs
|
|
using Stream imageStream = new MemoryStream();
|
|
image.Mutate(x =>
|
|
{
|
|
x.Resize(new ResizeOptions
|
|
{
|
|
Size = new Size(224, 224),
|
|
Mode = ResizeMode.Crop
|
|
});
|
|
});
|
|
image.Save(imageStream, format);
|
|
```
|
|
|
|
Note, we're doing a centered crop resize to preserve aspect ratio.
|
|
|
|
### Preprocess image
|
|
|
|
Next, we will preprocess the image according to the [requirements of the model](https://github.com/onnx/models/tree/master/vision/classification/resnet#preprocessing):
|
|
|
|
```cs
|
|
Tensor<float> input = new DenseTensor<float>(new[] { 1, 3, 224, 224 });
|
|
var mean = new[] { 0.485f, 0.456f, 0.406f };
|
|
var stddev = new[] { 0.229f, 0.224f, 0.225f };
|
|
for (int y = 0; y < image.Height; y++)
|
|
{
|
|
Span<Rgb24> pixelSpan = image.GetPixelRowSpan(y);
|
|
for (int x = 0; x < image.Width; x++)
|
|
{
|
|
input[0, 0, y, x] = ((pixelSpan[x].R / 255f) - mean[0]) / stddev[0];
|
|
input[0, 1, y, x] = ((pixelSpan[x].G / 255f) - mean[1]) / stddev[1];
|
|
input[0, 2, y, x] = ((pixelSpan[x].B / 255f) - mean[2]) / stddev[2];
|
|
}
|
|
}
|
|
```
|
|
|
|
Here, we're creating a Tensor of the required size `(batch-size, channels, height, width)`, accessing the pixel values, preprocessing them and finally assigning them to the tensor at the appropriate indicies.
|
|
|
|
### Setup inputs
|
|
|
|
Next, we will create the inputs to the model:
|
|
|
|
```cs
|
|
var inputs = new List<NamedOnnxValue>
|
|
{
|
|
NamedOnnxValue.CreateFromTensor("data", input)
|
|
};
|
|
```
|
|
|
|
To check the input node names for an ONNX model, you can use [Netron](https://github.com/lutzroeder/netron) to visualise the model and see input/output names. In this case, this model has `data` as the input node name.
|
|
|
|
### Run inference
|
|
|
|
Next, we will create an inference session and run the input through it:
|
|
|
|
```cs
|
|
using var session = new InferenceSession(modelFilePath);
|
|
using IDisposableReadOnlyCollection<DisposableNamedOnnxValue> results = session.Run(inputs);
|
|
```
|
|
|
|
### Postprocess output
|
|
|
|
Next, we will need to postprocess the output to get the softmax vector, as this is not handled by the model itself:
|
|
|
|
```cs
|
|
IEnumerable<float> output = results.First().AsEnumerable<float>();
|
|
float sum = output.Sum(x => (float)Math.Exp(x));
|
|
IEnumerable<float> softmax = output.Select(x => (float)Math.Exp(x) / sum);
|
|
```
|
|
|
|
Other models may apply a Softmax node before the output, in which case you won't need this step. Again, you can use Netron to see the model outputs.
|
|
|
|
### Extract top 10
|
|
|
|
Next, we will extract the top 10 class predictions:
|
|
|
|
```cs
|
|
IEnumerable<Prediction> top10 = softmax.Select((x, i) => new Prediction { Label = LabelMap.Labels[i], Confidence = x })
|
|
.OrderByDescending(x => x.Confidence)
|
|
.Take(10);
|
|
```
|
|
|
|
### Print results
|
|
|
|
Next, we will print the top 10 results to the console:
|
|
|
|
```cs
|
|
Console.WriteLine("Top 10 predictions for ResNet50 v2...");
|
|
Console.WriteLine("--------------------------------------------------------------");
|
|
foreach (var t in top10)
|
|
{
|
|
Console.WriteLine($"Label: {t.Label}, Confidence: {t.Confidence}");
|
|
}
|
|
```
|
|
|
|
## Running the program
|
|
|
|
Now the program is created, we can run it will the following command:
|
|
|
|
```
|
|
dotnet run [path-to-model] [path-to-image]
|
|
```
|
|
|
|
e.g.
|
|
|
|
```
|
|
dotnet run ~/Downloads/resnet50-v2-7.onnx ~/Downloads/dog.jpeg
|
|
```
|
|
|
|
Running this on the following image:
|
|
|
|

|
|
|
|
We get the following output:
|
|
|
|
```
|
|
Top 10 predictions for ResNet50 v2...
|
|
--------------------------------------------------------------
|
|
Label: Golden Retriever, Confidence: 0.9212826
|
|
Label: Kuvasz, Confidence: 0.026514154
|
|
Label: Clumber Spaniel, Confidence: 0.012455719
|
|
Label: Labrador Retriever, Confidence: 0.004103844
|
|
Label: Saluki, Confidence: 0.0033182495
|
|
Label: Flat-Coated Retriever, Confidence: 0.0032045357
|
|
Label: English Setter, Confidence: 0.002513516
|
|
Label: Brittany, Confidence: 0.0023459378
|
|
Label: Cocker Spaniels, Confidence: 0.0019343802
|
|
Label: Sussex Spaniel, Confidence: 0.0019247672
|
|
```
|