Visual detection models localize an object of interest in an image by returning a box that bounds that object, as well as the type of that object, also referred to as the class. A detector can detect multiple images of different classes per image. For each detection, a detector outputs a confidence score that is independent of any other detections.
The output object in Hive detection APIs lists each detected object, including:
- The geometric description of the detected bounding box.
- The predicted class for the detection.
- For some model’s, the confidence score for the detection.
Hive's detector models include the logo detection model and common object detection model.
{
"output": [
{
"time": 0,
"bounding_poly": [
{
"vertices": [
{
"x": 257.5439780950546,
"y": 155.76384043693542
},
{
"x": 288.42563331127167,
"y": 155.76384043693542
},
{
"x": 288.42563331127167,
"y": 197.2337465286255
},
{
"x": 257.5439780950546,
"y": 197.2337465286255
}
],
"dimensions": {
"top": 155.76384043693542,
"bottom": 197.2337465286255,
"left": 257.5439780950546,
"right": 288.42563331127167
},
"classes": [
{
"class": "orange",
"score": 0.586775541305542,
"label": "orange"
}
],
"meta": {
"score": 0.586775541305542,
"type": "coco"
}
}
]
}
]
}
Name | Description |
---|---|
time | Timestamp in seconds of the frame extracted from original media. Always 0 for images. |
bounding_poly | List containing all detections. |
vertices | Four coordinates representing the detected bounding box. Starts at the top-left of the box and lists remaining vertices in clockwise order. |
dimensions | Four sides representing the detected bounding box. top: y-dimension of the top side. bottom: y-dimension of the bottom side. right: x-dimension of the right side. left: x-dimension of the left side. |
classes: class | Predicted class |
classes: score | Confidence score for the detection. |
Note that vertices and dimensions are expressed in the standard image processing coordinate system: