The following XML shows a single record produced by object recognition.
<output>
  <record>
    ...
    <trackname>object.Result</trackname>
    <ObjectResult>
      <id>ed1f3af7-9f00-434f-8bc4-a328ff4c67fc</id>
      <identity>
        <identifier>HPE</identifier>
        <database>logos</database>
        <imagelabel>752fdee3b5c478f7314eca75365c4094</imagelabel>
        <confidence>100</confidence>
        <metadata>
          <item>
            <key>CompanyName</key>
            <value>HPE</value>
          </item>
        </metadata>
      </identity>
      <boundary>
        <point>
          <x>106</x>
          <y>100</y>
        </point>
        <point>
          <x>271</x>
          <y>101</y>
        </point>
        <point>
          <x>272</x>
          <y>183</y>
        </point>
        <point>
          <x>107</x>
          <y>183</y>
        </point>
      </boundary>
    </ObjectResult>
  </record>
</output>
The record contains the following information:
The id element provides a unique identifier for the recognized object. Media Server issues an ID for each appearance of an object. If you are recognizing objects in video and consecutive frames show the same object in a near-identical location, all records related to that appearance will have the same ID.
For example, if an object appears in the same location for a hundred consecutive video frames, the engine uses the same ID for each record in the data track and the single record in the result track. The record in the result track will have a timestamp that covers all of the frames.
If the object disappears and then reappears, the engine considers this as a new detection and produces a new ID and a new record in the result track.
The identity element represents a match between the ingested media and an object in your training database. It contains the following elements:
  The identifier element provides the identifier of the object that was detected in the ingested media.
  The database element provides the name of the database in which the object exists.
  The imagelabel element provides the label of the training image that was the best match to the analyzed media (many training images can be associated with a single object in the database). This element is present only for 2D objects. For 3D objects, all of the training images are used to define a single representation of the object.
  The confidence element provides the confidence score for the match (from 0 to 100).
  The metadata element provides metadata that you associated with the object when you trained Media Server. If there is no metadata in the training database, this element is omitted.
The boundary element provides the position of the object in the ingested media, as a set of points that form a polygon that surrounds the object.
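The elements described above can be read with any XML parser. The following is a minimal sketch in Python, using only the standard library, and is not part of Media Server. The function names parse_object_results and bounding_box are hypothetical, and SAMPLE is a trimmed copy of the record shown earlier with the elided content left out. The sketch treats imagelabel and metadata as optional, because the text above says they can be absent.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the example record (the elided "..." part is left out).
SAMPLE = """<output>
  <record>
    <trackname>object.Result</trackname>
    <ObjectResult>
      <id>ed1f3af7-9f00-434f-8bc4-a328ff4c67fc</id>
      <identity>
        <identifier>HPE</identifier>
        <database>logos</database>
        <imagelabel>752fdee3b5c478f7314eca75365c4094</imagelabel>
        <confidence>100</confidence>
        <metadata>
          <item><key>CompanyName</key><value>HPE</value></item>
        </metadata>
      </identity>
      <boundary>
        <point><x>106</x><y>100</y></point>
        <point><x>271</x><y>101</y></point>
        <point><x>272</x><y>183</y></point>
        <point><x>107</x><y>183</y></point>
      </boundary>
    </ObjectResult>
  </record>
</output>"""


def parse_object_results(xml_text):
    """Return one dict per ObjectResult element found in xml_text."""
    root = ET.fromstring(xml_text)
    results = []
    for result in root.iter("ObjectResult"):
        identity = result.find("identity")
        if identity is None:
            continue  # nothing to report without a match
        # metadata is omitted when the training database has none,
        # in which case this comprehension yields an empty dict.
        metadata = {
            item.findtext("key"): item.findtext("value")
            for item in identity.iterfind("metadata/item")
        }
        # The boundary is a polygon given as a list of (x, y) points.
        boundary = [
            (int(point.findtext("x")), int(point.findtext("y")))
            for point in result.iterfind("boundary/point")
        ]
        results.append({
            "id": result.findtext("id"),
            "identifier": identity.findtext("identifier"),
            "database": identity.findtext("database"),
            # imagelabel is None when the element is absent (3D objects).
            "imagelabel": identity.findtext("imagelabel"),
            "confidence": float(identity.findtext("confidence")),
            "metadata": metadata,
            "boundary": boundary,
        })
    return results


def bounding_box(points):
    """Axis-aligned box (left, top, right, bottom) around a polygon."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return min(xs), min(ys), max(xs), max(ys)


if __name__ == "__main__":
    for match in parse_object_results(SAMPLE):
        print(match["identifier"], match["confidence"],
              bounding_box(match["boundary"]))
```

Reducing the polygon to an axis-aligned box is a convenience for drawing or cropping; the polygon itself is the position that the record reports.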
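Because every record for one appearance of an object carries the same id, records can be grouped by that value to recover each appearance. The sketch below assumes the records have already been parsed into dictionaries; the "frame" key, the short id values, and the function name group_by_appearance are hypothetical and used only for illustration.

```python
from collections import defaultdict


def group_by_appearance(records):
    """Group parsed records by the value of their id element."""
    appearances = defaultdict(list)
    for record in records:
        appearances[record["id"]].append(record)
    return dict(appearances)


# Hypothetical parsed records: the object is seen in frames 0 to 2,
# disappears, then reappears in frame 10 and so receives a new id.
records = [
    {"id": "id-1", "frame": 0},
    {"id": "id-1", "frame": 1},
    {"id": "id-1", "frame": 2},
    {"id": "id-2", "frame": 10},
]

if __name__ == "__main__":
    for appearance_id, group in group_by_appearance(records).items():
        frames = [r["frame"] for r in group]
        print(appearance_id, "frames", min(frames), "to", max(frames))
```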