The following XML shows a single record produced by audio categorization.
<record>
<timestamp>
<startTime iso8601="1970-01-01T00:00:01Z">1000000</startTime>
<duration iso8601="PT00H00M00.500000S">500000</duration>
<peakTime iso8601="1970-01-01T00:00:01Z">1000000</peakTime>
<endTime iso8601="1970-01-01T00:00:01.500000Z">1500000</endTime>
</timestamp>
<trackname>AudioCategorize.Result</trackname>
<AudioCategorizeResult>
<id>e8d84838-bdf2-4b9b-9a92-e7e42b249103</id>
<category>Music</category>
<confidence>80</confidence>
</AudioCategorizeResult>
</record>
The record contains the following information:
The id element contains the identifier for the audio segment.
The category element shows how the audio segment was classified. The categories are pre-defined and this value can be:
DialTone
DTMF-*, DTMF-0, DTMF-1, DTMF-2, and so on. These values indicate that the audio contains a DTMF tone. For example, DTMF-2 indicates the tone for the "2" button.Music
Noise
Silence
Speech
NOTE: Dial tone and DTMF tone detection are enabled only when you process audio with a sample rate of 8KHz.
confidence element provides the confidence score for the classification, from 0 to 100, where 100 indicates the greatest confidence.|
|