The AudioSecurity task detects and labels audio segments containing security-related sounds, including alarms, breaking glass, screams, and gunshots. The task can also identify the type of alarm if it matches a template in its database. If the alarm does not match any of the templates, the task returns the label <UNKNOWN-ALARM>.
The AudioSecurity task requires the EVENTS resource. Ensure that the Model parameter in the [audiosec] section of the configuration file is set to EVENTS.
| Parameter | Description | Required |
|---|---|---|
| Type | The task name. Set to AudioSecurity. | Yes |
| EndTime | The end of an audio section to process. | |
| File | The audio file to process. | Yes, if InputType is File. |
| InputType | The type of audio to process (file, binary data, or stream). | |
| Out | The .ctm file to write the results to. | Yes |
| StartTime | The beginning of an audio section to process. | |
| SugdInputChannels | The channel layout of the input media file. This parameter does not apply when InputType is Stream. | |
| SugdInputFrequency | The sampling rate of the input media file. This parameter does not apply when InputType is Stream. | |
| TemplateList | The list containing the file names of the alarm templates. | |
| TemplatePath | The path to the directory containing the alarm templates specified in the template list. | |

http://localhost:13000/action=AddTask&Type=AudioSecurity&File=C:\data\Sample.wav&Out=SampleSec.ctm

This action uses port 13000 to instruct IDOL Speech Server, which is located on the local machine, to search the Sample.wav file for security-related sounds and to write the results to the SampleSec.ctm file.
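
The example action can also be assembled programmatically. The following Python fragment is a minimal sketch, not part of the product: `build_add_task_url` is a hypothetical helper name, and percent-encoding the parameter values is an assumption made here so that characters such as `:` and `\` in Windows paths are not misread as URL syntax. The host, port, and parameter names are taken from the example above.

```python
from urllib.parse import quote


def build_add_task_url(host, port, params):
    """Hypothetical helper: assemble an AddTask action URL from a parameter dict."""
    # Type and Out are marked as required in the parameter table.
    for required in ("Type", "Out"):
        if required not in params:
            raise ValueError(f"missing required parameter: {required}")
    # Percent-encode every value (assumption: the server accepts
    # percent-encoded values) so that ':' and '\' stay part of the value.
    query = "&".join(
        f"{name}={quote(str(value), safe='')}" for name, value in params.items()
    )
    return f"http://{host}:{port}/action=AddTask&{query}"


url = build_add_task_url(
    "localhost",
    13000,
    {"Type": "AudioSecurity", "File": r"C:\data\Sample.wav", "Out": "SampleSec.ctm"},
)
print(url)
```

The helper only builds the string; sending the request and reading the resulting .ctm file are left out because their details depend on the server configuration.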