
Audio profiles for ONVIF cameras

ONVIF audio profiles let you tune a camera microphone for the scenario in front of you: high-quality voice recording, bandwidth-constrained deployments, real-time monitoring, two-way communication, or simple background capture.


Audio encoding - the four codecs

ONVIF cameras support four audio encoding formats. Each one makes a different trade-off between voice clarity, compression efficiency, device compatibility, and network cost.

AAC_LC - All regions
Quality: High
Bandwidth: Moderate
Advanced Audio Coding - Low Complexity. Best when audio clarity matters and the receiving system supports AAC decoding.

G.711U - North America / Japan
Quality: Moderate
Bandwidth: Low
Telephony-grade voice encoding optimized for North American and Japanese infrastructure. Useful on constrained networks.

G.711A - Europe / Rest of world
Quality: Moderate
Bandwidth: Low
A common ONVIF-friendly voice codec with excellent compatibility across cameras, VMS systems, and intercom workflows.

ADPCM_D - All regions
Quality: Low
Bandwidth: Very low
A lightweight option for secondary audio capture where detecting sound matters more than understanding speech.

Compatibility note: G.711A and G.711U are similar in quality. Choose based on your camera, region, and receiving system. AAC_LC gives better quality, but older devices may not decode it.
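
The trade-offs above can be written down as a small lookup table with a selection rule. The sketch below is illustrative only: the `Codec` class, the `choose_codec` function, and its parameter names are hypothetical and are not part of ONVIF or Banalytics. The attribute values are copied from the codec list, and the rule simply follows the compatibility note.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Codec:
    name: str
    region: str
    quality: str
    bandwidth: str


# Values copied from the codec list in this article.
CODECS = {
    "AAC_LC": Codec("AAC_LC", "All regions", "High", "Moderate"),
    "G.711U": Codec("G.711U", "North America / Japan", "Moderate", "Low"),
    "G.711A": Codec("G.711A", "Europe / Rest of world", "Moderate", "Low"),
    "ADPCM_D": Codec("ADPCM_D", "All regions", "Low", "Very low"),
}


def choose_codec(speech_matters: bool,
                 clarity_is_priority: bool,
                 receiver_decodes_aac: bool,
                 mu_law_region: bool = False) -> Codec:
    """Hypothetical selection rule, not an ONVIF or Banalytics API."""
    # Sound detection only: the lightest codec is enough.
    if not speech_matters:
        return CODECS["ADPCM_D"]
    # AAC_LC only pays off when the receiver can actually decode it.
    if clarity_is_priority and receiver_decodes_aac:
        return CODECS["AAC_LC"]
    # Otherwise pick the G.711 variant that matches the region.
    return CODECS["G.711U" if mu_law_region else "G.711A"]


# An older receiver without AAC support falls back to G.711A.
print(choose_codec(True, True, receiver_decodes_aac=False).name)
```

Keeping the receiver's decoding capability as an explicit input makes the fallback to G.711 visible instead of leaving it to be discovered when playback fails.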

Sample rate - quality vs bandwidth

The sample rate controls how many times per second the audio signal is measured. Higher rates capture more detail, but they also increase bandwidth and storage requirements.

8 kHz - Telephony / minimum
16 kHz - Wideband voice
44.1 kHz - CD-quality audio
48 kHz - Professional / broadcast

Rule of thumb: for voice capture in surveillance contexts, 16 kHz is usually the sweet spot. It is clear enough for speech and light enough for predictable network use.
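
To make the bandwidth and storage cost concrete, the following sketch estimates the audio payload for each sample rate. It assumes 8 bits per sample on a single channel, which is how G.711 encodes audio; AAC_LC and ADPCM_D compress further, so their real bitrates depend on encoder settings and are not modeled here. The function names are illustrative, and the figures exclude network packet overhead.

```python
def payload_kbps(sample_rate_hz: int, bits_per_sample: int, channels: int = 1) -> float:
    """Audio payload in kbit/s for a fixed number of bits per sample."""
    return sample_rate_hz * bits_per_sample * channels / 1000


def megabytes_per_hour(kbps: float) -> float:
    """Storage needed for one hour of audio at the given bitrate."""
    return kbps * 3600 / 8 / 1000


def max_frequency_hz(sample_rate_hz: int) -> float:
    """Highest frequency a sample rate can represent (half the rate)."""
    return sample_rate_hz / 2


for rate in (8_000, 16_000, 44_100, 48_000):
    # Assumption: 8 bits per sample, mono, as in G.711.
    kbps = payload_kbps(rate, bits_per_sample=8)
    print(f"{rate / 1000:g} kHz: audio up to {max_frequency_hz(rate) / 1000:g} kHz, "
          f"{kbps:g} kbit/s, {megabytes_per_hour(kbps):g} MB/hour")
```

Under these assumptions, 8 kHz works out to 64 kbit/s and doubling the sample rate to 16 kHz doubles both the bitrate and the storage per hour.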

Audio profile reference

These profile configurations cover the most common surveillance audio scenarios.

01 HQ - High-quality audio recording
Meetings, compliance recordings, detailed surveillance with voice
Encoding: AAC_LC
Sample rate: 48 kHz
Use this when audio content has legal, compliance, or operational value. It is clear and detailed, but consumes the most bandwidth.

02 LB - Minimal bandwidth surveillance
Constrained networks, remote sites, basic sound monitoring
Encoding: G.711U
Sample rate: 8 kHz
Captures basic sound presence without stressing the network. Good for alarms, loud events, and low-priority audio checks.

03 RT - Real-time monitoring with audio
Live observation, general surveillance with speech
Encoding: G.711A
Sample rate: 16 kHz
The practical default for many deployments: intelligible speech, broad compatibility, and moderate bandwidth usage.

04 BG - Background audio capture
Secondary audio layer, noise detection only
Encoding: ADPCM_D
Sample rate: 8 kHz
Use this when audio is a secondary signal and you only need to know that something loud happened.

05 EV - Enhanced audio in controlled environments
Office speech recording, reception desks, review footage
Encoding: AAC_LC
Sample rate: 44.1 kHz
Near-CD quality capture for clear speech in controlled spaces, with slightly lower bandwidth than the 48 kHz profile.

06 2W - Intercom and two-way audio
Entry intercoms, door cameras, live voice communication
Encoding: G.711A
Sample rate: 16 kHz
A natural fit for live voice interaction: low latency, reliable compatibility, and enough clarity for two-way speech.

Profile summary at a glance

Use this table when you know the scenario and need a quick recommended setting.

Scenario | Codec | Sample rate | Best for
High-quality recording | AAC_LC | 48 kHz | Meetings, voice evidence, compliance
Minimal bandwidth | G.711U | 8 kHz | Remote sites, constrained networks
Real-time monitoring | G.711A | 16 kHz | General surveillance with speech
Background capture | ADPCM_D | 8 kHz | Secondary audio, noise detection only
Enhanced voice | AAC_LC | 44.1 kHz | Office speech, reception desks
Two-way / intercom | G.711A | 16 kHz | Entry intercoms, door cameras

Not sure where to start? Profile 03, G.711A at 16 kHz, is the recommended default for most ONVIF surveillance deployments.
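
For scripted setups, the six profiles can be kept as a small registry keyed by their two-character codes. This is a minimal sketch: `AudioProfile`, `PROFILES`, and `get_profile` are hypothetical names and not a Banalytics or ONVIF interface. The codec and sample-rate values are taken from the profile reference, and profile 03 (RT) is used as the fallback when no code is given.

```python
from typing import NamedTuple, Optional


class AudioProfile(NamedTuple):
    number: str
    code: str
    scenario: str
    codec: str
    sample_rate_hz: int


# Values copied from the profile reference in this article.
PROFILES = {p.code: p for p in (
    AudioProfile("01", "HQ", "High-quality recording", "AAC_LC", 48_000),
    AudioProfile("02", "LB", "Minimal bandwidth", "G.711U", 8_000),
    AudioProfile("03", "RT", "Real-time monitoring", "G.711A", 16_000),
    AudioProfile("04", "BG", "Background capture", "ADPCM_D", 8_000),
    AudioProfile("05", "EV", "Enhanced voice", "AAC_LC", 44_100),
    AudioProfile("06", "2W", "Two-way / intercom", "G.711A", 16_000),
)}

DEFAULT_CODE = "RT"  # profile 03, the recommended starting point


def get_profile(code: Optional[str] = None) -> AudioProfile:
    """Return the profile for a code, or the default when no code is given."""
    if code is None:
        return PROFILES[DEFAULT_CODE]
    try:
        return PROFILES[code.upper()]
    except KeyError:
        # Fail loudly on typos rather than silently using the default.
        raise ValueError(
            f"unknown profile code {code!r}; expected one of {sorted(PROFILES)}"
        ) from None


profile = get_profile()
print(profile.codec, profile.sample_rate_hz)
```

An unknown code raises an error instead of falling back to the default, so a mistyped profile name cannot quietly change what a camera records.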

Configuring ONVIF audio in Banalytics?

Audio profiles are configured per camera in the Banalytics camera setup interface. See the media profiles article for the companion video configuration guide.