G.711 vs G.729: VoIP Codec Comparison

Two monitors displaying G.711 and G.729 audio waveforms side by side, the G.711 waveform tall and detailed in blue and the G.729 waveform narrow and compressed in orange, illustrating the bandwidth and quality trade-off between the two VoIP codecs

Almost every SIP trunk negotiation, IP-PBX configuration, and carrier interconnect sooner or later forces the same decision: G.711 or G.729. The two codecs are the practical defaults for digital voice over IP, and they were designed for opposite worlds. G.711 carries voice at near-PSTN quality and consumes 64 kbps per call before any network overhead. G.729 compresses the same speech down to 8 kbps using an aggressive predictive algorithm, at the cost of some quality, some CPU, and a long history of licensing complications.

This article is a head-to-head comparison: bandwidth math, voice quality, CPU and license cost, fax and DTMF compatibility, and a clear framework for choosing one or the other on each leg of a real network. For the broader codec landscape including Opus and AMR, the SIP trunking pillar covers the wider context. For the mechanics of converting between codecs when both end up on the same call, the SBC AMR to G.711 transcoding article goes deep on the transcoding plane.

Key Terms and Concepts
A quick-reference glossary for terms used throughout this article.
G.711The original digital telephony codec, sampling speech at 8 kHz and producing a steady 64 kbps stream. Two compounding variants exist, A-law (used outside North America) and µ-law (used in North America and Japan). G.711 is the lingua franca of the PSTN, most IP-PBX systems, and the vast majority of SIP trunks.
G.729A low-bit-rate speech codec that compresses 8 kHz audio to 8 kbps using CS-ACELP (Conjugate-Structure Algebraic Code-Excited Linear Prediction). Designed for narrow links where bandwidth costs more than the modest CPU and quality trade-off.
G.729a / G.729abVariants of G.729. The “a” annex reduces computational cost while keeping the bit rate; the “b” annex adds VAD (voice activity detection) so the codec stops sending packets during silence.
MOS (Mean Opinion Score)A 1-to-5 perceptual quality score for speech. G.711 sits around 4.2 (near-toll quality), G.729 sits around 3.9, and wideband codecs like Opus reach 4.5 on the same scale.
Codec payloadThe bits of compressed audio inside each RTP packet, before any network headers are added.
ptime (packetization time)The duration of audio carried in one RTP packet, typically 20, 30, or 40 milliseconds. Larger ptime means fewer packets per second and lower header overhead, at the cost of more delay if a packet is lost.
Codec negotiationThe SDP exchange in which two endpoints announce supported codecs in priority order and converge on a common one. When no overlap exists, an SBC must transcode or the call fails.
TranscodingReal-time conversion of an audio stream from one codec to another. The SBC decodes the incoming media to raw PCM and re-encodes it in the destination codec.
CS-ACELPThe compression algorithm behind G.729. It models the human vocal tract instead of sampling the waveform directly, which is why it compresses so aggressively and why it does not survive fax modem tones.
RFC 2833 / RFC 4733The out-of-band DTMF transport that carries digits as RTP events rather than as in-band audio. Mandatory when G.729 is in use because in-band DTMF tones do not survive the compression.

The Engineering Difference

G.711 and G.729 sit at opposite ends of the bandwidth-quality trade-off, and the engineering reasons trace back to when each codec was designed.

G.711 dates to the 1970s and was built for fixed-line digital telephony. It samples speech at 8 kHz, applies a simple logarithmic compression (A-law or µ-law) to each sample, and produces a steady 64 kbps stream. The math is almost trivial: an x86 CPU can encode and decode thousands of concurrent G.711 streams without breaking a sweat, and converting between A-law and µ-law is a lookup table. Voice quality is essentially what you get from a wired PSTN call.

G.729 dates to the mid-1990s, when corporate VoIP was being shoved through 64 kbps or 128 kbps WAN links that also had to carry data. It samples at the same 8 kHz but compresses each 10 ms frame down to 80 bits using CS-ACELP, yielding 8 kbps. The codec analyzes short windows of speech, models the speaker’s vocal tract as a set of filter coefficients, and transmits the coefficients plus an excitation index instead of the waveform itself. The receiver synthesizes the speech back from those parameters. The compression works well for human voice and badly for everything else, which is why G.729 cannot carry fax modem tones, music on hold, or in-band DTMF reliably.

The signaling-side reference for how each codec is offered and accepted lives in the SDP body of the SIP INVITE, defined in RFC 4566. The codec list and its priority order are what the negotiation operates on, and an SBC has full visibility into both sides.

Bandwidth: What the Numbers Actually Look Like

The 64 kbps versus 8 kbps headline figure is the codec payload alone. Real per-call bandwidth on the wire is higher on both sides because every RTP packet carries RTP, UDP, IP, and Ethernet headers that add the same number of bytes regardless of how much audio is inside.

At a typical 20 ms ptime, the math works out roughly as follows. G.711 generates 50 packets per second, each carrying 160 bytes of audio plus about 54 bytes of RTP/UDP/IP/Ethernet headers, for a total of around 85 to 87 kbps per direction. G.729 generates the same 50 packets per second, each carrying 20 bytes of audio plus the same 54 bytes of headers, for a total of around 31 kbps per direction. The compression ratio in the payload is 8 to 1, but on the wire it collapses to roughly 2.8 to 1.

Three implications follow. First, larger ptime values (30 or 40 ms) shrink the header overhead and improve the ratio for both codecs, but they also increase end-to-end delay if a packet is dropped, which matters more for low-bit-rate codecs that have less redundancy to begin with. Second, VAD with G.729b cuts the average bit rate further during silence (often by 30 to 50 percent), which helps on shared links but breaks fax and some IVR prompts. Third, the bandwidth savings of G.729 only matter when the link is actually constrained. Inside a managed LAN, on a modern dedicated SIP trunk, or on any link sized for HD video, the difference between 87 kbps and 31 kbps per call is rarely the decision driver.

Voice Quality

G.711 delivers near-toll-quality voice at a Mean Opinion Score of roughly 4.2, which sits within the same range as a wired PSTN call. G.729 sits at roughly 3.9, which most listeners hear as “good but slightly compressed.” Whispered consonants lose definition, sibilants soften, and the sense of presence drops. Nobody will mistake the call for high-fidelity audio, but conversational speech remains intelligible.

The gap matters in three specific situations. The first is tandem transcoding. Every time a call is transcoded from G.729 to G.711 and back, the MOS drops further because the lossy compression compounds. Two G.729-to-G.711 hops in the same call path push perceived quality noticeably below either codec alone, which is why network architects try to minimize the number of transcoding points in any given route. The second is wideband audio. G.729 caps the call at 8 kHz sampling regardless of what either endpoint is capable of, so a Microsoft Teams or WebRTC client that could otherwise deliver wideband Opus will sound narrowband across a G.729 leg. The third is noisy or stressed speech. CS-ACELP is tuned for clean voice; background noise, simultaneous talkers, and emotional or shouted speech all degrade more sharply on G.729 than on G.711.

If voice quality is the primary KPI for a deployment (executive conferencing, contact center recording, sales call analytics), G.711 wins by default. If bandwidth is the binding constraint, the MOS hit on G.729 is a known and manageable trade-off.

CPU, Licensing, and Where the Cost Hides

G.711 is effectively free in both CPU and license terms. The codec is a public ITU-T recommendation with no patent encumbrance, every commercial PBX and SBC supports it natively, and a modern CPU encodes thousands of concurrent streams without breaking 5 percent utilization.

G.729 has a more complicated history. The codec was patent-encumbered for years, with royalties owed to Sipro Lab Telecom and other holders. The patents began expiring in 2017 and the codec became effectively royalty-free, which prompted several open-source projects to add native G.729 support. The commercial reality has not fully caught up. Many enterprise PBX vendors (Cisco, Avaya, and others) still charge per-channel G.729 licenses, partly as legacy revenue capture and partly because hardware DSP boards still carry per-channel licensing terms. The result is that the per-call cost of G.729 on a managed PBX deployment is often dominated by the license, not by the CPU or the bandwidth.

CPU cost on the codec itself is real but modest on modern hardware. Pure-software G.729 encode/decode is workable for hundreds of concurrent streams on a commodity x86 core. At carrier scale, however, hardware DSP acceleration remains the practical default because predictable per-channel latency matters more than peak throughput. ProSBC pairs with TSBC-HW-TRANS for hardware-accelerated G.729 transcoding when a deployment needs the codec at scale; software G.729 transcoding inside ProSBC is on the product roadmap for late 2026.

Fax, DTMF, and Modem Compatibility

The cleanest way to summarize this section is that G.711 is fax-safe and G.729 is fax-fatal. CS-ACELP models speech, not signaling tones, and the moment a T.30 fax handshake hits a G.729 codec the modem tones are unrecoverable. Any fax-bearing call path must either stay on G.711 end to end with passthrough conditions met (echo cancellation disabled, VAD disabled, near-zero packet loss) or switch to T.38 relay at the SBC. The Fax over IP and T.38 article covers the relay mechanics in detail.

DTMF has a parallel story. In-band DTMF carried as audio survives G.711 but not G.729. The fix is RFC 2833 (now RFC 4733), which carries DTMF as named RTP events outside the audio stream. Most modern SIP endpoints negotiate RFC 2833 by default, but legacy IVRs and some PBX trunk types still emit in-band DTMF that an SBC has to detect and convert. If G.729 is in the path, RFC 2833 is mandatory.

Modem and TTY traffic have the same constraint. Voice-band data of any kind needs G.711 or it does not survive the compression.

When to Use Each: A Decision Framework

The choice usually decides itself once the network is mapped honestly. The pattern that holds across most deployments is straightforward.

Inside a managed LAN, on a dedicated SIP trunk sized for the call load, on a UCaaS connection over a healthy WAN, or on any leg where bandwidth is not the binding constraint, G.711 is the default. Quality is higher, CPU is cheaper, fax and DTMF interop is automatic, and the license is free. The bandwidth savings of G.729 do not justify the trade-offs when the pipe is wide enough.

G.729 still earns its place on three kinds of links. Constrained WAN segments, typically branch offices linked to headquarters over 1 to 4 Mbps shared circuits where every 50 kbps of voice headroom matters; mobile, satellite, or maritime backhaul where bandwidth is metered or genuinely scarce; and legacy carrier interconnects where the upstream provider only offers G.729 on a given termination route. In each of these cases the saved bandwidth is meaningful, the fax and DTMF concerns are handled with T.38 and RFC 2833, and the perceptible quality hit is acceptable for the use case.

The trickier middle ground is high-density hosted PBX or contact center platforms where customers are spread across a mix of network types. The right answer there is rarely a single codec choice for the platform; it is a per-customer or per-trunk codec policy enforced at the SBC.

Mixed Networks and the Role of the SBC

Most real networks have both codecs in play. A North American carrier hands off G.711 µ-law to one customer, a European carrier offers A-law to another, a constrained branch office runs G.729, and a Teams Direct Routing tenant negotiates Opus or Silk on its leg. The SBC is the only network element with the per-leg media control, the SDP visibility, and the policy hooks to mediate cleanly between all of them.

Three SBC functions matter for codec policy. The first is per-leg codec negotiation through SDP rewriting: the SBC can offer one codec to the carrier and a different one to the PBX, then bridge the two with transcoding in the middle. The second is per-NAP codec configuration, where each trunk group carries its own preferred-codec list, fallback list, and packetization rules, all enforced at the boundary without either endpoint needing to know what the other is doing. The third is transcoding capacity itself. Software transcoding inside ProSBC handles G.711 A-law to µ-law natively, which covers the most common North-America-to-Europe interop case. Hardware DSP transcoding through TSBC-HW-TRANS covers G.729, AMR, and the full set of complex codecs at carrier scale.

The deeper mechanics of how a B2BUA SBC actually negotiates and converts codecs between legs are covered in the SBC AMR to G.711 transcoding article. The decision in front of the architect is which codec to enforce on each NAP. The SBC is how that decision becomes real on the wire.

Frequently Asked Questions

How much bandwidth does G.711 actually use per call compared to G.729?

Payload alone, 64 kbps versus 8 kbps. With RTP, UDP, IP, and Ethernet headers at a 20 ms ptime, the real per-call bandwidth is roughly 85 to 87 kbps for G.711 and 31 kbps for G.729 in each direction. Larger ptime values shrink the header overhead and improve both numbers. VAD (G.729b) cuts the G.729 average further during silence.

Is G.729 still licensed?

The core G.729 patents expired in 2017 and the codec is effectively royalty-free today. Many commercial PBX vendors still charge per-channel G.729 licenses, partly as legacy revenue capture and partly because DSP boards still carry per-channel licensing terms. In practice, you should expect a license cost on commercial platforms and no cost on open-source platforms.

Why does G.711 sound so much better than G.729 over wideband Wi-Fi or 4G?

Neither codec is wideband. Both sample at 8 kHz and cap the audio bandwidth around 3.4 kHz. The perceived difference between the two is the compression artifact in G.729, not the sampling rate. To get wideband audio, both endpoints have to negotiate a wideband codec like Opus, G.722, or AMR-WB; G.711 over wideband transport is still narrowband audio.

Can I run G.729 on a fax line?

No. CS-ACELP destroys T.30 fax modem tones. A fax call has to either stay on G.711 end-to-end with passthrough conditions met, or switch to T.38 relay at the SBC. See the Fax over IP and T.38 article for the relay mechanics.

Does tandem transcoding between G.729 and G.711 hurt quality?

Yes. Every additional transcoding hop compounds the lossy compression, and a call that crosses two G.729-to-G.711 transcoding points sounds noticeably worse than either codec alone. The network architect’s job is to minimize the number of transcoding hops in any given route, not to optimize each leg independently.

G.729 or Opus for low-bandwidth links?

Opus generally wins on quality at the same bit rate, scales smoothly from narrowband to wideband, and is unencumbered by licensing. The reason G.729 still appears is interop with legacy PBX systems and carriers that have not added Opus to their codec list. On a greenfield deployment with modern endpoints, Opus is the better default for bandwidth-constrained links.

Conclusion

The G.711 versus G.729 decision is rarely as close as it looks on paper. G.711 wins on quality, CPU, licensing, fax interop, and DTMF reliability, and the bandwidth difference (87 kbps versus 31 kbps per call on the wire) only matters when the link is actually constrained. G.729 still earns its place on narrow WAN segments, mobile backhaul, and legacy carrier interconnects where the bandwidth saving is real. The harder problem in most networks is not picking one codec, it is enforcing a coherent per-leg codec policy across a network where both end up in play. That is the SBC’s job, and the per-NAP policy at the boundary is where the decision becomes operational.

Manage G.711 and G.729 Codec Policy with ProSBC

ProSBC handles SIP signaling, B2BUA media control, per-NAP codec negotiation, and the SDP rewriting needed to bridge G.711 and G.729 between any combination of carriers, PBX systems, and UCaaS platforms. For G.711 A-law to µ-law conversion, ProSBC transcodes in software with no external hardware required. For G.729, AMR, Opus, and the full set of complex codecs at production scale, ProSBC pairs with TSBC-HW-TRANS, a carrier-grade hardware transcoding unit that supports up to 2,744 sessions per 1U enclosure and stacks to 30,000 sessions. Software transcoding for G.729, Opus, and AMR inside ProSBC is on the product roadmap for late 2026.

ProSBC itself runs on VMware, KVM, AWS, Azure, or baremetal, and integrates with the transcoding unit over the same management plane. The codec policy you set on each trunk group is what your network actually does on the wire.

Prefer to evaluate on your own first? Start your 30-day free trial.