A Ted's Voice Academy Resource

A Complete Guide to the IPA for Singers

Symbols, sounds, and the languages singers sing — a free interactive reference for singers, voice teachers, and choral directors.

Foundations

The International Phonetic Alphabet is a system in which one symbol represents one sound. It was developed by linguists in the late nineteenth century to give scholars a way to discuss the sounds of any human language without relying on the inconsistent spelling conventions of individual languages.

For singers, the IPA does something specific and practical: it names the exact vowel or consonant the singer is aiming at, independent of how the word is spelled. English spelling is famously unreliable — though, through, cough, and bough each end with a different vowel or consonant sound despite looking similar on the page. IPA removes the ambiguity.

Italian, German, French, Spanish, Latin, Russian, Czech, and Hebrew each have their own spelling-to-sound patterns, and learning every one of them from scratch would be overwhelming. IPA gives the singer one system that works across all of them.

IPA does not replace the ear — it sharpens what the ear is already doing. It does not prescribe how a sound should be colored, shaped, or styled. That remains the singer's choice, the composer's intent, and the tradition's convention. What IPA does is name the target.

How to read IPA

Brackets

IPA symbols appear in square brackets […] or between slashes /…/. Slashes indicate a phonemic transcription; brackets a phonetic one. For most singing purposes the two are used interchangeably.

Stress

A primary stress mark ˈ appears before the stressed syllable. hello = [hɛˈloʊ]. A secondary stress mark ˌ indicates a lighter secondary stress in longer words.

Syllables & length

A period marks a syllable boundary: [ˈka.ro]. The length mark ː means the vowel is held longer — important in German, where [iː] and [ɪ] are distinct phonemes.

Nasalization & diacritics

A tilde above a vowel ([ã]) means the vowel is nasalized — central to French. Small marks above or below modify a symbol in specific ways. A full list lives in the Reference section.

IPA names sounds. How those sounds are colored — how bright or dark, how forward or back, how open or closed — is a matter of style, genre, language tradition, and artistic intent.

Handling ambiguity

Several situations cause honest disagreement among singers and coaches. None has a single right answer; all benefit from awareness.

How do I tell closed from open e and o in Italian?

Italian has four vowels where English-speakers often hear two: closed [e] and open [ɛ]; closed [o] and open [ɔ]. The score does not usually mark which is which. Published transcriptions are the most reliable source. When working without one, a good Italian dictionary (or Castel's transcriptions) will show the distinction. Regional Italian accents vary; the standard operatic convention tends toward Tuscan pronunciation.

Which "ah" should I use — [a], [ä], or [ɑ]?

Singers often encounter three different ah vowels across languages: [a] (bright, front), [ɑ] (dark, back), and [ä] (central, between the two). Italian and Spanish casa uses [a]. English father uses [ɑ]. German Vater is often closer to [ä]. Classical choral tradition often defaults to [a] as a shared ensemble target regardless of language, but this is a convention, not a universal rule.

Which r should I use when there are several options in one language?

English can use any of four r-sounds depending on genre, tradition, and period. German can use any of three. French has two that often coexist in the same piece of repertoire. The choice is determined by style and tradition, not by the letter on the page.

How should I handle regional and historical variants?

Spanish differs substantially between Castilian and Latin American varieties. German pronunciation in Bach-era repertoire differs from modern German. French sung diction in classical repertoire retains features modern spoken French has abandoned. When a score or program identifies a specific tradition, follow it; when it does not, follow the mainstream convention for the genre.

The Sound Inventory

Two charts, every vowel and consonant covered in this guide. Click any symbol to hear it.

The Vowel Trapezoid

A map of tongue position. Horizontal axis shows front-to-back tongue placement; vertical axis shows how open the mouth and jaw are. Gold symbols are rounded (including the "mixed vowels" of French and German).

Interactive IPA vowel trapezoid Every IPA monophthong used in this guide, arranged by tongue height and frontness. Unrounded vowels are shown in navy, rounded in gold. Click any symbol to hear it. FRONT CENTRAL BACK Close Near-close Close-mid Mid Open-mid Near-open Open i y ɨ ʉ ɯ u ɪ ʏ ʊ e ø ɘ ɵ ɤ o ə ɛ œ ɜ ɞ ʌ ɔ æ ɐ a ɶ ä ɑ ɒ
Unrounded Rounded Click any symbol to hear it

The "Ah" family

The single English word "ah" can refer to three distinct IPA targets. This is one of the most common sources of confusion in sung diction.

Rounded & mixed vowels

Some languages — most notably French and German — produce front rounded vowels, combining front tongue position with rounded lips. They are often called mixed vowels because they mix features of both. Singers approaching them typically start by producing the vowel's tongue position first, then gradually rounding the lips without letting the tongue move backward.

Diphthongs

A diphthong is two vowel qualities occurring within a single syllable, connected by a glide. One vowel carries the sustained pitch; the other appears as a quick release.

[aɪ] my, light

Sustain [ɑ], release to [ɪ].

[eɪ] day, lane

Sustain [e], release to [ɪ].

[oʊ] go, home

Sustain [o], release to [ʊ].

[aʊ] now, sound

Sustain [ɑ], release to [ʊ].

[ɔɪ] boy, voice

Sustain [ɔ], release to [ɪ].

[ɪə] here, near (non-rhotic)

Non-rhotic English.

[ɛə] there, care (non-rhotic)

Non-rhotic English.

[ʊə] tour, sure (non-rhotic)

Non-rhotic English.

Timing is a stylistic choice. Classical and choral traditions typically delay the second vowel until the very end of the note. Musical theater and pop often release earlier or blend the two vowels more smoothly. Barbershop and close-harmony ensembles unify diphthong timing precisely, often with a specific count or gesture.

The Consonant Chart

Consonants are classified by manner of articulation (how the airstream is shaped) and place of articulation (where in the vocal tract the constriction occurs). Within each cell, the symbol on the left is unvoiced; the symbol on the right is voiced.

Interactive IPA consonant chart Consonants organized by manner (rows) and place of articulation (columns). Left symbol in a cell is unvoiced; right is voiced. Bilabial Labio- dental DentalAlveolar Post- alveolar RetroflexPalatalVelar Uvular / Glottal PlosiveNasalTrill / FlapFricativeAffricateApproximantLateral p b β t d c ɟ k g ʔ m ɱ n ɲ ŋ ɾ r ʁ f v θ ð s z ʃ ʒ ʂ ʐ ç ʝ x ɣ h ts dz w ɹ j l ʎ
Unvoiced Voiced Left symbol in a pair is unvoiced; right is voiced. Click any symbol to hear it.

The R family

The letter r represents strikingly different sounds across languages, styles, and historical periods. All four principal variants below are legitimate within their contexts. The choice is determined by the language, the repertoire, the ensemble, and the tradition — not by a universal rule.

Syllabic consonants & the glottal stop

Syllabic consonants

A consonant marked with a small vertical line below ([l̩], [n̩], [m̩]) functions as the nucleus of a syllable — carrying the syllable's weight without a separate vowel. English: bottle = [ˈbɒt.l̩]; button = [ˈbʌt.n̩]. German uses these in -en endings.

The glottal stop

A complete closure of the vocal folds producing a brief silence, released into the following vowel. English uh-oh = [ˈʔʌʔoʊ]. In careful German sung diction it appears before every word-initial vowel: ich atme sung as [ɪç ˈʔaːt.mə], not [ɪˈça:tmə]. Hebrew uses the glottal stop freely (the letter aleph).

The Languages

Nine languages, from the cleanest sung diction (Italian, Spanish) to the most intricate (Russian, Czech, Hebrew). Each panel covers the vowels, consonants, distinctive conventions, and example transcriptions you'll meet in the repertoire.

English spelling is unreliable and English sounds vary widely across dialects and genres. What follows is the mainstream sung treatment, with notes where specific genres diverge.

Vowels

Consonants

Notes

  • All principal English vowels appear here. Where a vowel has both a rhotic and non-rhotic treatment, classical/art-song traditions typically favor non-rhotic; musical theater, pop, folk, country, gospel, rock, and contemporary CCM typically favor rhotic.

Genre conventions

Classical & art song
Often non-rhotic; [æ] may be modified toward [a] or [ɛ] for vocal ease; diphthong second vowels are delayed to the end of the note.
Choral (mixed genre)
Varies by director. Classical-leaning choirs favor non-rhotic and unified vowel targets; contemporary choirs favor rhotic and natural diction.
Musical theater
Rhotic; diphthongs released earlier to match conversational rhythm; belting may modify vowels substantially for volume.
Pop, rock, CCM
Rhotic; vowels often kept close to conversational speech to preserve authenticity; consonants frequently softened.
Folk and country
Rhotic; regional accent often preserved as a feature; diphthongs may be elongated on the first vowel for expressive effect.
Gospel
Rhotic; vowel choices often reflect the performer's speaking accent; melismatic passages may alter vowel identity for musical shape.

Example transcriptions

  • Amazing grace
    [əˈmeɪ.zɪŋ ˈgɹeɪs]
  • The water is wide
    [ðə ˈwɔ.tər ɪz ˈwaɪd]
  • Shenandoah
    [ˌʃɛn.ənˈdoʊ.ə]

Working with IPA

Reading a published transcription

Professional transcriptions (Castel, Adams, Wall, IPA Source) usually present a phrase in three parallel lines: the original text, the IPA, and a literal or poetic translation. Conventions vary: some sources mark every syllable division, some mark none; some use primary and secondary stress marks, some only primary; some mark closed/open vowels explicitly, some leave them to the singer. The first task with any published transcription is to note what it marks and what it leaves to interpretation.

Marking your own score

  1. Identify the language and the pronunciation tradition. For Latin, specify Ecclesiastical, Germanic, or Classical.
  2. Mark the sustained vowel of each important syllable in IPA above the word. Reserve marking for vowels that are non-obvious or easy to miss.
  3. Note diphthongs with an arrow showing which vowel gets the time, or a box around the sustained vowel with a small corner mark for the release.
  4. Flag foreign-language vowels or consonants that need special attention — mixed vowels, nasals, trilled r, glottal onsets.
  5. In ensemble music, confirm your vowel targets match what the ensemble has agreed to use.

When transcriptions disagree

  • Closed vs. open e/o in Italian. Consult a pronunciation-marked Italian dictionary or a Castel transcription. When these disagree, the opera-house convention usually follows Tuscan pronunciation.
  • R in classical English. Non-rhotic is traditional in British classical singing; rhotic is common in American classical singing. Follow the ensemble's convention, or the tradition of the repertoire.
  • Liaison in French. In sung French, when in doubt, liaise. Modern spoken French tends to drop liaison that classical sung French preserves.
  • Mute e in French. If a note is written on the syllable, the e is pronounced as [ə]. If no note, it is silent.
  • Germanic vs. Ecclesiastical Latin. For Bach and pre-Bach German sacred music, some performance practice favors Germanic. Most modern practice still uses Ecclesiastical. Follow the conductor's preference.

Ensemble considerations

Vowel unification

When multiple voices sing the same word, their vowel targets must align closely. Small differences between [a] and [ä], or between [e] and [ɛ], become audible as chord color problems — the harmony sounds out of tune even when pitch is correct. Classical choirs typically unify toward bright, Italianate vowels. Barbershop and close-harmony ensembles unify to extremely tight tolerances.

Diphthong timing

When multiple voices sustain a diphthong, the release timing must match. A diphthong released at different moments creates an unintended ensemble blur. Classical ensembles typically delay the release until the end of the note. Barbershop ensembles specify the release to a precise beat or subdivision.

Consonant placement

Final consonants must land together. An ensemble singing a final [t] with five different release points produces an obvious flaw even when everything else is right. Conductors commonly mark the exact beat on which final consonants release.

Reference

Search any symbol, scan the full alphabet, and check the diacritics you need.

Every symbol at a glance

Every IPA symbol used in this guide. Click any symbol to hear it.

Vowels (29)
Nasalized vowels — French (4)
Consonants (45)
Diacritics & modifiers (9)
ː
Length. Long vowel. [iː] vs. [i]. Phonemic in German and Latin; rare in Italian/Spanish.
ˈ
Primary stress. Placed before the stressed syllable.
ˌ
Secondary stress. Lighter stress in longer words.
.
Syllable boundary. Separates syllables. [ˈka.ro].
˜
Nasalization. Above a vowel. Central to French: dans = [dɑ̃].
ʲ
Palatalization. Russian soft consonants. n → [nʲ].
̩
Syllabic consonant. Below. [l̩] in bottle; [n̩] in button.
̪
Dental articulation. Below a consonant.
̥
Voiceless. Below a normally-voiced symbol; indicates devoicing.

Canonical resources

Print references

  • Nico CastelItalian Belcanto Opera Libretti, French Opera Libretti, and other language-specific transcription volumes. The industry standard for opera transcription.
  • David AdamsA Handbook of Diction for Singers. Italian, German, French. Widely used in conservatories.
  • Madeleine MarshallThe Singer's Manual of English Diction. The standard English diction reference.
  • John MoriartyDiction. Italian, Latin, French, Spanish. Compact and widely used.
  • Joan Wall et al.Diction for Singers (with audio). Good for self-study.
  • Ron JeffersTranslations and Annotations of Choral Repertoire (multi-volume).
  • Leslie De'AthUniform Pronunciation for Choirs.

Digital references

"IPA does not tell you how to sing. It tells you what you are aiming at."

Working on diction for a piece?

Voice coaching, choral diction, and ensemble prep across all the languages above.

Work with Ted