Italian phonology

The phonology of Italian describes the sound system—the phonology and phonetics—of Standard Italian and its geographical variants.

Consonants
Notes:


 * Between two vowels, or between a vowel and an approximant or a liquid, consonants can be both singleton or geminated. Geminated consonants shorten the preceding vowel (or block phonetic lengthening) and the first geminated element is unreleased. For example, compare   ('fate') with   ('fact'). However, , , , ,  are always geminated word-internally. Similarly, nasals, liquids, and sibilants are pronounced slightly longer in medial consonant clusters.
 * ,, and are the only consonants that cannot be geminated.
 * are laminal denti-alveolar, commonly called "dental" for simplicity.
 * are pre-velar before.
 * have two variants:
 * Dentalized laminal alveolar (commonly called "dental" for simplicity), pronounced with the blade of the tongue very close to the upper front teeth, with the tip of the tongue resting behind lower front teeth.
 * Non-retracted apical alveolar . The stop component of the "apical" affricates is actually laminal denti-alveolar.
 * are apical alveolar in most environments.  are laminal denti-alveolar  before  and palatalized laminal postalveolar  before .  is velar  before.
 * and do not contrast before  and, where they are pronounced  and , respectively.
 * and are alveolo-palatal. In a large number of accents,  is a fricative.
 * Intervocalically, single is realised as a trill with one or two contacts. Some literature treats the single-contact trill as a tap . Single-contact trills can also occur elsewhere, particularly in unstressed syllables. Geminate  manifests as a trill with three to seven contacts.
 * The phonetic distinction between and  is neutralized before consonants and at the beginning of words: the former is used before voiceless consonants and before vowels at the beginning of words; the latter is used before voiced consonants (meaning  is an allophone of  before voiced consonants). The two can contrast only between vowels within a word, e.g. [ˈfuːzo] 'melted' vs. [ˈfuːso] 'spindle'. According to Canepari, though, the traditional standard has been replaced by a modern neutral pronunciation which always prefers  when intervocalic, except when the intervocalic s is the initial sound of a word, if the compound is still felt as such: for example, presento  ('I foresee', with pre meaning 'before' and sento meaning 'I perceive') vs presento  ('I present'). There are many words in which dictionaries now indicate that both pronunciations, with  and with, are acceptable. Word-internally between vowels, the two phonemes have merged in many regional varieties of Italian, either as  (Northern-Central) or  (Southern-Central).

Vowels


In Italian there is no phonemic distinction between long and short vowels, but vowels in stressed open syllables, unless word-final, are long at the end of the intonational phrase (including isolated words) or when emphasized. Adjacent identical vowels found at morpheme boundaries are not resyllabified, but pronounced separately ("quickly rearticulated"), and they might be reduced to a single short vowel in rapid speech.

Although Italian contrasts close-mid and open-mid  vowels in stressed syllables, this distinction is neutralised in unstressed position, where only the close-mid vowels occur. The height of these vowels in unstressed position is context-sensitive; they are somewhat lowered in the vicinity of more open vowels. The distinction between close-mid and open-mid vowels is lost entirely in a few Southern varieties of Regional Italian, especially in Northern Sicily (e.g. Palermo), where they are realized as open-mid, as well as in some Northern varieties (in particular in Piedmont), where they are realized as mid.

Word-final stressed is found in a small number of words: però, ciò, paltò. However, as a productive morpheme, it marks the first person singular of all future tense verbs (e.g. dormirò 'I will sleep') and the third person singular preterite of first conjugation verbs (parlò 's/he spoke', but credé 's/he believed', dormì 's/he slept'). Word-final unstressed is rare,  found in onomatopoeic terms (babau), loanwords (guru), and place or family names derived from the Sardinian language (Gennargentu, Porcu).

When the last phoneme of a word is an unstressed vowel and the first phoneme of the following word is any vowel, the former vowel tends to become non-syllabic. This phenomenon is called synalepha and should be taken into account when counting syllables, e.g. in poetry.

In addition to monophthongs, Italian has diphthongs, but these are both phonemically and phonetically simply combinations of the other vowels, with some being very common (e.g. ), others being rarer (e.g. ) and some never occurring within native Italian words (e.g. ). None of these diphthongs are however considered to have distinct phonemic status because their constituents do not behave differently than they would in isolation (and all occur in isolation), unlike the diphthongs in some languages like English and German. Grammatical tradition makes a distinction between 'falling' and 'rising' diphthongs; however, since rising diphthongs are composed of one semiconsonantal sound or  and one vowel sound, they are not actually diphthongs. The practice of referring to them as 'diphthongs' has been criticised by phoneticians like Luciano Canepari.

Onset
Italian allows up to three consonants in syllable-initial position, though there are limitations:

CC
 * + any voiceless stop or . E.g. spavento ('fright')
 * + any voiced stop, . E.g. srotolare ('unroll')
 * , or any stop + . E.g. frana ('landslide')
 * , or any stop except + . E.g. platano ('planetree')
 * , or any stop or nasal + . E.g. fiume ('river'), vuole ('he/she wants'), siamo ('we are'), suono ('sound')
 * In words of foreign (mostly Greek) origin which are only partially assimilated, other combinations such as (e.g. pneumatico),  (e.g. mnemonico),  (e.g. tmesi), and  (e.g. pseudo-) occur.

As an onset, the cluster + voiceless consonant is inherently unstable. Phonetically, word-internal s+C normally syllabifies as  rospo 'toad',  Trastevere (neighborhood of Rome). Phonetic syllabification of the cluster also occurs at word boundaries if a vowel precedes it without pause, e.g. la storia 'the history', implying the same syllable break at the structural level,, thus always latent due to the extrasyllabic , but unrealized phonetically unless a vowel precedes. A competing analysis accepts that while the syllabification is accurate historically, modern retreat of i-prosthesis before word initial +C (e.g. erstwhile con isforzo 'with effort' has generally given way to con sforzo) suggests that the structure is now underdetermined, with occurrence of  or  variable "according to the context and the idiosyncratic behaviour of the speakers."

CCC The last combination is however rare and one of the approximants is often vocalised, e.g. quieto, continuiamo
 * + voiceless stop or + . E.g. spregiare ('to despise')
 * + + . E.g. sclerosi ('sclerosis')
 * + voiced stop + . E.g. sbracciato ('with bare arms'), sdraiare ('to lay down'), sgravare ('to relieve')
 * + + . E.g. sbloccato ('unblocked')
 * or any stop + + . E.g. priego (antiquated form of prego 'I pray'), proprio ('(one's) own' / proper / properly), pruovo (antiquated form of provo 'I try')
 * or any stop or nasal + + . E.g. quieto ('quiet'), continuiamo ('we continue')

Nucleus
The nucleus is the only mandatory part of a syllable (for instance, a 'to, at' is a word) and must be a vowel or a diphthong. In a falling diphthong the most common second elements are or  but other combinations such as idea, trae  may also be interpreted as diphthongs. Combinations of with vowels are often labelled diphthongs, allowing for combinations of  with falling diphthongs to be called triphthongs. One view holds that it is more accurate to label as consonants and  as consonant-vowel sequences rather than rising diphthongs. In that interpretation, Italian has only falling diphthongs (phonemically at least, cf. Synaeresis) and no triphthongs.

Coda
Italian permits a small number of coda consonants. Outside of loanwords, the permitted consonants are:


 * The first element of any geminate, e.g. tutto ('everything'), avvertire ('to warn').
 * A nasal consonant that is either (word-finally) or one that is homorganic to a following consonant. E.g. Con ('with'), un poco  ('a little'), ampio ('ample').
 * Liquid consonants and . E.g. per ('for'), alto ('high').
 * (though not before fricatives). E.g. pesca ('peach').

There are also restrictions in the types of syllables that permit consonants in the syllable coda. explains that neither geminates, nor coda consonants with "rising sonority" can follow falling diphthongs. However, "rising diphthongs" (or sequences of an approximant and a following vowel) may precede clusters with falling sonority, particularly those that stem historically from an obstruent+liquid onset. For example:


 * biondo ('blond')
 * chiosco ('kiosk')
 * chiostro ('cloister')
 * chioccia ('broody hen')
 * fianco ('hip')

Syntactic gemination
Word-initial consonants are geminated after certain vowel-final words in the same prosodic unit. There are two types of triggers of initial gemination: some unstressed particles, prepositions, and other monosyllabic words, and any oxytonic polysyllabic word. As an example of the first type, casa ('house') is pronounced but a casa ('homeward') is pronounced. This is not a purely phonological process, as no gemination is cued by the la in la casa 'the house', and there is nothing detectable in the structure of the preposition a to account for the gemination. This type normally originates in language history: modern a, for example, derives from Latin AD, and today's geminate in is a continuation of what was once a simple assimilation. Gemination cued by final stressed vowels, however, is transparently phonological. Final stressed vowels are short by nature, if a consonant follows a short stressed vowel the syllable must be closed, thus the consonant following the final stressed vowel is drawn to lengthen: parlò portoghese 's/he spoke Portuguese' vs. parla portoghese  's/he speaks Portuguese'.

In standard Italian, syntactic gemination occurs mainly in the following two cases:


 * After end-stressed words (such as sanità, perché, poté, morì and so on).
 * After the words a, che, chi, come, da, do, dove, e, fa, fra, fu, gru, ha, ho, ma, me, mo'  (in the phrase a mo' di), no, o, qua, qualche, qui, so, sopra, sta, sto, su, te, tra, tre, tu, va, vo.

Syntactic gemination is the normal native pronunciation in Central Italy (both "stress-induced" and "lexical") and Southern Italy (only "lexical"), including Sicily and Corsica (France).

In Northern Italy and Sardinia, speakers use it inconsistently because the feature is not present in the dialectal substratum and is not usually shown in the written language unless a new word is produced by the fusion of the two: "chi sa"-> chissà ("who knows" in the sense of goodness knows).

Regional variation
The above IPA symbols and description refer to standard Italian, based on a somewhat idealized version of the Tuscan-derived national language. As is common in many cultures, this single version of the language was pushed as neutral, proper, and eventually superior, leading to some stigmatization of varying accents. Television news anchors and other high-profile figures had to put aside their regional Italian when in the public sphere. However, in more recent years the enforcement of this standard has fallen out of favor in Italy, and news reporters, actors, and the like are now more free to deliver their words in their native regional variety of Italian, which appeals to the Italian population's range of linguistic diversity. The variety is still not represented in its wholeness and accents from the South are maybe to be considered less popular, except in shows set in the South and in comedy, a field in which Naples, Sicily and the South in general have always been present. Though it still represents the basics for the standard variety, the loosened restrictions have led to Tuscan being seen for what it is, just one dialect among many with its own regional peculiarities and qualities, many of which are shared with Umbria, Southern Marche and Northern Lazio.


 * In Tuscany (though not in standard Italian, which is derived from, but not equivalent to, Tuscan dialect), voiceless stops are typically pronounced as fricatives between vowels. That is, → : e.g. i capitani 'the captains', a phenomenon known as the gorgia toscana 'Tuscan throat'. In a much more widespread area of Central Italy, postalveolar affricates are deaffricated when intervocalic so that in Cina ('in China') is pronounced  but la Cina ('the China') is , and /ˈbat͡ʃo/ bacio 'kiss' is [ˈbaːʃo] rather than Standard Italian [ˈbaːt͡ʃo]. This deaffrication can result in minimal pairs distinguished only by length of the fricatives, [ʃ] issuing from /t͡ʃ/ and [ʃː] from geminate /ʃʃ/:  lacerò 's/he ripped' vs.  lascerò 'I will leave'.
 * In nonstandard varieties of Central and Southern Italian, some stops at the end of a syllable completely assimilate to the following consonant. For example, a Venetian might say tecnica as or  in violation of normal Italian consonant contact restrictions, while a Florentine would likely pronounce tecnica as, a Roman on a range from  to  (in Southern Italian, complex clusters usually are separated by a vowel: a Neapolitan would say , a Sicilian ). Similarly, although the cluster  has developed historically as  through assimilation, a learned word such as ictus will be pronounced  by some,  by others.
 * In popular (non-Tuscan) Central and Southern Italian speech, and  tend to always be geminated ( and ) when between two vowels, or a vowel and a sonorant (,, , or ). Sometimes this is also used in written language, e.g. writing robba instead of roba ('property'), to suggest a regional accent, though this spelling is considered incorrect. In Tuscany and beyond in Central and Southern Italy, intervocalic non-geminate  is realized as  (parallel to  realized as  described above).
 * The two phonemes and  have merged in many varieties of Italian: when between two vowels within the same word, it tends to always be pronounced  in Northern Italy, and  in Central and Southern Italy (except in the Arbëreshë community). A notable example is the word casa ('house'): in Northern Italy it is pronounced ; in Southern-Central Italy it's pronounced.
 * In several Southern varieties, voiceless stops tend to be voiced if following a sonorant, as an influence of the still largely spoken regional languages: campo is often pronounced, and Antonio  is frequently.

The various Tuscan, Corsican and Central Italian dialects are, to some extent, the closest ones to Standard Italian in terms of linguistic features, since the latter is based on a somewhat polished form of Florentine.

Phonological development
Very little research has been done on the earliest stages of phonological development in Italian. This article primarily describes phonological development after the first year of life. See the main article on phonological development for a description of first year stages. Many of the earliest stages are thought to be universal to all infants.

Phoneme inventory
Word-final consonants are rarely produced during the early stages of word production. Consonants are usually found in word-initial position, or in intervocalic position.

17 months
Most consonants are word-initial: They are the stops, , , and and the nasal. A preference for a front place of articulation is present.

21 months
More phones now appear in intervocalic contexts. The additions to the phonetic inventory are the voiced stop, the nasal , the voiceless affricate , and the liquid.

24 months
The fricatives, , and are added, primarily at the intervocalic position.

27 months
Approximately equal numbers of phones are now produced in word-initial and intervocalic position. Additions to the phonetic inventory are the voiced stop and the consonant cluster. While the word-initial inventory now tends to have all the phones of the adult targets (adult production of the child's words), the intervocalic inventory tends to still be missing four consonants or consonant clusters of the adult targets:, , , and.

Stops are the most common manner of articulation at all stages and are produced more often than they are present in the target words at around 18 months. Gradually this frequency decreases to almost target-like frequency by around 27 months. The opposite process happens with fricatives, affricates, laterals and trills. Initially, the production of these phonemes is significantly less than what is found in the target words and the production continues to increases to target-like frequency. Alveolars and bilabials are the two most common places of articulation, with alveolar production steadily increasing after the first stage and bilabial production gently decreasing. Labiodental and postalveolar production increases throughout development, while velar production decreases.

6–10 months
Babbling becomes distinct from previous, less structured vocal play. Initially, syllable structure is limited to CVCV, called reduplicated babbling. At this stage, children’s vocalizations have a weak relation to adult Italian and the Italian lexicon.

11–14 months
The most-used syllable type changes as children age, and the distribution of syllables takes on increasingly Italian characteristics. This ability significantly increases between the ages of 11 and 12 months, 12 and 13 months, and 13 and 14 months. Consonant clusters are still absent. Children's first ten words appear around month 12, and take CVCV format (e.g. mamma 'mother', papà 'father').

18–24 months
Reduplicated babbling is replaced by variegated babbling, producing syllable structures such as C1VC2V (e.g. cane 'dog', topo 'mouse'). Production of trisyllabic words begins (e.g. pecora 'sheep', matita 'pencil'). Consonant clusters are now present (e.g. bimba 'female child', venti 'twenty'). Ambient language plays an increasingly significant role as children begin to solidify early syllable structure. Syllable combinations that are infrequent in the Italian lexicon, such as velar-labial sequences (e.g. capra 'goat' or gamba 'leg') are infrequently produced correctly by children, and are often subject to consonant harmony.

Stress patterns
In Italian, stress is lexical, meaning it is word-specific and partly unpredictable. Penultimate stress (primary stress on the second-to-last syllable) is also generally preferred. This goal, acting simultaneously with the child's initial inability to produce polysyllabic words, often results in weak-syllable deletion. The primary environment for weak-syllable deletion in polysyllabic words is word-initial, as deleting word-final or word-medial syllables would interfere with the penultimate stress pattern heard in ambient language.

Phonological awareness
Children develop syllabic segmentation awareness earlier than phonemic segmentation awareness. In earlier stages, syllables are perceived as a separate phonetic unit, while phonemes are perceived as assimilated units by coarticulation in spoken language. By first grade, Italian children are nearing full development of segmentation awareness on both syllables and phonemes. Compared to those children whose mother tongue exhibits closed syllable structure (CVC,CCVC, CVCC, etc.), Italian-speaking children develop this segmentation awareness earlier, possibly due to its open syllable structure (CVCV, CVCVCV, etc.). Rigidity in Italian (shallow orthography and open syllable structure) makes it easier for Italian-speaking children to be aware of those segments.

Sample texts
Provided here is a rendition of the Bible, Luke 2, 1–7, as read by a native Italian speaker from Milan. As a northerner, his pronunciation lacks syntactic doubling ( instead of ) and intervocalic ( instead of ). The speaker realises as  in some positions.

2:1 In quei giorni, un decreto di Cesare Augusto ordinava che si facesse un censimento di tutta la terra. 2 Questo primo censimento fu fatto quando Quirino era governatore della Siria. 3 Tutti andavano a farsi registrare, ciascuno nella propria città. 4 Anche Giuseppe, che era della casa e della famiglia di Davide, dalla città di Nazaret e dalla Galilea si recò in Giudea nella città di Davide, chiamata Betlemme, 5 per farsi registrare insieme a Maria, sua sposa, che era incinta. 6 Proprio mentre si trovavano lì, venne il tempo per lei di partorire. 7 Mise al mondo il suo primogenito, lo avvolse in fasce e lo depose in una mangiatoia, poiché non c'era posto per loro nella locanda.

The differences in pronunciation are underlined in the following transcriptions; the velar is an allophone of  and the long vowels are allophones of the short vowels, but are shown for clarity.

A rough transcription of the audio sample is:

2:1 2  3  4  5  6  7

The Standard Italian pronunciation of the text is:

2:1 2  3  4  5  6  7