Skip to content
— CH. 1 · INTRODUCTION —

Romance languages

~9 min read · Ch. 1 of 7
7 sections
  • Romance languages are spoken today by more than 900 million people as their native tongue. They stretch from the rainforests of Brazil to the villages of Moldova, from the streets of Quebec City to the coast of Equatorial Africa. Yet every one of them traces a direct, unbroken line back to a single ancestor: Vulgar Latin, the everyday speech of soldiers, slaves, merchants, and displaced peoples who lived under the Roman Empire.

    How does a single language become dozens? How do Spanish and Portuguese, so close they can almost be heard as one voice, drift far enough apart to become distinct tongues? Why does Romanian, isolated in the east, still share the same bones as French in the west? And what happened to the dozens of lesser-known Romance languages that survive today only in mountain villages and coastal towns, their futures uncertain?

    This is the story of a linguistic family that built the modern world, and of the forces that made it so varied, so widespread, and so surprising.

  • Between 350 BC and 150 AD, the expansion of the Roman Empire made Latin the dominant native language across continental Western Europe. That dominance was not achieved by culture alone. It was carried by legions, administrators, and settlers who spread into Gaul, Hispania, the Balkans, and North Africa, pulling conquered populations into Latin-speaking orbits.

    But the Latin that spread was not the polished prose of Cicero. It was Vulgar Latin, the spoken register used in markets and barracks rather than in courts and libraries. Many of its speakers were soldiers, slaves, and displaced peoples, more likely to be natives of conquered lands than natives of Rome itself. The distinction between the two registers was real enough that medieval writers even had a name for speaking in each: romanice loqui meant "to speak in Roman," while latine loqui meant speaking in the formal, literary Latin of official life.

    By the sixth and seventh centuries, spoken Latin had reached a tipping point. Innovations had spread widely enough that "the Romanized people of Europe could no longer understand texts that were read aloud or recited to them." Within two hundred years, Latin was a dead spoken language. Romance had replaced it.

    Charlemagne tried to turn back the clock. Holding that the Latin of his age was by classical standards intolerably corrupt, he imposed Classical Latin as the written standard for Western Europe in the late eighth century. The unintended consequence was stark: parishioners could no longer understand the sermons of their own priests. The Council of Tours in 813 issued an edict commanding priests to translate their speeches into the rustica romana lingua, a formal acknowledgment that Romance was now a genuinely separate language from Latin.

  • The colonial empires established by Portugal, Spain, and France from the fifteenth century onward spread their languages so far that roughly two-thirds of all Romance language speakers today live outside Europe. That single fact reshapes the entire map of the family.

    Spanish alone claims almost 500 million native speakers. It is official in nine countries of South America and six in Central America, in Mexico, and across the Caribbean in Cuba, the Dominican Republic, and Puerto Rico. In Africa, it is one of the official languages of Equatorial Guinea. Spanish was even official in the Philippines until 1973, when the 1987 constitution replaced it with English.

    Portuguese tells an equally ambitious story. In its homeland, Portugal, it is spoken by nearly the entire population of around 10 million. But across the Atlantic, Brazil alone accounts for more than 200 million Portuguese speakers, making it the most spoken official Romance language in any single country. Portuguese also carries official status in six African nations: Angola, Cape Verde, Guinea-Bissau, Mozambique, Equatorial Guinea, and Sao Tome and Principe. In Asia, it is co-official in East Timor and Macau, and some 400,000 Portuguese speakers live in Japan, largely descendants of Japanese Brazilians who returned.

    French extended its reach differently. Outside Europe, it is spoken natively most widely in Quebec and parts of New Brunswick and Ontario. In much of Africa it holds official status, with relatively few native speakers but large numbers of second-language users. Today French, Spanish, and Portuguese serve as lingua francas across the world, and two of the six official languages of the United Nations are Romance languages: French and Spanish.

  • Flavio Biondo was the first scholar to note, in 1435, the linguistic kinship between Romanian and Italian, and their shared Latin origin. That connection has only deepened under modern analysis. By most measures, Sardinian and Italian have changed the least from Latin; French has changed the most.

    Romanian's survival is particularly striking. Romania never built a colonial empire, yet the language clung to its eastern territory through centuries of pressure from Germanic tribes, Slavic migrations, Hungarian rule, and Ottoman conquest. The invasion of the Turks and the fall of Constantinople in 1453 ended the Eastern Roman Empire and left the surviving Romance languages of the Balkans in a precarious position. Dalmatian was one; it went extinct in 1898. Romanian survived and is today official in Romania, Moldova, and the autonomous province of Vojvodina in Serbia. Romanian speakers also form about two percent of the population in Italy, Spain, and Portugal, and the language is spoken by roughly five percent of the population of Israel as a native tongue.

    Sardinian presents a different case. Isolated on its island, it preserved a vowel system closer to Classical Latin than almost any other Romance variety. Where most languages merged Latin's ten vowels into seven or fewer, Sardinian retained a simple five-vowel system. The island's geographic remove slowed the pressures of change that reshaped the mainland tongues.

    Catalan, with more than four million native speakers, is official in Andorra and co-official with Spanish in Catalonia, the Valencian Community, and the Balearic Islands. It is also spoken in the city of Alghero on Sardinia. Galician, with more than three million speakers, holds official status in the autonomous community of Galicia alongside Spanish.

  • The UNESCO Red Book of Endangered Languages classifies virtually all minor Romance languages as threatened to some degree. Sicilian and Venetian rank as merely "vulnerable." Franco-Provencal and most Occitan varieties are listed as "severely endangered."

    National governments have historically viewed linguistic diversity as a liability rather than a heritage. Administrations suppressed minority languages by promoting official tongues, restricting their use in media, dismissing them as dialects rather than languages, and in some periods actively persecuting their speakers. The Occitan of southern France, for instance, lost steadily to the Parisian dialect that spread across the country after the invention of the printing press standardized written language within political boundaries.

    British Romance and African Romance, the forms of Vulgar Latin used in Britain and the Roman province of North Africa, vanished entirely during the Middle Ages. Moselle Romance in Germany disappeared too. Dalmatian, spoken along the Adriatic coast, died out in 1898. Mozarabic and Andalusi Romance, used on the Iberian Peninsula, went extinct in the late thirteenth century.

    Since the late twentieth century and into the early twenty-first, heightened sensitivity to minority rights has allowed some of these languages to begin recovering lost prestige. Asturian and Aragonese in Spain, Mirandese in Portugal, Friulian, Sardinian, and Franco-Provencal in Italy, and Romansh in Switzerland now enjoy some degree of regional legal recognition. Whether that recognition arrives early enough to reverse their decline remains genuinely open.

  • An eighth-century document known as the Reichenau Glossary preserves a remarkable record of linguistic drift. It compiled about 1,200 words from the fourth-century Vulgate Bible that had changed in form or fallen out of use, pairing each with its eighth-century equivalent in proto-Franco-Provencal. The gap between the two columns tells the story of Vulgar Latin's transformation in plain view.

    One of the most sweeping shifts was lenition, the "softening" of consonants in the speech of Western Romance. Single voiceless plosives became voiced between vowels: a hard "p" became a "b," a "t" became a "d," a hard "c" became a "g." In Spanish, those voiced consonants weakened further still, becoming fricatives or even disappearing. The Latin word vītam, meaning "life," becomes vita in Italian, vida in Portuguese, vida in Spanish, and simply vie in French, a four-step illustration of progressive consonant erosion across the family.

    The dividing line between Western Romance, where this softening occurred, and Eastern and Central Romance, where it largely did not, is called the La Spezia-Rimini Line. That boundary, running across northern Italy, is one of the most important isogloss bundles in the Romance dialect area.

    Vowels underwent equally dramatic reorganization. Classical Latin had ten vowels distinguished by length: five short and five long. Vulgar Latin lost that distinction, merging pairs by quality rather than duration. The simplest outcome, preserved in Sardinian, produced a five-vowel system. Most other languages ended up with seven, though Spanish diphthongized its open-mid vowels into gliding sounds like "ie" and "ue," and Romanian developed a balanced seven-vowel system including the central vowels written as "a" with a breve and the letter "i" with a circumflex.

    French phonemicized yet another vowel length system around 1300 as a result of a specific sound change involving consonant clusters, a sign that the language was still actively reshaping itself well into the medieval period.

  • Several dozens of creole languages descended from French, Spanish, and Portuguese have developed in former European colonies, some of them now serving as national languages or lingua francas. Haitian Creole is one of Haiti's two official languages and the majority native language. Cape Verdean Creole is Cape Verde's national language and lingua franca. Seychellois Creole holds official status in the Seychelles. Papiamento, which draws on Portuguese, is an official language of the Dutch Antilles.

    The Romance family also inspired a different kind of invention: constructed languages. In 1903, Italian mathematician Giuseppe Peano developed Latino sine flexione, a simplified Latin intended as a naturalistic international language. He chose Latin as the base because, as he described it, Latin had been the international scientific language until the end of the eighteenth century. Interlingua, developed in 1951, became the most famous of these efforts. Earlier projects included Idiom Neutral in 1902 and Interlingue-Occidental in 1922. Lingua Franca Nova followed in 1998. All of them aimed at creating a shared vocabulary recognizable across the living Romance languages.

    Some amateur linguists took an even more speculative path, constructing Romance languages that mirror real languages which developed from other ancestral languages. Brithenig mirrors Welsh. Breathanach mirrors Irish. Wenedyk mirrors Polish. These inventions, though never spoken natively, reveal something genuine: the Romance family is coherent enough as a system that linguists can simulate parallel evolutions from it.

    The 880 million native speakers counted around 2020 divide roughly as follows: Spanish accounts for 54 percent, Portuguese for 26 percent, French for 9 percent, Italian for 7 percent, and Romanian for 3 percent. Catalan represents about half a percent, with the remaining three percent spread across dozens of other varieties, nearly all of whose speakers are also bilingual in one of the national languages.

Common questions

What are the Romance languages and where did they come from?

The Romance languages are the languages that directly descended from Vulgar Latin, the spoken language of the Roman Empire. They are the only extant subgroup of the Italic branch of the Indo-European language family. The five most widely spoken by native speakers are Spanish, Portuguese, French, Italian, and Romanian.

How many people speak Romance languages worldwide?

There are more than 900 million native speakers of Romance languages worldwide. Spanish accounts for roughly 54 percent of that total, with about 475 million native speakers, followed by Portuguese at around 230 million and French at around 80 million.

Why are Romance languages spoken in so many countries outside Europe?

Romance languages spread globally because of European colonialism beginning in the fifteenth century. Portugal, Spain, and France established colonial empires that carried their languages to the Americas, Africa, and Asia. Today roughly two-thirds of all Romance language speakers live outside Europe.

Which Romance language has changed the least from Latin?

By most measures, Sardinian and Italian have changed the least from Latin. Sardinian in particular preserved a simple five-vowel system close to the original Latin vowel inventory. French has changed the most among the major Romance languages.

When did Latin split into the Romance languages?

Latin reached a stage of widespread innovation around the sixth and seventh centuries, after which speakers of the Romanized regions could no longer understand texts read aloud to them in Latin. By the eighth and ninth centuries, Romance had effectively replaced spoken Latin. The Council of Tours in 813 formally acknowledged this, ordering priests to deliver sermons in the rustica romana lingua rather than classical Latin.

Are any Romance languages endangered?

Yes. The UNESCO Red Book of Endangered Languages classifies virtually all minor Romance languages as threatened. Sicilian and Venetian are listed as vulnerable, while Franco-Provencal and most Occitan varieties are listed as severely endangered. Some varieties, such as Dalmatian, have already gone extinct, with its last recorded death in 1898.

All sources

83 references cited across the entry

  1. 2bookEthnologue : Languages of the WorldEthnologue — 2009-05-30
  2. 3webLatin
  3. 5bookVulgar LatinJózsef Herman et al. — Pennsylvania State University Press — 2000
  4. 9bookLingüística RomânicaRodolfo Ilari — Ática — 2002
  5. 11bookVariation and Change in SpanishRalph Penny — Cambridge University Press — 2000
  6. 12bookDialectologyChambers Jack — Cambridge University Press — 2004
  7. 14harvnbSala, Posner,Sala, Posner,
  8. 16webLanguagesEuropean Union
  9. 17webUnión LatinaUNIÓN LATINA
  10. 18webOfficial LanguagesUnited Nations
  11. 19webCameroonuOttawa
  12. 21web¿Por qué hablamos español en Colombia?Marcela Hernández Chacón — Instituto Caro y Cuervo
  13. 23journalCommentary: Language Policy in Galicia, 1980-2020. An OverviewHenrique Monteagudo — 2024-03-08
  14. 26webEthnologueSIL Haley — 2022
  15. 30harvnbSala, Posner
  16. 31bookThe Languages and Linguistics of Europe: A Comprehensive GuideJohannes Kabatek et al.
  17. 32bookLas Lenguas románicas estándar: historia de su formación y de su usoMiguel Metzeltin
  18. 33bookErdély történetének atlasza Atlas of the History of TransylvaniaAndrás Bereznay — Méry Ratio — 2011
  19. 34journalLiteracy in the Roman Provinces: Qualitative and Quantitative Data from Central SpainLeonard A. Curchin — 1995
  20. 35bookRomance LanguagesMartin Harris et al. — Routledge — 2001
  21. 36bookThe Cambridge History of the Romance Languages: Volume 2Martin Banniard — Cambridge University Press — 2013
  22. 37bookVulgar LatinJozsef Herman — Penn State Press — 2010
  23. 38bookThe French language: past and presentGlanville Price — Grant and Cutler Ltd — 1984
  24. 39citationIntroductionCambridge University Press — 2010
  25. 40webThe Oxford Guide to the Romance LanguagesBarbara Frank-Job et al. — 2016
  26. 41bookLanguage Contact at the Romance–Germanic Language BorderLuc van Durme — Multilingual Matters — 2002
  27. 44webDie romanischen Sprachen – und ihre SprechergemeinschaftenThomas Krefeld — 26 February 2019
  28. 45journalSur la notion de frontière linguistique entre Limousin, Périgord, Poitou et AngoumoisJean-Christophe Dourdet — collectif Edinum — 2021
  29. 46bookA history of the Spanish languageRalph Penny — Cambridge University Press — 2014
  30. 48bookThe peoples of EuropeH. J. Fleure — Рипол Классик
  31. 49webHermathena1942
  32. 50bookHistorical Linguistics: A cognitive grammar introductionMargaret E. Winters — John Benjamins Publishing Company — 2020
  33. 54journalIl latino quale lingua ausiliare internazionaleGiuseppe Peano — 1903–1904
  34. 56webÞrjótrunn: A North Romance Language: HistoryHenrik Theiling — Kunstsprachen.de — 2007-10-28
  35. 57webRelay0/R – JelbazechSteen.free.fr — 2004-08-28
  36. 59bookVowel prosthesis in Romance: a diachronic studyRodney Sampson — Oxford University Press — 2010
  37. 60webThe Early History of Romance PalatalizationsMarcello Barbato — 20 June 2022
  38. 61webPalatalizations in the Romance LanguagesDaniel Recasens — 30 July 2020
  39. 62bookThe Cambridge Handbook of Romance LinguisticsGiovanna Marotta — Cambridge University Press — 2022
  40. 63journalCeltic lenition and Western Romance consonantsAndré Martinet — 1952
  41. 64bookComparative historical dialectology: Italo-Romance clues to Ibero-Romance sound changeThomas D. Cravens — John Benjamins Publishing — 2002
  42. 65bookDe la Latină la RomânăMarius Sala — Editura Pro Universitaria — 2012
  43. 66bookLa langue corseMarie-Josée Dalbera-Stefanaggi — Presses universitaires de France — 2002
  44. 67journalMetaphony and Two Models for the Description of Vowel SystemsJeffery W. Kaze — 1991
  45. 68webMetaphonyAndrea Calabrese
  46. 70journalContinuity and Innovation in Romance: Metaphony and Mass-Noun Reference in Spain and ItalyRalph Penny — 1994
  47. 72journalMetipsimus in Spanish and FrenchAurelio M. Espinosa — 1911
  48. 74webSicilian–English DictionaryItalian.about.com — 2010-06-15
  49. 82webOccitan–English DictionaryBeaumont — Freelang.net — 2008-12-16