Taiwanese Tone Sandhi and TTS Challenges

Tone sandhi is when the pitch pattern (tone) of a syllable changes when it is followed by another syllable in continuous speech.

How the MTL Writing System Simplifies Sandhi

The MTL (Modern Taiwanese Language) writing system is designed to simplify tone sandhi. It's sandhi-aware, meaning the way a multi-syllable word is written already shows the changed tone of the preceding syllables.

🔄 General Tone Change Rules (Word Level)

In continuous speech, the basic rule is that all syllables in a word change tone, **except** under specific conditions:

Default Rule and Exception

Default Change: All words change tone, following the standard Taiwanese Tone Circle (a set pattern of tone shifts).
Default Exception (No Change): A word's final syllable **does not change tone** if it:
1. Is the last word of a sentence.
2. Comes immediately before a punctuation mark.

Example: In the sentence "Lie karm u pid? U, goar u cidky," the words *pid*, *cidky*, and *U* keep their original tones because they mark the end of a phrase or sentence. The other words (*Lie*, *karm*, *goar*) change tones.

Specific Lexical and Structural Exceptions

1. Nouns (N) and Gerunds

Nouns and gerunds (verb forms used as nouns) generally DO NOT change tone.

The Big Exception: Nouns DO change tone when they are used **as adjectives** or **as measure words** (denoting a unit).

Example: In *Taioaan-laang* (Taiwanese person), the noun *Taioaan* (Taiwan) is acting as an adjective and **changes tone**.

2. Pronouns (r)

Pronouns (like *goar* "I", *lie* "you") DO change tone by default.

Exception: A pronoun DOES NOT change tone if the speaker pauses to **emphasize** it.

4. Words Ending in 'ar'

When adding the suffix 'ar' (like a diminutive), the front syllable usually changes tone (e.g., *"niaw" → "niau'ar"*).

Exception: If the front syllable has a **flat tone** (Tone 7), it **does not change tone** (e.g., *"te" → "te'ar"*).

🤖 Challenges for Text-to-Speech (TTS) Development

Advantages of the MTL System for NLP

The Biggest TTS Challenge

The most difficult technical challenge for a Taiwanese TTS system is accurately implementing the Noun/Adjective Exception (where the tone change depends on the word's function).

The Problem: The tone change of a word like *Taioaan* depends entirely on its function in the sentence: Noun (no change) vs. Adjective (change).

The Solution: The TTS system needs a robust Part-of-Speech (POS) Tagger to label every word in the input text as a Noun, Adjective, Verb, etc., before applying the tone rules.

The lack of established, high-quality Taiwanese POS tagging models and large, tagged datasets is currently the main bottleneck for building truly accurate Taiwanese TTS.

Taiwanese Tone Sandhi and Text-to-Speech (TTS) Challenges