Chinese Tones Explained: The Complete Beginner's Guide
Master Mandarin's 4 tones with clear explanations, real-world examples, and proven practice strategies. Whether you're just starting out or need to fix bad tone habits, this guide covers everything from the basics to tone sandhi rules.
Last updated: February 2026
Mandarin Chinese has 4 tones plus a neutral tone. First tone (flat high), second tone (rising), third tone (dipping), fourth tone (falling). Tones change the meaning of words — 'mā' (mother) vs 'mǎ' (horse). Most learners master tones within 2-3 months of consistent practice.
Imagine walking into a restaurant in Beijing and confidently asking for "soup" — but everyone stares at you because you accidentally said "sugar" instead. Or worse, you try to call your mother (mā) and end up saying "horse" (mă). Welcome to the world of Chinese tones, where the pitch of your voice determines the meaning of every single syllable you speak.
If you are new to Mandarin Chinese, tones probably feel like the most intimidating part of the language. Every beginner guide mentions them, every teacher warns you about them, and every learner has at least one embarrassing tone mistake story. The syllable "shī" can mean "poem," "ten," "to make," or "yes" depending on which tone you use. Get the tone wrong, and you are saying a completely different word.
But here is the good news: tones are absolutely learnable. Millions of non-native speakers master them every year, and you do not need to be musically gifted to do it. In fact, you already use tones in English — you just do not think about them. When you say "Really?" with a rising pitch, that is a tone. When you say "No!" with a sharp falling pitch, that is a tone too. Chinese simply uses these pitch patterns to distinguish individual words rather than sentence-level meaning.
This guide walks you through everything you need to know about Chinese tones: what they are, how to produce each one, the tricky tone sandhi rules that trip up beginners, common mistakes and how to fix them, and practical strategies to build solid tone habits from day one. By the end, tones will feel far less mysterious — and far more manageable — than they do right now.
Try HSK 1 Flashcards
Tap a card to reveal its meaning
What Are Tones?
A tone is the pitch contour — the rise, fall, or sustained level of your voice — applied to a syllable. In Mandarin, there are four main tones and one neutral (or "light") tone. Each tone follows a specific pitch pattern, and changing that pattern changes the word entirely. The syllable "ma" is five completely different words depending on which tone you use: mother, hemp, horse, scold, or a question particle.
English speakers sometimes think they "do not use tones," but that is not quite right. English uses pitch to convey emotion and sentence-level meaning all the time. When you say "You're leaving?" with a rising pitch at the end, the rise signals a question. When you say "You're leaving." with a falling pitch, it becomes a statement. The difference is that English uses pitch to change the grammar or emotion of a whole sentence, while Chinese uses pitch to change the meaning of individual words. In Chinese, the pitch pattern is baked into each word the same way a consonant or vowel is — it is not optional, it is part of the word itself.
Mandarin has approximately 400 unique syllables (not counting tones). With four tones plus a neutral tone, that expands to roughly 1,600 possible syllable-tone combinations. This is still far fewer distinct sounds than English has, which is why Chinese relies so heavily on tones to differentiate words. Without tones, you would have massive ambiguity — dozens of common words sharing the same pronunciation.
The 4 Tones of Mandarin
Each tone is marked in pinyin (the romanization system for Mandarin) with a diacritical mark placed over the main vowel. Learning to recognize these marks and associate them with the correct pitch pattern is one of the first skills you will develop. Let us go through each tone in detail.
First Tone (¯) — High and Flat
The first tone is a high, steady pitch that stays flat from beginning to end. Think of it like singing and holding a single high note without letting your voice waver up or down. The pitch should be at the top of your comfortable speaking range — not strained or falsetto, but noticeably higher than your normal conversational pitch.
The most common mistake with the first tone is letting it drift downward. English speakers naturally let their pitch drop at the end of words, but for the first tone you need to keep it level the entire time. Imagine a flat line drawn across the top of a pitch chart — that is your target. It helps to hum the note before adding the consonant and vowel so you can feel the steady pitch in your throat.
Written mark: a flat line over the vowel (¯)
Examples: mā (mother), tā (he/she), bā (eight), fēi (to fly), gāo (tall/high), shī (poem), tiān (sky/day)
Practice tip: Say "ohhhhh" like you are at the doctor's office holding a steady note. That sustained, unwavering pitch is what you want for the first tone.
Second Tone (´) — Rising
The second tone starts at a middle pitch and rises sharply to the top. The best English analogy is the sound you make when you say "Huh?" or "What?" in surprise — that quick upward inflection is exactly the pitch contour of the second tone. It should feel like your voice is climbing a slope from mid-range to high.
Many beginners do not rise enough with the second tone. They produce a slight upward movement that sounds tentative rather than decisive. The rise needs to be clear and strong — think of it as an upward sweep that covers a wide pitch range, not a gentle nudge. If you end the syllable at the same pitch where the first tone sits, you are doing it right.
Written mark: a rising stroke over the vowel (´)
Examples: má (hemp), ná (take), lái (come), rén (person), shí (ten), méi (not have), yáng (sheep)
Practice tip: Ask "Really?" with exaggerated surprise. Feel how your voice sweeps upward? That is the second tone. Apply that same rising energy to Chinese syllables.
Third Tone (ˇ) — Dipping
The third tone is the one that causes learners the most trouble, partly because the way it is taught differs from the way it is actually spoken. In textbooks, the third tone is described as a "dipping" tone: it starts at a mid-low pitch, drops to the bottom of your range, and then rises back up. The tone mark reflects this full dip-and-rise shape. However, in natural speech, the third tone is often realized as just a low, creaky tone — it dips down and stays low without the final rise, especially when followed by another syllable.
The full dip-and-rise pattern (sometimes called the "full third tone") only appears in isolation or at the very end of a phrase. In connected speech — which is how you will be using it 90% of the time — just focus on making it low. Think of it as the "low tone." Drop your pitch to the very bottom of your range and let it sit there with a slightly creaky quality, like the sound of a door slowly creaking open. This low, gravelly quality is the most distinctive feature of the third tone.
Written mark: a caron (ˇ) over the vowel, shaped like a "v"
Examples: mă (horse), nǐ (you), hăo (good), xiăo (small), lăo (old), yǐ (also/already), măi (buy)
Practice tip: Say "well..." in a low, drawn-out, slightly doubtful way — like you are not sure you agree with something. That low, creaky pitch is the core of the third tone. Do not worry about the rising part at the end; in context it barely happens.
Fourth Tone (̀) — Falling Sharp
The fourth tone is a sharp, decisive drop from high to low. It starts at the top of your pitch range and falls rapidly and forcefully to the bottom. Think of saying "No!" firmly, or giving a sharp command like "Stop!" That authoritative, downward-slicing pitch is the fourth tone.
The fourth tone is often considered the easiest for English speakers because we naturally use falling pitch for emphatic statements. The key is making the fall dramatic enough — it should be a steep drop, not a gentle slope. Start high and cut down decisively. The whole syllable should feel short and punchy compared to the other tones.
Written mark: a falling stroke over the vowel (̀)
Examples: mà (scold), dà (big), shì (is/yes), zài (at/in), kàn (look), là (spicy), qiì (four)
Practice tip: Say "Bah!" in an exasperated tone, starting high and dropping sharply. That strong downward motion is the fourth tone. You can also try counting "one, two, three, FOUR!" — the way most people say "four" in a count has a natural falling pitch.
Neutral Tone — Light and Short
The neutral tone (sometimes called the fifth tone or the "light tone") is not a separate tone in the same way as tones 1 through 4. Rather, it is the absence of a strong tone — the syllable is pronounced lightly, quickly, and at a pitch that depends on the tone of the preceding syllable. Neutral-tone syllables are short and unstressed, almost like they are tacked onto the end of the previous word without their own melodic identity.
Neutral tone appears most commonly in grammatical particles and the second syllable of certain common words. You will encounter it frequently in particles like "ma" (the question marker), "de" (the possessive particle), and "le" (the change-of-state particle). It also shows up in words like "māma" (mother, where the second "ma" is neutral) and "bàba" (father).
Written mark: no mark in pinyin
Examples: ma (question particle), de (possessive particle), le (change-of-state particle), ba (suggestion particle), ne (continuation particle)
Tone Pairs: The Real Challenge
Here is something that most textbooks underemphasize: pronouncing individual tones in isolation is relatively straightforward. The real difficulty begins when you start combining tones into words and sentences. A two-syllable word requires you to transition smoothly from one pitch pattern to another, and certain combinations feel far more natural than others.
There are 16 possible tone pair combinations (4 tones × 4 tones), plus additional combinations involving the neutral tone. Some of these pairs — like first tone + first tone or fourth tone + fourth tone — are relatively intuitive. Others, like second tone + third tone or third tone + third tone, can feel awkward because your voice has to navigate pitch contours that do not flow easily into each other. The key to natural-sounding Mandarin is drilling these tone pairs until the transitions become automatic.
The following table shows the most common tone pair combinations with example words. Practice each pair multiple times, paying attention to how your pitch transitions from the first syllable to the second:
| Tone Pair | Example Word | Pinyin | English |
|---|---|---|---|
| 1 + 1 | 今天 | jīntiān | today |
| 1 + 2 | 中国 | Zhōngguó | China |
| 1 + 3 | 喝水 | hēshuǐ | drink water |
| 1 + 4 | 工作 | gōngzuò | work |
| 2 + 1 | 学生 | xuéshēng | student |
| 2 + 3 | 银行 | yínháng | bank |
| 2 + 4 | 结果 | jiéguǒ | result |
| 3 + 1 | 北京 | Běijīng | Beijing |
| 3 + 2 | 美国 | Měiguó | America |
| 3 + 3 | 你好 | nǐhǎo (→ níhǎo) | hello |
| 3 + 4 | 紫色 | zǐsè | purple |
| 4 + 1 | 大家 | dàjiā | everyone |
| 4 + 2 | 自己 | zìjǐ | oneself |
| 4 + 3 | 电脑 | diànnǎo | computer |
| 4 + 4 | 再见 | zàijiàn | goodbye |
When practicing tone pairs, do not just read the words — say them aloud repeatedly. Focus on the transition between the first and second syllable. Where does your voice need to be at the end of the first syllable to set up the beginning of the second? For example, in a 4+2 pair, your voice drops sharply on the first syllable and then needs to start mid-range and rise for the second. That rapid recovery from low to mid is what makes some pairs physically challenging at first.
Practice Chinese Tones with Flashcards
HSK Lord's spaced repetition system includes audio for every word — perfect your tones naturally.
Start Practicing FreeTone Sandhi Rules
Just when you think you have the four tones figured out, Chinese throws you a curveball: some tones change depending on the tones around them. These changes are called "tone sandhi" (sandhi comes from a Sanskrit word meaning "joining"). Tone sandhi is not optional or casual — it is a required part of standard pronunciation. Native speakers apply these rules automatically without thinking about them, and you will too once they become habitual.
Rule 1: Third Tone + Third Tone → Second Tone + Third Tone
This is the most important and most common tone sandhi rule. When two third-tone syllables appear in sequence, the first one changes to a second tone. You have already seen this in the tone pairs table: 你好 is written as "nǐ hǎo" in pinyin (both third tone), but is actually pronounced "ní hǎo" (second tone + third tone).
Other examples: 你好 (nǐhǎo → níhǎo, "hello"), 水果 (shuǐguǒ → shuíguǒ, "fruit"), 很好 (hěnhǎo → hénhǎo, "very good"). The pinyin in textbooks and dictionaries typically writes the original tone (third + third), and you are expected to know that the first one becomes a second tone when speaking. This can be confusing at first, but it quickly becomes second nature.
Rule 2: 不 (bù) Before a Fourth Tone
The character 不 (bù, "not/no") is normally pronounced with a fourth tone. However, when it appears directly before another fourth-tone syllable, it changes to a second tone: bù → bú. For example: 不是 is written "bù shì" but pronounced "bú shì" ("is not"). Similarly, 不对 (bù duì → bú duì, "not correct") and 不会 (bù huì → bú huì, "will not").
Before tones 1, 2, and 3, 不 keeps its original fourth tone. So 不看 (bù kān, "not look") stays as fourth tone because 看 is first tone. The rule only triggers before another fourth tone.
Rule 3: 一 (yī) Tone Changes
The character 一 (yī, "one") is the most tonally flexible word in Mandarin. Its base tone is first tone (yī), and it keeps this tone when used in isolation, when counting, or when it appears at the end of a phrase. However, it changes tone based on what follows it:
- Before a fourth tone: 一 changes to second tone (yí). Example: 一样 (yíyàng, "the same"), 一定 (yídìng, "definitely")
- Before tones 1, 2, or 3: 一 changes to fourth tone (yì). Example: 一天 (yìtiān, "one day"), 一直 (yìzhí, "always"), 一起 (yìqǐ, "together")
- Between two words (as "one"): 一 is pronounced with neutral tone (yi). Example: 看一看 (kànyikàn, "take a look")
This might seem like a lot to remember, but in practice the 一 tone changes become automatic quickly because you encounter this character so frequently. Most learners internalize the patterns within the first few months of study without deliberate memorization.
Common Tone Mistakes (and How to Fix Them)
After working with thousands of Chinese learners, certain tone mistakes come up again and again. Recognizing these patterns in yourself is the first step to correcting them. Here are the most frequent issues and how to address each one:
Mixing Up Second and Third Tones
This is the single most common tone error among English-speaking learners. The second tone (rising) and the third tone (dipping) can sound deceptively similar to untrained ears, especially when the third tone is produced with a slight rise at the end. The key difference is the starting point: the second tone starts at a middle pitch and rises, while the third tone starts low and stays low (or dips even lower before a slight rise). If you are struggling with this distinction, focus on the beginning of the syllable rather than the end. The third tone should start noticeably lower in your vocal range than the second tone.
Fix: Practice minimal pairs back to back. Say "má" (hemp, second tone) and then immediately say "mǎ" (horse, third tone). Exaggerate the difference — make the second tone rise dramatically, and make the third tone drop to the very bottom of your range. Record yourself and listen back. If you cannot hear the difference in your own recordings, keep exaggerating until you can.
Flat-Lining (Not Differentiating Enough)
Many beginners produce all four tones in a narrow pitch range, resulting in tones that technically go in the right direction but sound too similar to each other. This is sometimes called "flat-lining" or "mumble tones." It happens because English uses a relatively narrow pitch range in everyday speech, and learners carry that habit into Chinese.
Fix: Deliberately exaggerate your tones during practice sessions. Make the first tone absurdly high, the fourth tone an absurdly sharp drop, the second tone an absurdly high rise, and the third tone an absurdly low growl. This feels ridiculous at first, but it trains your vocal muscles to use a wider pitch range. Over time, you can dial back the exaggeration while maintaining clear differentiation. Native speakers will understand exaggerated tones perfectly — they will not understand flat ones.
Overthinking Instead of Feeling
Some learners get so caught up in the mechanics of tones — "start high, go flat, keep it level" — that they lose all natural rhythm and fluency. They pause before each syllable to mentally calculate the tone, producing speech that is technically correct but sounds robotic and unnatural.
Fix: Shift from thinking about tones to feeling them. Associate each tone with a physical sensation or emotional expression rather than a set of instructions. First tone = serenity (calm, level hum). Second tone = surprise ("Really?"). Third tone = dissatisfaction (low grumble). Fourth tone = anger ("No!"). When you connect tones to emotions and physical sensations rather than abstract rules, production becomes faster and more natural.
Not Practicing Tone Pairs
Many learners practice individual tones (mā, má, mǎ, mà) and then move on, thinking they have "learned the tones." But as we discussed above, the real challenge is combining tones into pairs and longer sequences. Practicing isolated tones without pair drills is like learning to dribble a basketball without ever practicing while moving — the skill does not transfer to real-game situations.
Fix: Dedicate specific practice time to tone pairs. Take the table above and drill each combination with real words. Say each word five times in a row, focusing on smooth transitions. Then mix the pairs randomly so you are not just building muscle memory for one transition at a time. Apps with audio flashcards — like HSK Lord — are particularly effective for this because every word comes with native-speaker audio that models correct tone pair production.
How to Practice Tones Effectively
Knowing the theory of tones is only half the battle. Building accurate tone production requires deliberate practice with the right techniques. Here are the most effective strategies, ordered from foundational to advanced:
Listen and Repeat with Native Audio
This is the most important practice method and should be a daily habit from day one. Listen to a native speaker say a word, then immediately repeat it, trying to match their pitch pattern as closely as possible. Do not just listen passively — active repetition is what builds the neural pathways for tone production. Start with single syllables, progress to two-syllable words, then to short phrases and sentences. Spaced repetition flashcard apps with audio are ideal for this because they provide hundreds of repetition opportunities across your entire vocabulary.
Record Yourself and Compare
Recording yourself is uncomfortable but incredibly effective. Play a native audio clip, record yourself saying the same word, and then compare the two recordings. You will immediately notice discrepancies that you cannot hear in real-time while speaking. Pay special attention to the starting pitch of each tone (is your third tone starting low enough?), the contour (is your second tone rising enough?), and the ending pitch (is your first tone staying level?). Doing this for just 5 minutes per day produces rapid improvement.
Exaggerate at First, Then Normalize
When learning any physical skill, exaggeration helps build correct motor patterns. Singers exaggerate vowel shapes, athletes exaggerate technique in slow motion, and Chinese learners should exaggerate tones. Make your first tone absurdly high and flat. Make your fourth tone drop like a stone. Make your third tone a deep, gravelly rumble. This is not how you will speak in conversation — it is training that ensures your "normal" tones are clear enough to be understood. As the exaggerated versions become easy, gradually reduce to a natural speaking level.
Practice Minimal Pairs
Minimal pairs are words that differ by only one tone. Practicing them trains your ear and mouth to distinguish and produce the exact difference between tones. The classic set is mā/má/mǎ/mà (mother/hemp/horse/scold), but you should also practice pairs like shī/shí/shǐ/shì (poem/ten/history/yes) and bā/bá/bǎ/bà (eight/pull/handle/father). Say each set rapidly, transitioning between tones without pausing. Then have someone quiz you — can you identify which tone they are saying?
Use Spaced Repetition with Audio
Spaced repetition systems (SRS) are proven to be the most efficient method for long-term vocabulary retention, and when the flashcards include native audio, they double as tone practice. Every time you review a flashcard with audio, you hear the correct tone and can say the word aloud before checking the answer. Over hundreds of reviews, the correct tone for each word becomes deeply ingrained. This is one of the key advantages of using an audio-enabled flashcard app over pure text-based study — you are training your ear and your voice simultaneously with every review session.
Why Tones Aren't as Hard as You Think
After reading all the rules, exceptions, and common mistakes, you might be feeling more intimidated about tones than when you started. Let us bring some perspective back. Tones are genuinely challenging, but they are not the insurmountable barrier that internet forums and social media make them out to be. Here is why:
Context Does Most of the Heavy Lifting
In real conversations, context disambiguates the vast majority of potential tone confusions. If you are in a restaurant talking about food and you say "tāng" with a slightly off tone, everyone knows you mean soup (汤, tāng) and not sugar (糖, táng) because of the conversational context. Linguists estimate that even with significantly imperfect tones, context resolves the correct meaning 90% or more of the time. This does not mean you should ignore tones — clear tones make communication faster and smoother — but it does mean that imperfect tones rarely cause total communication breakdown.
You Already Use Tone in English
English speakers are not starting from zero with pitch variation. You already modulate pitch dozens of times in every sentence — for emphasis, questions, sarcasm, excitement, boredom, and more. The neural machinery for pitch control is already developed in your brain. Learning Chinese tones is not about building a new ability from scratch; it is about repurposing an existing ability for a new function. That is a much smaller learning task than it initially appears.
Most Learners Get Functional Within 2-3 Months
Research on second-language tone acquisition shows that most adult learners achieve functional tone accuracy — meaning they can be understood by native speakers in normal conversation — within 2 to 3 months of consistent practice. This is not native-level perfection, and certain difficult combinations will take longer, but it is enough to communicate effectively. The initial period of tone confusion is temporary. Every learner who sticks with it passes through it and comes out the other side with ears that hear tones and a voice that produces them.
The biggest risk with tones is not that they are too hard to learn — it is that learners give them insufficient attention at the beginning and build bad habits that take much longer to correct later. Investing time in tones during your first few months of Chinese study is the single highest-return investment you can make. Get tones approximately right early, and everything else — vocabulary, grammar, listening comprehension — builds on that solid foundation.
Ready to start building your Chinese vocabulary with correct tones? Start with HSK Lord today and hear every word pronounced by native speakers with clear, accurate tones.
Free Tone Practice Guide
Download our tone pair practice sheet with the 50 most important tone combinations for beginners.
No spam. Unsubscribe anytime.
Frequently Asked Questions
Related Articles
Related Articles
Start Mastering Chinese Tones Today
HSK Lord's audio-enabled flashcards let you hear and practice every tone. Free for 30 days, no credit card required.
Start Free →Share this article: