StudySmarter - The all-in-one study app.
4.8 • +11k Ratings
More than 3 Million Downloads
Free
Americas
Europe
Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken
Jetzt kostenlos anmeldenIf you could guide a tiny microphone into one nostril, through the nasal cavity, into the pharynx, and all the way to the larynx, you would be able to record the pure, unfiltered sound of the voice. As uncomfortable as this sounds, linguists like the famous Peter Ladefoged have actually done it in order to study the source-filter model of speech production. This model is based on the theory that the vocal tract filters the original sound source of the voice for unique consonant and vowel production. A few definitions and examples will help you really understand source-filter theory without the need for linguists—both literally and figuratively—to shove the concept down your throat.
First formulated by the phoneticist Gunnar Fant in 1960,1 the source-filter model provides an acoustic description of speech. The idea is that a simple sound from the vocal folds has to travel through the throat and the mouth, passing all sorts of obstacles in order to produce a meaningful sound. Here's a summary of the key components of the source-filter model.
Running a recording of human speech (aka a speech signal) through a phonetic-analysis program presents you with a great deal of information and graphs. The source-filter model is important because it helps you interpret and analyze this information. It allows you to make predictions about the nature of a speech signal just by looking at it in a computer program.
The definition of source-filter theory is relatively simple:
Source-filter theory is theory of phonetics that describes speech as the sound produced by a source (usually the vocal folds) and modified by a filter (the vocal tract).
In other words, source-filter theory divides speech into two parts. One part creates a raw sound, and the other part acts as a tube to shape the sound as it passes through.
Playing the trumpet provides an example of source-filter theory in action.
In order to get a sound out of a trumpet, you have to purse your lips and blow air into the trumpet's mouthpiece. This is sometimes called "buzzing" your lips against the mouthpiece. This action creates the sound source.
The body of the trumpet acts as a filter for the sound source. It turns the simple buzz of the mouthpiece into an amplified, clear, recognizable trumpet sound.
Fig. 1 - "Buzzing" your lips against a trumpet's mouthpiece creates a sound source, which is then filtered by the body of the trumpet.
Remember that cruel and unusual method of recording the voice right at the level of the vocal folds? The resulting recording sounds like a simple buzzing sound—not unlike the buzz of a trumpet mouthpiece. If you're curious, you can look up the sound and hear it for yourself.
Now to translate the source-filter model from trumpets to the vocal tract. In most speech sounds, the sound source is the vibration of the vocal folds, and the filter is the remainder of the vocal tract.
The source of voiced speech sounds is the vibration of the vocal folds. The vocal folds are contained in the part of the larynx called the glottis.
The larynx, also known as the voice box, is an organ made of bone and tissue located at the center of the throat.
The glottis is the part of the larynx that contains the vocal folds.
The vocal folds make up the tissue membrane responsible for voicing in speech.
When you force air from your lungs through your closed glottis, you cause your vocal folds to vibrate. This vibration, called the glottal source wave, is the sound source for most speech sounds.
The glottal source wave is the sound source for most, but not all, speech sounds. The source for voiceless speech sounds that originate higher in the vocal tract is the constriction within the vocal tract. For example, when you produce the voiceless labiodental fricative [f], the sound source is air passing through the constriction between the lower lip and the upper teeth. The filter for this sound is very small because there isn't much in front of those structures to morph the sound.
On its way out into the world, the glottal source wave must pass through the filter: the vocal tract.
The vocal tract consists of all the speech organs from the larynx to the lips.
Fig. 2 - The sound source is the vibration of the vocal folds (glottis), and the filter is the vocal tract.
The primary organs in the vocal tract that are relevant to speech production are the epiglottis, pharynx, velum, tongue (tip, blade, body, and root), velum, alveolar ridge, hard palate, teeth, lips, and nasal cavity. Each of these organs can filter the sound from the glottal source wave.
Source-filter theory is also useful because it can help explain the key characteristics of vowels, specifically fundamental frequency (pitch) and formants.
The fundamental frequency (or pitch) of a voiced sound is the sound's primary audible frequency.
Formants are amplified frequency ranges that distinguish one vowel from another.
When you sing an A at 440 Hz, the fundamental frequency of your voice is 440 Hz. The fundamental frequency of a speech sound depends on the speed of vocal fold vibration and on the length of the filter.
Frequency is usually measured in hertz (Hz), which means cycles per second. If the wave from your voice has a fundamental frequency of 440 Hz, then the wave repeats its pattern 440 times every second. You can raise the fundamental frequency of your voice either by raising your larynx to shorten the vocal tract or by pushing air through your vocal folds with more pressure, forcing them to vibrate faster.
Formants are bands of loud frequencies that characterize unique vowels. They are categorized into three bands:
F1, the lowest formant, correlates to vowel height. The lower the F1 frequency, the higher the vowel. For example, the vowels with the lowest F1 values are the high vowels, like [i] as in beet and [u] as in boot.
F2, the next lowest formant after F1, correlates to vowel backness. The lower the F2 frequency, the further back the vowel. For example, the vowels with the lowest F2 values are the back vowels, like [o] as in boat and [ɑ] as in bot.
F3, the highest formant relevant to speech sounds, correlates to certain vowel constrictions, especially r-colored vowels like the [ɹ] in the General American pronunciation of bird. This sound has a relatively low F3 value.
What does this have to do with source-filter theory? According to the source-filter model, formants are determined by the shape of the filter.
The low back vowel [ɑ] includes a constriction, or area of tension, at the pharynx. This constriction separates the vocal tract filter into a short tube from the larynx to the pharynx and a long tube from the pharynx to the lips. This filters the glottal source wave into a sound with a high F1 and a low F2, which you perceive as the [ɑ] vowel.
You can see examples of the acoustic effects of the source and filter by looking at a sound spectrum.
A sound spectrum is a plot of the simple wave components of a complex wave.
The sound spectrum is like a snapshot of a vowel at a single point in time. It shows all of the frequencies present in the wave (on the x-axis) and the amplitude, or loudness, of each frequency (on the y-axis).
Remember that the sound source helps to determine the fundamental frequency of the wave. The frequencies on the x-axis are evidence of the glottal source wave. These frequencies look like a row of "spikes" on the spectrum.
The first big spike on the sound spectrum is the wave's fundamental frequency.
The filter determines a vowel's formants. Formants are characterized by the loudest frequencies in the wave. On the sound spectrum, the curvy pattern in amplitude at the top of the spikes provides evidence of the filter.
Fig. 3 - The sound source provides a compilation of frequencies. The filter changes the amplitudes of the source's frequencies.
The most important points of this example of the source-filter theory are these:
A change in the number of frequency spikes on the sound spectrum represents a change in the sound source.
A change in the curvy amplitude shape on top of the frequency spikes represents a change in the filter.
The source of voiced speech sounds is the vibration of the vocal folds. The vocal folds are contained in the part of the larynx called the glottis.
On its way out into the world, the glottal source wave must pass through the filter: the vocal tract
Source-filter theory is a theory of phonetics that describes speech as the sound produced by a source (usually the vocal folds) and modified by a filter (the vocal tract).
First thought up by the phoneticist Gunnar Fant in 1960,1 the source-filter model provides an acoustic description of speech. The idea is that a simple sound from the vocal folds has to travel through the throat and the mouth, passing all sorts of obstacles, in order to produce a meaningful sound.
Running a recording of human speech through a phonetic analysis program presents you with a great deal of information and graphs. The source-filter model is important because it helps you interpret and analyze this information. It allows you to make predictions about the nature of a speech signal just by looking at it in a computer program.
Be perfectly prepared on time with an individual plan.
Test your knowledge with gamified quizzes.
Create and find flashcards in record time.
Create beautiful notes faster than ever before.
Have all your study materials in one place.
Upload unlimited documents and save them online.
Identify your study strength and weaknesses.
Set individual study goals and earn points reaching them.
Stop procrastinating with our study reminders.
Earn points, unlock badges and level up while studying.
Create flashcards in notes completely automatically.
Create the most beautiful study materials using our templates.
Sign up to highlight and take notes. It’s 100% free.
Over 10 million students from across the world are already learning smarter.
Get Started for Free