Capo
نظرية الموسيقى

Why Slowing Audio Doesn't Have to Sound Like a Chipmunk

فريق Capoقراءة 3 دقيقة

Slow a record down on a turntable and everything drops into a low groan. Speed a tape up and voices turn into chipmunks. For most of audio history, speed and pitch were two ends of the same lever — you couldn't move one without the other. Modern audio software broke that link, and understanding how explains why a tool like Capo can slow a song to a crawl while it still sounds completely natural.

Speed and pitch used to be locked together. The trick is keeping them apart.

Why speed and pitch were always linked

Sound is a wave. Its pitch is how often that wave repeats per second — its frequency. The naïve way to slow audio is to play its samples back more slowly, which stretches every wave out in time. A stretched wave repeats less often, so its frequency drops, and with it the pitch. Play it faster and the opposite happens. That's not a bug; it's just geometry. Slowing the playback is lowering the pitch, when you do it the simple way.

This linked behaviour has a name: varispeed — the classic tape-machine effect.

To change speed without changing pitch, software uses time-stretching. Instead of stretching the waves themselves, it chops the audio into very short grains and intelligently repeats or removes them — making the sound last longer or shorter while each individual wave keeps its original frequency. The pitch stays exactly where it was; only the timeline changes.

The reverse — pitch-shifting — changes the frequency while keeping the duration the same, which is what lets Capo move a song up or down in pitch and key without it speeding up.

Formants: why a shifted voice still sounds human

There's a subtler detail. A voice isn't a single frequency; it has formants — fixed resonances, set by the size and shape of someone's throat and mouth, that give it its character. Shift pitch carelessly and you drag the formants along too, which is what makes a pitched-up voice sound cartoonish. Quality pitch-shifting treats pitch and formants separately, so a transposed singer still sounds like a person, just higher or lower.

What this means in practice

Because Capo keeps speed and pitch as independent controls, you get the useful half of varispeed with none of the side effects:

The old lever is now two separate dials — which is exactly what you want when you're trying to understand a recording rather than warp it.

Hear the difference for yourself — free.

Chrome وFirefox · v2.5 · مجاني، دون حساب
أضِفه إلى Chrome — مجانًا

أسئلة

Why does slowing audio down normally make it sound deep?
Because the simplest way to slow audio is to play its samples back more slowly, which stretches every sound wave — lowering its frequency, and therefore its pitch. Speed and pitch move together unless something keeps them apart.
What is varispeed?
Varispeed is the classic tape-machine behaviour where changing the speed also changes the pitch, the two linked together. Slow the tape and everything drops; speed it up and it rises. It's a real effect with musical uses, but it's not what you want when you're trying to keep a track sounding natural.
What is time-stretching?
Time-stretching changes how long audio takes to play without changing its pitch, by intelligently repeating or compressing tiny grains of sound. It's what lets Capo slow a song to a quarter speed while the singer still sounds like themselves.
What are formants and why do they matter?
Formants are the resonant frequencies that give a voice or instrument its character. Good pitch-shifting preserves them, which is why a transposed voice still sounds human rather than cartoonish.

تابِع القراءة

ضع Capo على كل ما تستمع إليه.

أضِفه إلى Chrome — إنه مجانيمتوفّر أيضًا على Firefox · v2.5 · دون حساب