MP3 File Compression - What is MP3 Compression
Audio files with no compression, as found on a normal audio cd, can be very large - around 10 MB per minute of stereo sound.
Without a way to compress this data, portable audio players such as MP3 would probably not exist.
Fortunately, with today's technology, there are many ways to compress digital data. computer data files, for instance, can be
compressed with the popular ZIP format. The resultant zip file is normally much smaller than the original file and can be unzipped so that the
original file is intact.
Compression formats which retain all of the original data are called 'lossless'. Certain types of files like audio, video and
graphics can also be compressed by discarding some of the original data. This form of compression is called lossy compression.
MP3 compression is a lossy format as there is no way to reconstruct an MP3 file to its original sonic quality. However, the major
advantage in this is a much smaller file size.
One of the main advantages of MP3 files is that they still retain acceptable audio quality, even after compression, while greatly
reducing the size of the file. This is accomplished by discarding certain parts of the audio spectrum which are considered to be less important.
This includes higher frequencies from the audio spectrum and sounds which may be hidden behind louder sounds.
Audio compression depends a lot on something called psychoacoustics which is the study of how sound is perceived by the ear and
by the brain. The MP3 audio compression codec uses a model that determines how a particular audio file is heard by the average person. This model
can decide how much of the audio data can be removed while still retaining an acceptable sound.
There are a number of compression schemes, and each of them has their own model for figuring which parts of the audio spectrum
are removed. This results in different sound quality between MP3, WMA, Ogg Vorbis and AAC formats. The actual sound quality is subjective as some
people may favour one format over another.
MP3 Compression - Bitrate
Besides the actual algorithm used for audio compression, the bitrate has the biggest influence on sound quality. Bitrate
represents the amount of streaming audio data and is expressed as kilobits per second. The standard bitrate for MP3 files is 128 kb per second,
and this represents a good compromise between sound quality and file size.
Higher bitrates remove less data and consequently have a higher quality sound. WMA and AAC can realize comparable sound quality
to MP3 at lower bitrates, so these formats generate smaller file sizes.
In the early years of MP3, all files were encoded at a constant bitrate or CBR. This means that a bitrate of 128 kb per second
(for example) is used for every section of the music no matter whether it was silent or loud. Musical passages which are relatively dense,
however, have more audio data than quiet passages, so variable bitrates or VBR can be used to achieve a higher quality sound. As the music
becomes more complex, the bitrate rises, and conversely when the music thins out, a lower bitrate is utilised.
When you are encoding your own music files it is a good practice to try out a variety of bitrates and file formats to see what
works best to you.
You should also bear in mind, that certain kinds of music such as classical, pop or jazz, may sound better with a particular
codec. For instance, it is reported that the Ogg Vorbis codec is ideal for classical music.
|