Dev ToolsMarch 30, 2026

Audio File Size Calculator Guide: Bitrate, Format & Storage Planning

By The hakaru Team·Last updated March 2026

Quick Answer

  • *Uncompressed formula: File size (bytes) = Sample Rate × Bit Depth × Channels × Duration ÷ 8
  • *A 3-minute stereo WAV at 44.1 kHz / 16-bit is about 30 MB; the same song as a 320 kbps MP3 is about 7 MB.
  • *A 1-hour podcast at 128 kbps mono MP3 is approximately 56 MB.
  • *Use WAV or FLAC for archiving and editing; MP3 or AAC for distribution and streaming.

How Audio File Size Works

Every digital audio file is made up of samples — snapshots of a sound wave captured at regular intervals. The number of samples per second (sample rate), the precision of each snapshot (bit depth), the number of audio channels, and the duration of the recording all determine how large the resulting file will be.

For uncompressed formats like WAV and AIFF, the math is straightforward. For compressed formats like MP3 and AAC, the encoder discards data that human ears are less likely to perceive, resulting in dramatically smaller files with minimal perceived quality loss.

The Audio File Size Formula

Uncompressed Audio (WAV, AIFF, PCM)

File Size (bytes) = Sample Rate × Bit Depth × Channels × Duration (seconds) ÷ 8

Example: a 3-minute stereo track recorded at the standard CD quality of 44,100 Hz and 16-bit:

44,100 × 16 × 2 × 180 ÷ 8 = 31,752,000 bytes ≈ 30.3 MB

Sample RateBit DepthChannelsSize per Minute
44,100 Hz16-bitStereo10.1 MB
44,100 Hz24-bitStereo15.1 MB
48,000 Hz24-bitStereo16.5 MB
96,000 Hz24-bitStereo33.0 MB
44,100 Hz16-bitMono5.0 MB

Compressed Audio (MP3, AAC, OGG)

For compressed formats, size depends on the bitrate:

File Size (bytes) = Bitrate (bps) × Duration (seconds) ÷ 8

A 4-minute song at 320 kbps MP3: 320,000 × 240 ÷ 8 = 9,600,000 bytes ≈ 9.2 MB

BitrateSize per MinuteTypical Use Case
64 kbps0.5 MBVoice, audiobooks
128 kbps1.0 MBPodcast, speech
192 kbps1.4 MBMusic streaming
256 kbps1.9 MBHigh-quality music
320 kbps2.4 MBMaximum MP3 quality

Audio Format Comparison: Top 5 Formats

1. WAV (Waveform Audio File Format)

The industry standard for uncompressed audio. WAV files preserve 100% of the original recording data with no quality loss. They're the go-to for professional recording, editing, and mastering. The downside: a 44.1 kHz / 24-bit stereo WAV takes up about 15 MB per minute. According to the Audio Engineering Society (AES, 2024), over 80% of professional studios use WAV as their primary working format.

2. MP3 (MPEG-1 Audio Layer III)

The most universally supported compressed audio format. MP3 uses psychoacoustic compression to remove frequencies that human hearing is least sensitive to. At 320 kbps, most listeners cannot detect a difference from lossless in blind tests. At 128 kbps, the compression is noticeable in quiet passages and high-frequency detail. Spotify streams at 160 kbps on its standard tier and 320 kbps on Premium.

3. AAC (Advanced Audio Coding)

AAC is the successor to MP3 and the default format for Apple Music, YouTube, and iTunes. It achieves better quality than MP3 at the same bitrate — an AAC file at 256 kbps sounds roughly equivalent to an MP3 at 320 kbps. Apple Music streams at 256 kbps AAC. Amazon Music HD offers up to 3,730 kbps lossless streaming.

4. FLAC (Free Lossless Audio Codec)

FLAC compresses audio without any quality loss — it's like ZIP for audio files. A typical FLAC file is 40–60% smaller than WAV while being bit-for-bit identical when decoded. According to Tidal (2025), over 4 million subscribers use their HiFi plan which streams in FLAC. FLAC is the preferred archiving format for audiophiles who want lossless quality with manageable file sizes.

5. OGG Vorbis

An open-source compressed format used by Spotify (alongside AAC) and many games. OGG Vorbis competes with AAC in quality-to-size ratio and is patent-free. Spotify uses OGG Vorbis at 96 kbps (low), 160 kbps (normal), and 320 kbps (very high) depending on subscription tier and device settings.

Practical Storage Planning

For Podcasters

A podcast at 128 kbps mono MP3 uses about 56 MB per hour of content. A weekly 1-hour show generates roughly 2.9 GB per year. According to Buzzsprout's 2025 podcast statistics, the average podcast episode is 43 minutes long. Most podcast hosting services count bandwidth and storage separately — plan for both the file storage and the download bandwidth from listeners.

For Music Producers

Recording at 48 kHz / 24-bit stereo (common for video production) generates 16.5 MB per minute per track. A session with 32 simultaneous tracks generates 528 MB of raw audio per minute. For a 4-minute song with full band tracking and multiple takes, expect 15–30 GB of raw session data before any bouncing or mixdown. The Recording Industry Association of America (RIAA, 2025) reports that digital audio workstation (DAW) storage requirements have grown 40% in the past five years as track counts and sample rates have increased.

For App Developers

If you're bundling audio assets in a mobile app, every megabyte affects download size and therefore install conversion rates. According to Google Play data (2024), app install rates drop approximately 1% for each additional 6 MB of download size. Use OGG Vorbis or AAC at 128–192 kbps for in-app music and 64–96 kbps for voice and sound effects. For short UI sounds under 2 seconds, WAV is sometimes preferable to avoid decoder startup latency on lower-end devices.

Sample Rate and Bit Depth: What They Actually Mean

The Nyquist theoremstates that a sample rate must be at least twice the highest frequency you want to capture. Human hearing tops out at approximately 20,000 Hz, which is why 44,100 Hz (44.1 kHz) became the CD standard — it captures frequencies up to 22,050 Hz with margin to spare.

Sample RateMax FrequencyCommon Use
8,000 Hz4,000 HzTelephone quality
22,050 Hz11,025 HzLow-quality audio
44,100 Hz22,050 HzCD, music streaming
48,000 Hz24,000 HzVideo production, broadcast
96,000 Hz48,000 HzHigh-resolution audio
192,000 Hz96,000 HzStudio mastering

Bit depth controls dynamic range. Each additional bit adds 6 dB of dynamic range:

  • 16-bit: 96 dB dynamic range — sufficient for playback
  • 24-bit: 144 dB dynamic range — preferred for recording and mixing, where headroom prevents clipping
  • 32-bit float: Effectively unlimited dynamic range — used internally by many DAWs to prevent any clipping during processing

Calculate your audio file sizes instantly

Try the Free Audio File Size Calculator →

Frequently Asked Questions

How do I calculate the size of an audio file?

For uncompressed audio (WAV/AIFF): File Size (bytes) = Sample Rate × Bit Depth × Channels × Duration (seconds) ÷ 8. For example, a 3-minute stereo WAV at 44,100 Hz and 16-bit is: 44,100 × 16 × 2 × 180 ÷ 8 = 31,752,000 bytes, or about 30.3 MB. For compressed formats like MP3, divide the bitrate (in bits per second) by 8 to get bytes per second, then multiply by duration.

What bitrate should I use for MP3?

For music, 320 kbps is the gold standard — most listeners cannot distinguish it from lossless. At 192 kbps, quality is good for casual listening. Podcasts and voice recordings are perfectly clear at 128 kbps mono, which produces files roughly 1 MB per minute. Anything below 96 kbps introduces noticeable artifacts in music.

How large is a 1-hour podcast episode?

A 1-hour podcast at 128 kbps mono MP3 is approximately 56 MB. At 192 kbps stereo, the same hour is about 84 MB. Most podcast hosting platforms recommend 128 kbps mono for voice to balance quality and download speed. Listeners on mobile data appreciate smaller files.

Is WAV better than MP3?

WAV is lossless and uncompressed — it preserves every detail of the recording. MP3 is lossy: it discards audio information the ear is less likely to notice, using psychoacoustic models. For archiving, mastering, or professional editing, always use WAV or FLAC. For distribution and streaming, MP3 or AAC at appropriate bitrates is perfectly suitable and far more practical.

How much storage do I need for a music recording studio?

A typical recording session with 24 tracks at 48 kHz / 24-bit generates roughly 750 MB per minute of recorded audio. A 10-minute song with 24 simultaneous tracks would produce about 7.5 GB of raw audio data. With project files, plugins, and session bounces, budget at least 50–100 GB per album project. An NVMe SSD for active sessions and a NAS or external drive for archiving is the standard setup.

What is the difference between sample rate and bit depth?

Sample rate (measured in Hz or kHz) controls how many times per second the audio signal is sampled — higher sample rates capture more of the frequency spectrum. 44,100 Hz (44.1 kHz) is the CD standard and covers the full range of human hearing (up to ~20 kHz). Bit depth controls dynamic range — the difference between the quietest and loudest sounds. 16-bit gives 96 dB of dynamic range; 24-bit gives 144 dB, which is useful during recording and editing to prevent clipping.