Huffman Coding

What is Huffman Coding?

Huffman Coding is a lossless data compression algorithm. The idea is to assign variable-length codes to input characters, with lengths based on the frequencies of corresponding characters.

The most frequent character gets the shortest code and the least frequent character gets the longest code. These variable-length codes are read off a Huffman Tree, which is built using a greedy approach: at every step, the two least frequent nodes are merged.

Visual Example

A: 1, B: 00, C: 01. The most frequent character 'A' has the shortest code.
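
To make the example concrete, here is a minimal sketch that encodes and decodes a string with the code table above. The function names `encode` and `decode` and the sample input "AABAC" are illustrative assumptions, not part of the original page.

```python
# Code table taken from the visual example above.
CODES = {"A": "1", "B": "00", "C": "01"}

def encode(text, codes):
    """Concatenate the code of each character."""
    return "".join(codes[ch] for ch in text)

def decode(bits, codes):
    """Read bits left to right, emitting a character whenever the buffer matches a code."""
    reverse = {code: ch for ch, code in codes.items()}
    out, buffer = [], ""
    for bit in bits:
        buffer += bit
        if buffer in reverse:
            out.append(reverse[buffer])
            buffer = ""
    if buffer:
        raise ValueError("trailing bits do not form a complete code")
    return "".join(out)

encoded = encode("AABAC", CODES)
print(encoded)                 # 1100101
print(decode(encoded, CODES))  # AABAC
```

The five characters take 7 bits here, versus 10 bits with a fixed 2-bit code per character. The greedy match in `decode` is only safe because no code in the table is a prefix of another.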

Algorithm Strategy

1. Calculate Frequencies

Count how often each character appears in the data. Create a leaf node for each character and build a min-heap of all leaf nodes.
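
As a rough illustration of this step, the sketch below counts characters with Python's `collections.Counter` and turns the counts into a min-heap with `heapq`. The helper name `build_leaf_heap` and the `(frequency, character)` tuple layout are assumptions made for this example.

```python
import heapq
from collections import Counter

def build_leaf_heap(text):
    """Return a min-heap of (frequency, character) leaf entries."""
    freqs = Counter(text)
    heap = [(freq, ch) for ch, freq in freqs.items()]
    heapq.heapify(heap)  # rearranges the list in place, in O(N)
    return heap

heap = build_leaf_heap("AAAABBC")
print(heap[0])  # (1, 'C')
```

The smallest entry always sits at index 0, which is what makes the repeated "extract minimum" in the next step cheap.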

2. Build Tree

Extract the two nodes with the lowest frequencies from the min-heap. Create a new internal node whose frequency is the sum of the two nodes' frequencies, with the extracted nodes as its left and right children. Insert this new node back into the min-heap. Repeat until only one node remains; that node is the root of the Huffman Tree.
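
The merge loop can be sketched with `heapq` as follows. This is one possible representation, not the page's own: leaves are plain strings, internal nodes are `(left, right)` tuples, and a counter (`tie`, a name chosen here) breaks frequency ties so the heap never has to compare tree nodes with each other.

```python
import heapq
from itertools import count

def build_tree(freqs):
    """Build a Huffman tree from a {symbol: frequency} mapping.

    Leaves are plain symbols (assumed to be strings); internal nodes are
    (left, right) tuples. Returns (total_frequency, tree), or None if empty.
    """
    tie = count()  # unique sequence numbers used only to break ties
    heap = [(freq, next(tie), sym) for sym, freq in freqs.items()]
    heapq.heapify(heap)
    if not heap:
        return None
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # lowest frequency
        f2, _, right = heapq.heappop(heap)   # second lowest
        heapq.heappush(heap, (f1 + f2, next(tie), (left, right)))
    total, _, tree = heap[0]
    return total, tree

print(build_tree({"A": 4, "B": 2, "C": 1}))  # (7, (('C', 'B'), 'A'))
```

The root frequency equals the total number of characters, which is a handy sanity check on the result.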

3. Generate Codes

Traverse the constructed tree from root to leaves. Assign '0' for a left branch and '1' for a right branch. The sequence of bits from root to a leaf node is the code for that character.
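
A recursive traversal along these lines might look like the sketch below. It assumes a tree made of nested `(left, right)` tuples with string leaves, which is a representation chosen for this example. The tree literal is hand-built so that it reproduces the code table from the Visual Example.

```python
def generate_codes(tree, prefix="", codes=None):
    """Walk the tree, appending '0' when going left and '1' when going right."""
    if codes is None:
        codes = {}
    if isinstance(tree, tuple):          # internal node: (left, right)
        left, right = tree
        generate_codes(left, prefix + "0", codes)
        generate_codes(right, prefix + "1", codes)
    else:                                # leaf: a symbol
        # A tree with a single symbol has an empty path, so fall back to "0".
        codes[tree] = prefix or "0"
    return codes

# Hand-built tree: B and C share an internal node on the left, A is on the right.
tree = (("B", "C"), "A")
print(generate_codes(tree))  # {'B': '00', 'C': '01', 'A': '1'}
```

Swapping the '0' and '1' labels, or the left and right children, yields a different but equally short set of codes; only the code lengths are determined by the frequencies.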

Complexity Analysis

Time: O(N log N)

Space: O(N)

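Here N is the number of unique characters. Building the tree takes N - 1 merges, and each merge extracts two nodes and inserts one. Each of these operations costs O(log N) on a binary heap, so the 2*(N-1) extractions and N-1 insertions add up to O(N log N). The tree holds N leaves and N - 1 internal nodes, which gives O(N) space.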
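
The operation count can be checked empirically. The sketch below runs the merge loop on dummy frequencies and tallies heap operations; `count_heap_ops` is a hypothetical helper written for this illustration.

```python
import heapq

def count_heap_ops(n):
    """Run the merge loop on n dummy frequencies and count heap operations."""
    heap = list(range(1, n + 1))   # dummy frequencies 1..n
    heapq.heapify(heap)
    pops = pushes = 0
    while len(heap) > 1:
        a = heapq.heappop(heap)
        b = heapq.heappop(heap)
        heapq.heappush(heap, a + b)
        pops += 2
        pushes += 1
    return pops, pushes

for n in (2, 8, 100):
    print(n, count_heap_ops(n))
# 2 (2, 1)
# 8 (14, 7)
# 100 (198, 99)
```

In each case the loop performs 2*(N-1) extractions and N-1 insertions, regardless of the actual frequency values.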

Key Takeaways

Huffman Coding generates prefix codes, meaning no code is a prefix of another code. This ensures unambiguous decoding.
Among prefix codes that encode one character at a time, it produces the minimum total encoded length when the character frequencies are known.
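
Both takeaways can be checked on the small example from this page. The sketch below tests the prefix property and computes the total encoded size; the helper names and the frequencies A=4, B=2, C=1 are assumptions chosen for illustration.

```python
def is_prefix_free(codes):
    """Return True if no code is a prefix of a different code."""
    words = sorted(codes.values())
    # After sorting, any prefix relation shows up between neighbours.
    return all(not b.startswith(a) for a, b in zip(words, words[1:]))

def total_bits(freqs, codes):
    """Total encoded length: sum of frequency * code length over all symbols."""
    return sum(freqs[sym] * len(codes[sym]) for sym in freqs)

codes = {"A": "1", "B": "00", "C": "01"}
freqs = {"A": 4, "B": 2, "C": 1}   # hypothetical frequencies

print(is_prefix_free(codes))                  # True
print(is_prefix_free({"A": "1", "B": "10"}))  # False: "1" is a prefix of "10"
print(total_bits(freqs, codes))               # 10
```

With these frequencies the variable-length codes need 10 bits, while a fixed 2-bit code for three symbols would need 14.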

algorithm.txt

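structure Node
  initialize(char, freq)
    this.char ← char        // null for internal nodes
    this.freq ← freq
    this.left ← null
    this.right ← null

algorithm buildHuffmanTree(chars, freqs)
  // Create one leaf node per character
  pq ← []
  for i ← 0 to chars.length - 1 do
    pq.push(new Node(chars[i], freqs[i]))

  // A sorted array stands in for the min-heap here; re-sorting on every
  // iteration is simpler to read but slower than a true heap
  pq.sort((a, b) => a.freq - b.freq)

  while pq.length > 1 do
    // Remove the two lowest-frequency nodes
    left ← pq.shift()
    right ← pq.shift()

    // Merge them under a new internal node
    sumNode ← new Node(null, left.freq + right.freq)
    sumNode.left ← left
    sumNode.right ← right

    pq.push(sumNode)
    pq.sort((a, b) => a.freq - b.freq)

  // The last remaining node is the root of the Huffman Tree
  return pq[0]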
