#huffman #hpack #canonical #huffman-coding #codec #http #string-literal

httlib-huffman

Canonical Huffman algorithm for handling HPACK format in HTTP/2

14 releases

0.3.4 Nov 11, 2020
0.3.3 Nov 10, 2020
0.2.3 Nov 5, 2020
0.2.0 Oct 20, 2020
0.1.4 Oct 16, 2020

#560 in Algorithms

Download history 1544/week @ 2024-08-08 1457/week @ 2024-08-15 1128/week @ 2024-08-22 1228/week @ 2024-08-29 1108/week @ 2024-09-05 1132/week @ 2024-09-12 987/week @ 2024-09-19 1004/week @ 2024-09-26 1292/week @ 2024-10-03 1063/week @ 2024-10-10 1367/week @ 2024-10-17 1499/week @ 2024-10-24 1707/week @ 2024-10-31 1694/week @ 2024-11-07 1381/week @ 2024-11-14 1394/week @ 2024-11-21

6,470 downloads per month
Used in 17 crates (2 directly)

MIT license

240KB
7K SLoC

httlib-huffman

This crate implements canonical Huffman functionality for handling HPACK format in HTTP/2. It exposes a simple API for performing the encoding and decoding of HTTP/2 header string literals according to the HPACK spec.

Documentation Source

Header Compression format for HTTP/2, known as HPACK, foresees the use of the Huffman algorithm for encoding header literal values. This contributes to the additional decrease in the quantity of data, transferred with each web request and response.

A Huffman code is a particular type of optimal prefix code that is commonly used for lossless data compression. The process of finding or using such a code proceeds by means of Huffman coding, an algorithm developed by David A. Huffman. The output from Huffman's algorithm can be viewed as a variable-length code table for encoding a source symbol (such as a character in a file). The algorithm derives this table from the estimated probability or frequency of occurrence (weight) for each possible value of the source symbol. As in other entropy encoding methods, more common symbols are generally represented using fewer bits than less common symbols. Huffman's method can be efficiently implemented, finding a code in time linear to the number of input weights if these weights are sorted.

HPACK compression entails a pre-created canonical Huffman code table for encoding ASCII characters to the Huffman sequence. A canonical Huffman code is a particular type of Huffman code with unique properties that allow it to be described in a very compact manner. In the aforementioned table are the Huffman codes for each ASCII character with a length up to 32 bits (4x by 8 fields with value 0 or 1), in the form of base-2 integer, aligned on the most significant bit (MSB is the bit farthest to the left).

Each module covers this topic in more details so check the rest of the code to learn more.

Usage

Encoding example:

use httlib_huffman::encode;

let mut dst = Vec::new();
let text = "Hello world!".as_bytes();
encode(&text, &mut dst).unwrap();

Decoding example:

use httlib_huffman::{DecoderSpeed, decode};

let speed = DecoderSpeed::ThreeBits;
let mut dst = Vec::new();
let src = vec![135];
decode(&src, &mut dst, speed).unwrap();

Articles

License: MIT

No runtime deps