Deflate

A simple implementation of Deflate compression and decompression in Go.

Building

Go

The project has no dependencies, so you can build it directly with the Go toolchain:

$ go build .

This will create a deflate binary file in the project folder.

Docker (Linux only)

A Dockerfile is also provided for building the project inside a Docker container. To use it, run the build.sh shell script:

$ ./build.sh

This will create a deflate binary in the build folder.

Usage

Run the deflate binary with the appropriate flags.

The program reads input from stdin, writes output to stdout, and reports errors on stderr. Use the -in and -out flags to read from or write to files instead.

By default the program decompresses the input stream to the output stream; pass the -c flag to compress. The compression ratio can be tuned with the -bs, -insize, -outsize, and -sthreshold flags.

More information is available at deflate -h:

$ deflate -h
Usage of ./deflate:
  -bs int
        maximal block size in symbols (default 65536)
  -c    compress file instead of decompression
  -in string
        specify input file (default "stdin")
  -insize int
        size of the input part of the sliding window (0-258) (default 258)
  -out string
        specify output file (default "stdout")
  -outsize int
        size of the output part of the sliding window (0-32768) (default 32768)
  -sthreshold int
        maximal block size in symbols that is encoded using static huffman trees (default 256)

Note: Uncompressed (stored) blocks are not supported; attempting to decode one results in an error.
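
For context, RFC 1951 starts every Deflate block with a 3-bit header: a BFINAL bit followed by a 2-bit BTYPE, packed least-significant bit first. The sketch below shows one way a decoder could detect and reject a stored block. The names blockHeader, parseBlockHeader, and errStoredBlock are hypothetical and are not taken from this project's source; the sketch also assumes the header sits at the start of a byte, which is only guaranteed for the first block of a stream.

```go
package main

import (
	"errors"
	"fmt"
)

// Block types from RFC 1951, section 3.2.3.
const (
	btypeStored   = 0 // no compression
	btypeFixed    = 1 // fixed (static) Huffman codes
	btypeDynamic  = 2 // dynamic Huffman codes
	btypeReserved = 3 // invalid
)

// Hypothetical error values, for illustration only.
var (
	errStoredBlock   = errors.New("uncompressed (stored) blocks are not supported")
	errReservedBlock = errors.New("reserved block type")
)

type blockHeader struct {
	final bool  // BFINAL: set on the last block of the stream
	btype uint8 // BTYPE: one of the btype* constants
}

// parseBlockHeader decodes a block header from the three lowest bits of b.
func parseBlockHeader(b byte) (blockHeader, error) {
	h := blockHeader{final: b&1 == 1, btype: (b >> 1) & 3}
	switch h.btype {
	case btypeStored:
		return h, errStoredBlock
	case btypeReserved:
		return h, errReservedBlock
	}
	return h, nil
}

func main() {
	for _, b := range []byte{0x03, 0x05, 0x01} {
		h, err := parseBlockHeader(b)
		fmt.Printf("byte %#x: final=%v btype=%d err=%v\n", b, h.final, h.btype, err)
	}
}
```

A real decoder would read these three bits from its bit stream rather than from a whole byte, since later blocks can begin at any bit offset.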

Compression tools comparison

Set	Zopfli	ZLIB	Deflate
README.md	88.3%	93.3%	92.5%
alice29.txt	33.8%	35.7%	36.4%
alphabet.txt	0.3%	0.3%	0.5%
asyoulik.txt	37.0%	39.0%	40.1%
cp.html	31.3%	32.4%	33.2%
fields.c	26.9%	28.0%	28.4%
grammar.lsp	31.7%	32.8%	33.3%
helloworld.txt	91.3%	117.4%	91.3%
random.txt	75.2%	77.1%	77.3%
sum	30.3%	33.7%	35.1%
xargs.1	39.9%	41.1%	41.6%

The program holds up well against the other tools, and on some inputs (README.md and helloworld.txt) it even produces smaller output than ZLIB. On average, however, it performs slightly worse than ZLIB, and it does not come close to Zopfli, which iteratively refines its compression.

No timing tests were run, because this implementation is far (really far) slower than the widely used ones (see the next section).

Room for improvement

TL;DR

Areas for improvement:

Use hash chains instead of brute force in LZ77;
Add bit buffering;
Optimize the package-merge algorithm;
Add support for uncompressed blocks.
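
As an illustration of the first item, here is a minimal sketch of a hash-chain match finder. It is not this project's code: the matcher type, the hash function, and the maxChain cap are assumptions made for the example. The idea is to index every position by a hash of its next three bytes, so that a search only visits earlier positions that are likely to start a match instead of every distance in the window.

```go
package main

import "fmt"

const (
	minMatch   = 3     // shortest match Deflate can encode
	maxMatch   = 258   // longest match Deflate can encode
	windowSize = 32768 // largest backward distance Deflate can encode
	hashBits   = 15
	maxChain   = 128 // hypothetical cap on candidates examined per search
)

// hash3 mixes the three bytes starting at data[i] into a hashBits-wide value.
func hash3(data []byte, i int) uint32 {
	v := uint32(data[i]) | uint32(data[i+1])<<8 | uint32(data[i+2])<<16
	return (v * 2654435761) >> (32 - hashBits)
}

type matcher struct {
	data []byte
	head []int // head[h]: most recent inserted position with hash h, or -1
	prev []int // prev[i]: previous inserted position sharing i's hash, or -1
}

func newMatcher(data []byte) *matcher {
	m := &matcher{data: data, head: make([]int, 1<<hashBits), prev: make([]int, len(data))}
	for i := range m.head {
		m.head[i] = -1
	}
	return m
}

// insert adds position i to the front of its hash chain.
func (m *matcher) insert(i int) {
	if i+minMatch > len(m.data) {
		return // fewer than three bytes left, nothing to hash
	}
	h := hash3(m.data, i)
	m.prev[i] = m.head[h]
	m.head[h] = i
}

// longest returns the best (distance, length) match for position i among
// previously inserted positions, or (0, 0) if none reaches minMatch.
// Call it before insert(i).
func (m *matcher) longest(i int) (dist, length int) {
	if i+minMatch > len(m.data) {
		return 0, 0
	}
	limit := len(m.data) - i
	if limit > maxMatch {
		limit = maxMatch
	}
	cand := m.head[hash3(m.data, i)]
	for n := 0; cand >= 0 && i-cand <= windowSize && n < maxChain; n++ {
		l := 0
		for l < limit && m.data[cand+l] == m.data[i+l] {
			l++
		}
		if l >= minMatch && l > length {
			dist, length = i-cand, l
		}
		cand = m.prev[cand]
	}
	return dist, length
}

func main() {
	data := []byte("abcabcabcabc")
	m := newMatcher(data)
	for i := 0; i < len(data); {
		dist, length := m.longest(i)
		step := 1
		if length >= minMatch {
			fmt.Printf("match(distance=%d, length=%d)\n", dist, length)
			step = length
		} else {
			fmt.Printf("literal %q\n", data[i])
		}
		// Register every consumed position so later searches can find it.
		for j := i; j < i+step; j++ {
			m.insert(j)
		}
		i += step
	}
}
```

Capping the chain length trades a little compression for bounded search time; production compressors typically tie such a cap to the compression level.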

Details

This implementation is extremely slow because its LZ77 stage uses brute force, trying every possible (distance, length) pair, whereas other implementations use hash chains. Calling a function for every single bit operation without buffering (BitStream) adds up to significant call overhead, so 32-bit or 64-bit buffering would help. The package-merge algorithm was also not implemented with performance in mind.

That said, the main goal of the project was not to build a practical compression tool but to better understand the compression format itself.
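
To make the buffering idea concrete, here is a minimal sketch of a bit writer that collects bits in a 64-bit accumulator and emits whole bytes, so a multi-bit field costs one call instead of one call per bit. The bitWriter type and its methods are hypothetical and do not mirror this project's BitStream API.

```go
package main

import (
	"bytes"
	"fmt"
)

// bitWriter packs bits least-significant bit first, as Deflate requires
// for header fields and extra bits. Huffman codes are packed starting
// from their most-significant bit, so a caller would reverse a code's
// bits before passing it to writeBits.
type bitWriter struct {
	out   bytes.Buffer
	acc   uint64 // pending bits; bit 0 is the next bit to be written
	nbits uint   // number of valid bits in acc (always < 8 between calls)
}

// writeBits appends the low n bits of v, for n up to 32.
func (w *bitWriter) writeBits(v uint32, n uint) {
	mask := uint64(1)<<n - 1
	w.acc |= (uint64(v) & mask) << w.nbits
	w.nbits += n
	// Emit every complete byte; at most 7 bits stay pending.
	for w.nbits >= 8 {
		w.out.WriteByte(byte(w.acc))
		w.acc >>= 8
		w.nbits -= 8
	}
}

// flush pads the last partial byte with zero bits and returns all output.
func (w *bitWriter) flush() []byte {
	if w.nbits > 0 {
		w.out.WriteByte(byte(w.acc))
		w.acc, w.nbits = 0, 0
	}
	return w.out.Bytes()
}

func main() {
	var w bitWriter
	w.writeBits(1, 1) // BFINAL = 1
	w.writeBits(1, 2) // BTYPE = 01 (fixed Huffman codes)
	w.writeBits(0, 7) // end-of-block symbol (7-bit code 0000000)
	fmt.Printf("% x\n", w.flush())
}
```

The three calls in main produce an empty Deflate stream made of a single final fixed-Huffman block, which is a convenient smoke test for a writer like this.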
