Deduplication

Deduplication is a technique of removing duplicate copies of repeating data. It is useful in many different contexts such as:

Deduplication systems can be categorized according to a number of criteria:

post-process deduplication: new data is first stored on device then later a process analyzes the data looking for duplicates.
inline deduplication: done as data is incoming on the device to look for and eliminate duplicates.
target deduplication: deduplication is done where the data is stored/processed.
source deduplication: deduplication is done where the data is created or originating.

I implemented three different approaches to deduplication, each with it's own benefits. They are:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bloom		bloom
cache		cache
cuckoo		cuckoo
example		example
keyvalue		keyvalue
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback