Experimental fork of 'vavr' with CHAMP collections

Status

This is an experimental fork of github.com/vavr-io/vavr.

CHAMP collections

This fork contains additional collections, that use CHAMP (Compressed Hash-Array Mapped Prefix-tree) as their underlying data structures. The collections are derived from github.com/usethesource/capsule, and github.com/wrandelshofer/jhotdraw8.

The collections are:

ChampSet
ChampMap
LinkedChampSet
LinkedChampMap

Each collection has a mutable partner:

MutableChampSet
MutableChampMap
MutableLinkedChampSet
MutableLinkedChampMap

Performance characteristics

ChampSet, ChampMap, MutableChampSet, MutableChampMap:

Maximal supported size: 2³⁰ elements.
Get/Insert/Remove: O(1)
Head/Tail: O(1)
Iterator creation: O(1)
Iterator.next(): O(1)
toImmutable/toMutable: O(1) a cost distributed across subsequent updates of the mutable copy

The costs are only constant in the limit. In practice, they are more like O(log₃₂ N).

If a collection is converted from/to immutable/mutable, the mutual copy of the collection loses ownership of all its trie nodes. Updates are slightly more expensive for the mutable copy, until it gains exclusive ownership of all trie nodes again.

LinkedChampSet, LinkedChampMap, MutableLinkedChampSet, MutableLinkedChampMap:

Maximal supported size: 2³⁰ elements.
Get/Insert/Remove: O(1) amortized
Head/Tail: O(N)
Iterator creation: O(N)
Iterator.next(): O(1)
toImmutable/toMutable: O(1) a cost distributed across subsequent updates of the mutable copy

The collections are not actually linked. The collections store a sequence number with each data element. The sequence numbers must be renumbered from time to time, to prevent large gaps and overflows/underflows.

When we iterate over the elements, we need to sort them. We do this with a bucket sort in O(N) time. We achieve O(N) instead of O(N log N) for the bucket sort, because we use at least N buckets, and no more than N * 4 buckets.

Currently, the code contains a fall-back code for collections that grow larger than 2³⁰ elements. For very large collections the buckets do not fit into a Java array anymore. We have to fall back to a heap. With the heap, Iterator.next() needs O(log N) instead of O(1).

Benchmarks

The following chart shows a comparison of the CHAMP maps with vavr collections and with Scala collections. Scala org.scala-lang:scala-library:2.13.8 was used.

The collections have 1 million entries.

The y-axis is labeled in nanoseconds. The bars are cut off at 1'500 ns (!). This cuts of the elapsed times of functions that run in linear times.

scala.HashMap has a very competitive and balanced performance. It uses a CHAMP trie as its underlying data structure.
scala.VectorMap is slower than most of the other collections, but the performance is balanced. It uses a radix-balanced finger tree (Vector) and a CHAMP trie as its underlying data structure.
vavr.HashMap has a very competitive and balanced performance. It uses a HAMP trie as its underlying data structure.
vavr.LinkedHashMap has competitive query times, but updates need linear time. It uses a HAMP trie and a Banker's queue as its underlying data structure.
vavr.ChampMap has a very competitive and balanced performance. It uses a CHAMP trie as its underlying data structure.
vavr.LinkedChampMap has competitive performance except for accesses to the first/last entry. It uses a CHAMP trie and sequence numbers on the entries.

Name		Name	Last commit message	Last commit date
Latest commit History 4,356 Commits
.github		.github
generator		generator
gradle/wrapper		gradle/wrapper
src-gen		src-gen
src		src
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
.travis.yml		.travis.yml
BenchmarkChart.png		BenchmarkChart.png
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RELEASE.md		RELEASE.md
build.gradle		build.gradle
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle		settings.gradle
update-copyright.sh		update-copyright.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Experimental fork of 'vavr' with CHAMP collections

Status

CHAMP collections

Performance characteristics

ChampSet, ChampMap, MutableChampSet, MutableChampMap:

LinkedChampSet, LinkedChampMap, MutableLinkedChampSet, MutableLinkedChampMap:

Benchmarks

About

Releases

Packages

Languages

License

wrandelshofer/vavr

Folders and files

Latest commit

History

Repository files navigation

Experimental fork of 'vavr' with CHAMP collections

Status

CHAMP collections

Performance characteristics

ChampSet, ChampMap, MutableChampSet, MutableChampMap:

LinkedChampSet, LinkedChampMap, MutableLinkedChampSet, MutableLinkedChampMap:

Benchmarks

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages