Popularity

6.2

Growing

Activity

0.0

Stable

Stars 8

Watchers 4

Forks 6

Last Commit over 3 years ago

Monthly Downloads: 14

Programming language: Haskell

License: BSD 3-clause "New" or "Revised" License

Tags: Data

intset alternatives and similar packages

Based on the "Data" category.
Alternatively, view intset alternatives based on common mentions on social networks and blogs.

semantic-source

10.0 9.1 intset VS semantic-source

Parsing, analyzing, and comparing source code across many languages
lens

10.0 6.8 intset VS lens

Lenses, Folds, and Traversals - Join us on web.libera.chat #haskell-lens

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

Promo workos.com

hnix

9.9 6.8 intset VS hnix

A Haskell re-implementation of the Nix expression language
code-builder

9.8 0.0 intset VS code-builder

Packages for defining APIs, running them, generating client code and documentation.
text

9.8 8.4 intset VS text

Haskell library for space- and time-efficient operations over Unicode text.
compendium-client

9.7 0.0 intset VS compendium-client

Mu (μ) is a purely functional framework for building micro services.
unordered-containers

9.7 5.0 intset VS unordered-containers

Efficient hashing-based container types
Frames

9.7 6.1 intset VS Frames

Data frames for tabular data.
massiv

9.7 5.9 intset VS massiv

Efficient Haskell Arrays featuring Parallel computation
cassava

9.7 5.9 intset VS cassava

A CSV parsing and encoding library optimized for ease of use and high performance
holmes

9.6 0.0 intset VS holmes

A reference library for constraint-solving with propagators and CDCL.
resource-pool

9.5 0.0 intset VS resource-pool

A high-performance striped resource pooling implementation for Haskell
hashable

9.5 5.7 intset VS hashable

A class for types that can be converted to a hash value
alfred-margaret

9.5 3.5 intset VS alfred-margaret

Fast Aho-Corasick string searching
refined

9.5 1.5 intset VS refined

Refinement types with static checking
critbit

9.5 0.0 intset VS critbit

A Haskell implementation of crit-bit trees.
primitive

9.5 5.9 intset VS primitive

This package provides various primitive memory-related operations.
binary

9.5 4.3 intset VS binary

Efficient, pure binary serialisation using ByteStrings in Haskell.
higgledy

9.4 2.2 intset VS higgledy

Higher-kinded data via generics
data-msgpack

9.4 intset VS data-msgpack

A Haskell implementation of MessagePack
caledon

9.4 0.0 intset VS caledon

higher order dependently typed logic programing
jump

9.4 0.0 intset VS jump

Jump start your Haskell development
discrimination

9.4 3.1 intset VS discrimination

Fast linear time sorting and discrimination for a large class of data types
json-autotype

9.4 0.0 intset VS json-autotype

Automatic Haskell type inference from JSON input
network-msgpack-rpc

9.4 intset VS network-msgpack-rpc

A MessagePack-RPC Implementation
aeson-qq

9.4 3.6 intset VS aeson-qq

JSON quasiquoter for Haskell
diskhash

9.4 0.0 intset VS diskhash

Diskbased (persistent) hashtable
hashtables

9.4 1.0 intset VS hashtables

Mutable hash tables for Haskell, in the ST monad
audiovisual

9.3 3.5 intset VS audiovisual

Extensible records, variants, structs, effects, tangles
reflection

9.3 4.8 intset VS reflection

Reifies arbitrary Haskell terms into types that can be reflected back into terms
dependent-sum

9.3 4.3 intset VS dependent-sum

Dependent sums and supporting typeclasses for comparing and displaying them
dependent-map

9.3 0.0 intset VS dependent-map

Dependently-typed finite maps (partial dependent products)
cereal

9.3 0.0 intset VS cereal

A binary serialization library
IORefCAS

9.3 3.5 intset VS IORefCAS

A collection of different packages for CAS based data structures.
certificate

9.3 0.0 intset VS certificate

Certificate and Key Reader/Writer in haskell
protobuf

9.2 2.6 intset VS protobuf

An implementation of Google's Protocol Buffers in Haskell.
safecopy

9.2 3.1 intset VS safecopy

An extension to Data.Serialize with built-in version control
streaming

9.2 0.0 intset VS streaming

An optimized general monad transformer for streaming applications, with a simple prelude of functions
rei

9.2 0.0 intset VS rei

Process lists easily
bifunctors

9.2 5.6 intset VS bifunctors

Haskell 98 bifunctors, bifoldables and bitraversables
orgmode-parse

9.2 0.0 intset VS orgmode-parse

Attoparsec parser combinators for parsing org-mode structured text!
avro

9.2 4.2 intset VS avro

Haskell Avro Encoding and Decoding Native Support (no RPC)
scientific

9.2 0.0 intset VS scientific

Arbitrary-precision floating-point numbers represented using scientific notation
uuid

9.1 2.9 intset VS uuid

A Haskell library for creating, printing and parsing UUIDs
b-tree

9.1 1.8 intset VS b-tree

Haskell on-disk B* tree implementation
uuid-types

9.1 2.9 intset VS uuid-types

A Haskell library for creating, printing and parsing UUIDs
text-icu

9.1 6.1 intset VS text-icu

This package provides the Haskell Data.Text.ICU library, for performing complex manipulation of Unicode text.
witherable

9.1 6.4 intset VS witherable

Filter with effects
stdio

9.1 1.8 intset VS stdio

Haskell Standard Input and Output
tables

9.1 0.0 intset VS tables

Deprecated because of

Do you think we are missing an alternative of intset or a related project?

Add another 'Data' Package

Popular Comparisons

README

Synopsis

This package provides efficient integer interval sets.

Description

Persistent... is it trees?

Yes, Radix trees. Trees are balanced by prefix bits, so we have fast merge operations, such as union, intersection and difference. Chris Okasaki and Andrew Gill shows that Patricia tree based integer maps might be order of magnitude faster than Red-Black tree counterparts on this operations. The same apply to integer sets, we just have keys, but don't have values.

That does mean the "dense"?

That means we keep suffixes in bitmaps and we might pack, say 10, integers which lies close together in one bitmap. This optimization doesn't affect execution times for sparse sets, but makes dense sets much more memory efficient — near 10-50 times less space usage depending on machine word size and the actual density of the set. Basically, this let us be 3-4 times less memory efficient comparing with arrays of tightly packed bits, but see...

How suffix compaction is performed?

There are exist a pretty simple algorithm used in memory allocators called "buddy memory allocator". In a nutshell, we have a big block which is splitted by half when we remove from one of the half, and merge then back when we insert. It's somewhat inverse to the ordinary tree approach — in buddy tree we hold more information about elements that it doesn't contain, while in prefix tree we hold more information about elements that it does contain. It's easy to guess that we should do with it — take the two structures then fuse them into one to produce a new structure which perform better.

Indeed, the key idea in the design is right here — we switch forth and back between representations per subtree basis. We intersperse different representations in different tree branches. It's like chameleon:

If the some subset is sparse, we just keep a radix tree with bitmaps at leafs.
If the some subset becomes full we turn it into block. If some buddy block appears, we join the buddy blocks into one. And so forth.

That is, we just get a structure that dynamically choose the optimal representation depending on density of set. Moreover in best case this lead to huge space savings:

> ppStats (fromList [0..123456])

gives:

Bin count: 6
Tip count: 1
Fin count: 6
Size in bytes: 408
Saved space over dense set:  123072
Saved space over bytestring: 11879
Percent saved over dense set:  99.6695821185617%
Percent saved over bytestring: 96.67941727028567%

The ppStats is not an exposed function but you can play with it using cabal-dev ghci.

I don't know if it is an old idea, but this works just fine.

So when this data structure is good choice?

In many situation. It might be used as persistent and compact replacement for bool arrays or Data.IntSet with the following advantages:

Purity is extremely useful in multithreaded settings — we could keep a set in a mutable transactional variable or an IORef and atomically update/modify the set. So it could be used as replacement for TArray Int Bool as well.
By merging intervals together we achieve compactness. In best case some of main operations will take O(1)time and space, so if you need interval set it's here.
Fast serizalization: if you are need conversion to/from bytestrings. Because of bitmaps it's possible to do this conversion extremely fast.

How this implementation relate to containers version?

Heavely based. Essentially we just add the buddy interval compaction, but it turns out that some operations becomes more complicated and requires much more effort to implement — in order to maintain the all tree invariants we need to take into account more cases. This is the reason why some operations are not implemented yet (e.g. lack of views), but I hope I'll fix it with the time.