Popularity

3.9

Growing

Activity

0.0

Stable

Stars 8

Watchers 3

Forks 0

Last Commit about 6 years ago

Monthly Downloads: 9

Programming language: Haskell

License: MIT License

Tags: Data

groupBy alternatives and similar packages

Based on the "Data" category.
Alternatively, view groupBy alternatives based on common mentions on social networks and blogs.

semantic-source

10.0 9.1 groupBy VS semantic-source

Parsing, analyzing, and comparing source code across many languages
lens

10.0 6.8 groupBy VS lens

Lenses, Folds, and Traversals - Join us on web.libera.chat #haskell-lens

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

Promo workos.com

hnix

9.9 6.8 groupBy VS hnix

A Haskell re-implementation of the Nix expression language
code-builder

9.8 0.0 groupBy VS code-builder

Packages for defining APIs, running them, generating client code and documentation.
text

9.8 8.4 groupBy VS text

Haskell library for space- and time-efficient operations over Unicode text.
compendium-client

9.7 0.0 groupBy VS compendium-client

Mu (μ) is a purely functional framework for building micro services.
unordered-containers

9.7 5.0 groupBy VS unordered-containers

Efficient hashing-based container types
Frames

9.7 6.1 groupBy VS Frames

Data frames for tabular data.
massiv

9.7 5.9 groupBy VS massiv

Efficient Haskell Arrays featuring Parallel computation
cassava

9.7 5.9 groupBy VS cassava

A CSV parsing and encoding library optimized for ease of use and high performance
holmes

9.6 0.0 groupBy VS holmes

A reference library for constraint-solving with propagators and CDCL.
resource-pool

9.5 0.0 groupBy VS resource-pool

A high-performance striped resource pooling implementation for Haskell
hashable

9.5 5.7 groupBy VS hashable

A class for types that can be converted to a hash value
alfred-margaret

9.5 3.5 groupBy VS alfred-margaret

Fast Aho-Corasick string searching
refined

9.5 1.5 groupBy VS refined

Refinement types with static checking
critbit

9.5 0.0 groupBy VS critbit

A Haskell implementation of crit-bit trees.
primitive

9.5 5.9 groupBy VS primitive

This package provides various primitive memory-related operations.
binary

9.5 4.3 groupBy VS binary

Efficient, pure binary serialisation using ByteStrings in Haskell.
higgledy

9.4 2.2 groupBy VS higgledy

Higher-kinded data via generics
data-msgpack

9.4 groupBy VS data-msgpack

A Haskell implementation of MessagePack
caledon

9.4 0.0 groupBy VS caledon

higher order dependently typed logic programing
jump

9.4 0.0 groupBy VS jump

Jump start your Haskell development
discrimination

9.4 3.1 groupBy VS discrimination

Fast linear time sorting and discrimination for a large class of data types
json-autotype

9.4 0.0 groupBy VS json-autotype

Automatic Haskell type inference from JSON input
network-msgpack-rpc

9.4 groupBy VS network-msgpack-rpc

A MessagePack-RPC Implementation
aeson-qq

9.4 3.6 groupBy VS aeson-qq

JSON quasiquoter for Haskell
diskhash

9.4 0.0 groupBy VS diskhash

Diskbased (persistent) hashtable
hashtables

9.4 1.0 groupBy VS hashtables

Mutable hash tables for Haskell, in the ST monad
audiovisual

9.3 3.5 groupBy VS audiovisual

Extensible records, variants, structs, effects, tangles
reflection

9.3 4.8 groupBy VS reflection

Reifies arbitrary Haskell terms into types that can be reflected back into terms
dependent-sum

9.3 4.3 groupBy VS dependent-sum

Dependent sums and supporting typeclasses for comparing and displaying them
dependent-map

9.3 0.0 groupBy VS dependent-map

Dependently-typed finite maps (partial dependent products)
cereal

9.3 0.0 groupBy VS cereal

A binary serialization library
IORefCAS

9.3 3.5 groupBy VS IORefCAS

A collection of different packages for CAS based data structures.
certificate

9.3 0.0 groupBy VS certificate

Certificate and Key Reader/Writer in haskell
protobuf

9.2 2.6 groupBy VS protobuf

An implementation of Google's Protocol Buffers in Haskell.
safecopy

9.2 3.1 groupBy VS safecopy

An extension to Data.Serialize with built-in version control
streaming

9.2 0.0 groupBy VS streaming

An optimized general monad transformer for streaming applications, with a simple prelude of functions
rei

9.2 0.0 groupBy VS rei

Process lists easily
bifunctors

9.2 5.6 groupBy VS bifunctors

Haskell 98 bifunctors, bifoldables and bitraversables
orgmode-parse

9.2 0.0 groupBy VS orgmode-parse

Attoparsec parser combinators for parsing org-mode structured text!
avro

9.2 4.2 groupBy VS avro

Haskell Avro Encoding and Decoding Native Support (no RPC)
scientific

9.2 0.0 groupBy VS scientific

Arbitrary-precision floating-point numbers represented using scientific notation
uuid

9.1 2.9 groupBy VS uuid

A Haskell library for creating, printing and parsing UUIDs
b-tree

9.1 1.8 groupBy VS b-tree

Haskell on-disk B* tree implementation
uuid-types

9.1 2.9 groupBy VS uuid-types

A Haskell library for creating, printing and parsing UUIDs
text-icu

9.1 6.1 groupBy VS text-icu

This package provides the Haskell Data.Text.ICU library, for performing complex manipulation of Unicode text.
witherable

9.1 6.4 groupBy VS witherable

Filter with effects
stdio

9.1 1.8 groupBy VS stdio

Haskell Standard Input and Output
tables

9.1 0.0 groupBy VS tables

Deprecated because of

Do you think we are missing an alternative of groupBy or a related project?

Add another 'Data' Package

Popular Comparisons

README

groupBy

This provides a drop-in replacement for Data.List.groupBy, with benchmarks and tests.

The original Data.List.groupBy has (perhaps unexpected) behaviour, in that it compares elements to the first in the group, not adjacent ones. In other words, if you wanted to group into ascending sequences:

>>> Data.List.groupBy (<=) [1,2,2,3,1,2,0,4,5,2]
[[1,2,2,3,1,2],[0,4,5,2]]

The replacement has three distinct advantages:

It groups adjacent elements, allowing the example above to function as expected:

   >>> Data.List.GroupBy.groupBy (<=) [1,2,2,3,1,2,0,4,5,2]
   [[1,2,2,3],[1,2],[0,4,5],[2]]

It is a good producer and consumer, with rules similar to those for Data.List.scanl. The old version was defined in terms of span:

   groupBy                 :: (a -> a -> Bool) -> [a] -> [[a]]
   groupBy _  []           =  []
   groupBy eq (x:xs)       =  (x:ys) : groupBy eq zs
                              where (ys,zs) = span (eq x) xs

Which prevents it from being a good producer/consumer.

It is significantly faster than the original in most cases.

Tests

Tests ensure that the function is the same as the original when the relation supplied is an equivalence, and that it performs the expected adjacent comparisons when the relation isn't transitive.

The tests also check that laziness is maintained, as defined by:

>>> head (groupBy (==) (1:2:undefined))
[1]

>>> (head . head) (groupBy undefined (1:undefined))
1

>>> (head . head . tail) (groupBy (==) (1:2:undefined))
2

Benchmarks

Benchmarks compare the function to three other implementations: the current Data.List.groupBy, a version provided by the utility-ht package, and a version provided by Brandon Simmons.

The benchmarks test functions that force the outer list:

length . groupBy eq

And functions which force the contents of the inner lists:

sum' = foldl' (+) 0

sum' . map sum' . groupBy eq

Each benchmark is run on lists where the groups are small, the groups are large, and where there is only one group. The default size is 10000, but other sizes can be provided with the --size=[x,y,z] flag to the benchmarks.

The new definition is slower than the old only when the size of the sublists is much larger than the size of the outer list. To make the newer definition faster in that case, you would simply force the pair (or use a strict pair) from the accumulator. However, this makes the new definition match the old speed in the other cases, which I would imagine are more common.

groupBy

An example replacement for Data.List.groupBy

groupBy alternatives and similar packages

Popular Comparisons

README

groupBy

Tests

Benchmarks