Popularity

9.2

Stable

Activity

0.0

Stable

Stars 144

Watchers 8

Forks 5

Last Commit over 7 years ago

Monthly Downloads: 32

Programming language: Haskell

License: MIT License

Tags: Data

Latest version: v0.4.0.3

rei alternatives and similar packages

Based on the "Data" category.
Alternatively, view rei alternatives based on common mentions on social networks and blogs.

semantic-source

10.0 9.1 rei VS semantic-source

Parsing, analyzing, and comparing source code across many languages
lens

10.0 6.8 rei VS lens

Lenses, Folds, and Traversals - Join us on web.libera.chat #haskell-lens

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

Promo workos.com

hnix

9.9 6.8 rei VS hnix

A Haskell re-implementation of the Nix expression language
code-builder

9.8 0.0 rei VS code-builder

Packages for defining APIs, running them, generating client code and documentation.
text

9.8 8.4 rei VS text

Haskell library for space- and time-efficient operations over Unicode text.
compendium-client

9.7 0.0 rei VS compendium-client

Mu (μ) is a purely functional framework for building micro services.
Frames

9.7 6.1 rei VS Frames

Data frames for tabular data.
unordered-containers

9.7 5.0 rei VS unordered-containers

Efficient hashing-based container types
massiv

9.7 5.9 rei VS massiv

Efficient Haskell Arrays featuring Parallel computation
cassava

9.7 4.7 rei VS cassava

A CSV parsing and encoding library optimized for ease of use and high performance
holmes

9.6 0.0 rei VS holmes

A reference library for constraint-solving with propagators and CDCL.
binary

9.5 4.3 rei VS binary

Efficient, pure binary serialisation using ByteStrings in Haskell.
resource-pool

9.5 0.0 rei VS resource-pool

A high-performance striped resource pooling implementation for Haskell
critbit

9.5 0.0 rei VS critbit

A Haskell implementation of crit-bit trees.
hashable

9.5 5.7 rei VS hashable

A class for types that can be converted to a hash value
alfred-margaret

9.5 3.5 rei VS alfred-margaret

Fast Aho-Corasick string searching
refined

9.5 1.5 rei VS refined

Refinement types with static checking
primitive

9.5 5.9 rei VS primitive

This package provides various primitive memory-related operations.
diskhash

9.4 0.0 rei VS diskhash

Diskbased (persistent) hashtable
hashtables

9.4 1.0 rei VS hashtables

Mutable hash tables for Haskell, in the ST monad
caledon

9.4 0.0 rei VS caledon

higher order dependently typed logic programing
discrimination

9.4 3.1 rei VS discrimination

Fast linear time sorting and discrimination for a large class of data types
higgledy

9.4 2.2 rei VS higgledy

Higher-kinded data via generics
network-msgpack-rpc

9.4 rei VS network-msgpack-rpc

A MessagePack-RPC Implementation
data-msgpack

9.4 rei VS data-msgpack

A Haskell implementation of MessagePack
jump

9.4 0.0 rei VS jump

Jump start your Haskell development
json-autotype

9.4 0.0 rei VS json-autotype

Automatic Haskell type inference from JSON input
aeson-qq

9.4 3.6 rei VS aeson-qq

JSON quasiquoter for Haskell
reflection

9.3 4.8 rei VS reflection

Reifies arbitrary Haskell terms into types that can be reflected back into terms
dependent-map

9.3 0.0 rei VS dependent-map

Dependently-typed finite maps (partial dependent products)
IORefCAS

9.3 3.5 rei VS IORefCAS

A collection of different packages for CAS based data structures.
dependent-sum

9.3 4.3 rei VS dependent-sum

Dependent sums and supporting typeclasses for comparing and displaying them
cereal

9.3 0.0 rei VS cereal

A binary serialization library
certificate

9.3 0.0 rei VS certificate

Certificate and Key Reader/Writer in haskell
audiovisual

9.3 3.5 rei VS audiovisual

Extensible records, variants, structs, effects, tangles
protobuf

9.2 2.6 rei VS protobuf

An implementation of Google's Protocol Buffers in Haskell.
scientific

9.2 0.0 rei VS scientific

Arbitrary-precision floating-point numbers represented using scientific notation
orgmode-parse

9.2 0.0 rei VS orgmode-parse

Attoparsec parser combinators for parsing org-mode structured text!
streaming

9.2 0.0 rei VS streaming

An optimized general monad transformer for streaming applications, with a simple prelude of functions
bifunctors

9.2 5.6 rei VS bifunctors

Haskell 98 bifunctors, bifoldables and bitraversables
safecopy

9.2 3.1 rei VS safecopy

An extension to Data.Serialize with built-in version control
avro

9.2 4.2 rei VS avro

Haskell Avro Encoding and Decoding Native Support (no RPC)
uuid

9.1 2.9 rei VS uuid

A Haskell library for creating, printing and parsing UUIDs
MemoTrie

9.1 2.7 rei VS MemoTrie

Trie-based memo functions
text-icu

9.1 6.1 rei VS text-icu

This package provides the Haskell Data.Text.ICU library, for performing complex manipulation of Unicode text.
b-tree

9.1 1.8 rei VS b-tree

Haskell on-disk B* tree implementation
witherable

9.1 6.4 rei VS witherable

Filter with effects
stdio

9.1 1.8 rei VS stdio

Haskell Standard Input and Output
tables

9.1 0.0 rei VS tables

Deprecated because of
uuid-types

9.1 2.9 rei VS uuid-types

A Haskell library for creating, printing and parsing UUIDs

Do you think we are missing an alternative of rei or a related project?

Add another 'Data' Package

Popular Comparisons

README

Process lists easily with `rei`

While originally rei was not intended to be an abbreviation, one may think of it as of the Row Editing Interface. Working with lists is an important part of many people, including data-scientists and bioinformaticians, and rei aims to make that experience more pleasant.

Installation

rei can be easily installed from hackage with cabal :

cabal update
cabal install rei

# $PATH should contain ~/.cabal/bin directory
# (example for bash):
PATH=$PATH:~/.cabal/bin
export PATH

Getting started

rei "rname x y -> rname y x" example.csv
rei merge example_left.csv example_right.csv
rei unite example_top.ssv example_bottom.ssv
rei melt2 example_condensed.csv
rei condense2 example_melted.csv
rei join example_foo.ssv example_bar.ssv
rei subtract minuend.csv subtrahend.csv
rei transpose example_matrix.ssv
rei filter "_ _ chr => chr ~ chr21" elements.tsv
rei reduce "chr => chr ~ chrM" exons.bed
rei distinct "chr _ type => type chr" transcripts.gtf

Defining the rule

The main idea of the rei is to apply the rule over the lines of the file. The rule should consist of two parts — before and after — separated by the arrow sign («->»). The before part of the rule describes the fields (columns) in one record (line) in the initial file. The after part of the rule describes the desired format of the output.

The arrow sign should be surrounded with spaces. Like this: .. -> ... The fields in the rule should be surrounded with spaces too. The field delimiter in the output file is the same as in the input file by default, however it's possible to change it via the option -g, or --newdelim. (It's easy to remember, since -f is the flag to set the delimiter in the input file.)

Providing the file

There's several ways to provide rei with the content of the file. The first one is the-most-obvious-way-you-can-think-of: just provide the path to the file. Sometimes it is helpful to use process substitution. And if there's a need to pipe the content, just write a dash («-»). Well, here's the code:

> rei "x -> x" 0.ssv
...
> rei "x -> x" <(cat 0.ssv)
...
> cat 0.ssv | rei "x -> x" -
...

Simple examples

Let's use a small sample file with spaces as delimiters for these examples (saved as 0.ssv):

A B C D E
F G H I J
K L M N O
P Q R S T
U V W X Y

This is how easily we can address the columns:

> rei "a b c -> c b a" 0.ssv
C B A
H G F
M L K
R Q P
W V U

The columns are now in the reversed order.

We can extract the columns that we need:

> rei "a b c -> b" 0.ssv
B
G
L
Q
V

It is possible to define only columns needed:

> rei "a b -> a b" 0.ssv
A B
F G
K L
P Q
U V

You may want to keep the rest of the columns, here's the code for that:

> rei "a b ... -> a ..."
A C D E
F H I J
K M N O
P R S T
U W X Y

The beauty is that one may (and sometimes should) give columns descriptive titles. And that is great in so many ways, as it increases readability, productivity, descriptiveness, maintainability and awareness of what's happening with all that list processing. See some real-world examples below.

Keywords

Delimiters

You can define a delimiter (-f, or --delim, for the input file and -g, or --newdelim, for the output file). It's important to emphasize that only one-character long delimiters are used. Tabulation («\t») is considered one-character too. If multicharacter literal is provided, rei uses its first symbol as a delimiter.

For some common file formats rei doesn't require a delimiter to be provided individually:

.ssv → space (' '),
.csv → comma (','),
.tsv → tab ('\t'),
.txt → space (' '),
.list → space (' '),
.sam, .vcf, .bed, .gff, .gtf → tab ('\t'),

The flag -g is powerful as it allows for fast format conversion. That's how rei may be used to convert from .ssv to .csv:

> rei -g ","  "... -> ..." 0.ssv
A,B,C,D,E
F,G,H,I,J
K,L,M,N,O
P,Q,R,S,T

As you see, rei guessed the delimiter in the input file by its extension — space-separated values. The output won't change in the example above if -f " " is provided.

Skipping lines

Sometimes there is a need to cut out the header of the file or several lines in its end. It's generally accomplished by combining head and/or tail programs, piping, etc. Since rei is designed for easy list processing, such feature is implemented here. There are flags to define the number of lines to skip in the beginning (--skip, or -s) or in the end (--omit, or -t) of the file.

> rei -s 1 -t 2 "f g h i j -> f h j" 0.ssv
F H J
K M O

Enumerating lines

Sometimes it's handy to have line numbers in the data file. For that purpose rei offers -n flag (or --enum) which let the user treat the first variable in the rule as a line number (enumeration starts with 1):

> rei -n "# _ _ _ d -> d d #" 0.ssv
D D 1
I I 2
N N 3
S S 4
X X 5

Addressing columns with numbers

It happens that the columns in the file should be addressed with their indices. For those cases rei provides -a — from awk-like — flag (or --colnum). When using rei -a no before part of the rule should be provided. Please, note that the arrow -> should be preceded by a space in this case:

> rei -a ' -> 0 3' 0.ssv
A D
F I
K N
P S
U X

It is recommended to use -a with the -n flag so that the first column can be referred to as 1 and the line number as 0:

> rei -an ' -> 1' 0.ssv
A
F
K
P
U

Magic rules

There's are some common tasks that one may want to do with lists and tables, and it seems convenient to include them in rei: melt2, condense2, merge, unite, join, subtract, filter, reduce, distinct. Each magic rule has its own syntax.

Melting and condensing

TODO

Merging

Here, to merge several (typically two) lists means to get the data together. With merge one can add new columns. If the length of two lists (or tables) differs, the shortest possible list is returned. rei cares, as usually, about the delimiters, but not about finding and reassorting rows when data is being merged.

> rei merge 0.ssv <(rei -s 1 "a -> a" 0.ssv)
A B C D E F
F G H I J K
K L M N O P
P Q R S T U

Uniting

Uniting, or concatenating, several files can be achieved with unite rule. This rule has a synonym: concatenate, or concat for short. While simple file concatenation can be achieved using UNIX cat tool, rei unite <...> has to acknowledge the delimiter symbol (which should be the same for all input files) and can change the delimiter symbol for the whole output or skip / omit lines.

> rei unite 0.ssv <(head -n 1 0.ssv)
A B C D E
F G H I J
K L M N O
P Q R S T
U V W X Y
A B C D E

Joining

Another useful thing is finding common elements in multiple lists. rei allows that with join. (In most cases the order of the files provided does not matter. However, if the first file contains duplicates, so will the result.)

Let's prepare a file to join with 0.ssv and save it as 01.ssv.

A B C D E
K L M N O
X X X X X

The code for join is straightforward:

> rei -g ',' join 0.ssv 01.ssv
A,B,C,D,E
K,L,M,N,O

Retrieving unique data with subtr

Finding differences between multiple lists with a clear and concise syntax is not a trivial task. To deal with this, rei offers a magic rule called subtract (or subtr for short). It behaves exactly as it is titled: takes the first file and removes each row in it only if the row is present in any of the following files.

> tail -n 1 0.ssv > 02.ssv
> rei subtr 0.ssv 01.ssv 02.ssv
F G H I J
P Q R S T

Tranposing data

When you need to transpose the list, you can just do it with rei. It can be beautifully demonstrated for the following matrix (1.ssv):

11 12 13 14 15
21 22 23 24 25
31 32 33 34 35
41 42 43 44 45
51 52 53 54 55

> rei -g ',' transpose 1.ssv
11,21,31,41,51
12,22,32,42,52
13,23,33,43,53
14,24,34,44,54
15,25,35,45,55

Filtering data

For selecting lines that meet some condition use filter rule. Its syntax is simple:

> rei filter "a b => a ~ A" 0.ssv
A B C D E

The filter word should be followed by a rule consisting of two parts — before and patern. It is similar to the standard rule in rei, but in the filter case these two parts should be separated by a fat arrow (=>). The pattern can be defined as field_name ~ expression. One may use a regular expression as an expression in the pattern part of the rule.

You can use reduce rule for negative filtering:

> rei reduce "a b => a ~ A|U" 0.ssv
F G H I J
K L M N O
P Q R S T

Selecting distinct lines

Two lines are called distinct here if they are consecutive lines and have the same value in some fields.

Let's prepare a file 2.ssv:

! @ ?
! * *
? * ?

Then select lines that are distinct in terms of the second field:

> rei distinct "_ 2 _ => 2" 2.ssv
! @ ?
! * *

Sophisticated examples

Skip rownames and colnames:

> rei --skip 1 "rownames ... -> ..." example.ssv

It's easy to merge several files, and turn the output to .csv:

> rei -f ' ' -g ',' unite <(rei -t 3 "a b c -> a c" 0.ssv) <(rei -s2 "x y z -> y z" 1.ssv)
A,C
F,H
32,33
42,43
52,53

Real-world examples

TODO

.bam files stats
.bed files: counting elements
date and smth else
uniting data
merging data

Finding files that were not downloaded

You were downloading a set of fastq files from a list files.list when the connection was interrupted. It is handy to use rei to generate a list a files that were not downloaded:

> ls *fastq.gz > downloaded.list
> rei subtract files.list downloaded.list > to_download.list

Notes

Errors and warnings

rei tries to be friendly to the user. For example, when there's a field variable in the right part of the rule that is not present in the left part, rei hides implementation details behind the user-friendly message, trying to guess that Something's wrong with the rule...

TODO

Dev

rei is written in Haskell, uses regular expressions to parse the rule and Attoparsec to parse the file provided.

Requests

[x] guessing delimiters for "bioinformatic" formats, like: .sam, .vsf, etc.
[x] guessing delimiters for more formats: .bed, .gff, .gtf

rei

Process lists easily

rei alternatives and similar packages

Popular Comparisons

README

Process lists easily with rei

Installation

Getting started

Defining the rule

Providing the file

Simple examples

Keywords

Delimiters

Skipping lines

Enumerating lines

Addressing columns with numbers

Magic rules

*Melt*ing and *condens*ing

*Merg*ing

*Unit*ing

*Join*ing

Retrieving unique data with subtr

*Tranpos*ing data

*Filter*ing data

Selecting distinct lines

Sophisticated examples

Real-world examples

Finding files that were not downloaded

Notes

Errors and warnings

Dev

Requests

Process lists easily with `rei`

Melting and condensing

Merging

Uniting

Joining

Tranposing data

Filtering data