dataflower (data + 🌼)

Timely Dataflow for Haskell

Dataflower is an implementation of a Timely Dataflow-like system written in pure Haskell. Its design is based on the original 2013 Naiad paper from Microsoft Research.

Dataflower leaves out many of Timely Dataflow's and Naiad's capabilities for the time being. The focus is first on making the framework easy to use and correct; scalability will come after that.

What is "Dataflow"?

Dataflow Programming

Dataflow programming, sometimes called datastream programming, is a paradigm in which programs are constructed as directed graphs: communication occurs along edges, and computation happens at vertices. The runtime system takes care of the communication code; the programmer is responsible for building each vertex as a black box and then wiring the vertices together into a flow graph.

Conventionally the flow graph must be acyclic. Microsoft Research's Naiad project took dataflow several steps further, and designed a system that allowed for cyclic directed graphs. This makes it possible to construct dataflow systems with feedback or iteration.

Dataflow vs. Streaming

Why not use Conduit, Streamly, Pipes, or the like? Streaming frameworks concern themselves with the world of I/O: data comes from a source, is transformed through a pipeline, and is passed out through a sink. Their goal is to make writing robust linear pipelines easy, and they do that very well.

Dataflow is concerned exclusively with the world of computation. Input is multiplexed through a single source node, and then flows through the computational graph in parallel.

Here's an example of something that is easy in Dataflow but would be rather unnatural in a streaming framework:

newtype Mean a = Mean a deriving (Eq, Show)

arithMean :: Edge (Mean Int) -> Dataflow (Edge Int)
arithMean next = ...

geomMean :: Edge (Mean Int) -> Dataflow (Edge Int)
geomMean next = ...

fanout :: [Edge a] -> Dataflow (Edge a)
fanout nexts = ...

computation :: TVar (Mean Int) -> TVar (Mean Int) -> TVar [Int] -> Dataflow (Edge Int)
computation arithTV geomTV seenTV = do
  arithOutput <- outputTVar const arithTV  -- keep only the latest arithmetic mean
  geomOutput  <- outputTVar const geomTV   -- keep only the latest geometric mean
  seen        <- outputTVar (:) seenTV     -- prepend each input to the list

  arith <- arithMean arithOutput
  geom  <- geomMean geomOutput

  fanout [arith, geom, seen]

Dataflow output gets placed in shared memory. The first TVar will contain the arithmetic mean of the inputs provided to computation. The second TVar will contain the geometric mean. The third TVar will contain every input, most recent first (since (:) prepends each new input to the list).

This last one is important: since dataflows are computational graphs, it's completely natural to "skip" a "stage" -- seen doesn't require anything special to get a copy of the input as-is because we don't have a single pipeline.
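To make this concrete, here's a sketch of how computation might be driven from IO. The compile and execute entry points below are assumed names for "build the graph" and "push a batch of inputs through the source node"; they may not match Dataflower's actual API, so check the Haddock documentation for the real entry points.

import Control.Concurrent.STM (newTVarIO, readTVarIO)

main :: IO ()
main = do
  arithTV <- newTVarIO (Mean 0)
  geomTV  <- newTVarIO (Mean 0)
  seenTV  <- newTVarIO []

  -- Assumed API: compile the graph description, then feed it inputs.
  program <- compile (computation arithTV geomTV seenTV)
  execute [1, 2, 3, 4] program

  readTVarIO arithTV >>= print  -- latest arithmetic mean
  readTVarIO seenTV  >>= print  -- every input, most recent first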

Now imagine that we also want to keep track of the list of arithmetic residues -- that is, the difference between the current input and the most recent arithmetic mean.

It's true we could do this by modifying arithMean, so that instead of doing one thing well it does two things adequately. The cost is greater complexity and more elaborate testing requirements.

We could also create a residue :: Edge Int -> Dataflow (Edge (Mean Int), Edge Int) vertex that keeps the most recent mean and passes on the absolute difference between its input and that mean. But notice something: nothing we just said has anything to do with arithmetic means. This residue vertex is generic and can be used with the output of any averaging vertex.
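Here's a sketch of what residue could look like. The statefulVertex and statelessVertex constructors, along with readState and writeState, are illustrative names rather than Dataflower's actual primitives; the point is the shape of the vertex, not the exact API.

data ResidueIn = NewMean (Mean Int) | NewValue Int

residue :: Edge Int -> Dataflow (Edge (Mean Int), Edge Int)
residue next = do
  -- A single stateful vertex holds the most recent mean; a sum type lets
  -- both logical inputs share that one piece of state.
  core <- statefulVertex (Mean 0) $ \state t msg ->
    case msg of
      NewMean m  -> writeState state m
      NewValue x -> do
        Mean m <- readState state
        send next t (abs (x - m))

  -- Adapt the sum-typed input into the two typed edges we promised.
  meanIn  <- statelessVertex $ \t m -> send core t (NewMean m)
  valueIn <- statelessVertex $ \t x -> send core t (NewValue x)
  return (meanIn, valueIn)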

Essential Concepts

Dataflow programming in Haskell can be thought of as similar to continuation-passing style (CPS). Unlike normal CPS, where the continuation is passed directly to a function, in dataflow programming a typed pointer is passed, and the continuation is invoked using the send :: Edge i -> Timestamp -> i -> Dataflow () operator.
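In miniature, and assuming an illustrative statelessVertex constructor (a vertex defined by a per-datum callback, returning that vertex's input Edge):

-- A vertex never returns a value to its caller: it invokes its continuation
-- by sending to its downstream Edge, forwarding the Timestamp untouched.
double :: Edge Int -> Dataflow (Edge Int)
double next = statelessVertex $ \t x -> send next t (x * 2)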

Timestamps

The Timestamp is what makes Timely Dataflow timely. Timestamps causally separate collections of inputs: if t1 and t2 are Timestamps and t2 > t1, then data associated with t2 may have resulted from data at t1, while no data associated with t1 can have resulted from data at t2.

Timestamps, rather than being scalars, are vector-valued in timely dataflow. Entering a feedback loop in a timely dataflow graph adds a dimension to the Timestamp, and exiting it removes a dimension. This happens automatically and is opaque to the programmer.

Taken together, these two attributes allow timely dataflow to identify the next inputs to process to ensure the system makes progress towards a result.

Vertices & Edges

Computation happens at a vertex. Much of your work in Dataflower will consist of writing custom vertices to model your computations. The complexity varies depending upon how rich your vertex's behavior is. None of that richness is visible outside your vertex, though -- all your upstream knows is that it has an Edge a to send to, and your downstream doesn't even know you exist.

An Edge a is totally opaque. You get one when you define a vertex, and the only thing you can do is send to it.
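One payoff of this opacity is that vertex constructors compose mechanically. The helper below isn't part of Dataflower -- it's ordinary monadic plumbing using only the types above -- but it shows the wiring pattern: graphs are built sink-first, with each constructor consuming its downstream Edge and producing its own input Edge.

andThen :: (Edge b -> Dataflow (Edge a))  -- upstream vertex constructor
        -> (Edge c -> Dataflow (Edge b))  -- downstream vertex constructor
        -> (Edge c -> Dataflow (Edge a))
andThen up down out = down out >>= up

For instance, with the hypothetical double vertex sketched earlier, double `andThen` double yields a chain of two vertices that quadruples each input.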

Support

Assistance is available via the GitHub issue tracker for this project or in #Haskell on [FP Slack](functionalprogramming.slack.com). Paid support is available -- contact [email protected] for details.

Features

  • [x] Acyclic graph support
  • [x] Single threaded execution
  • [ ] Persistable checkpoints
  • [ ] Cyclic graph support
  • [ ] Multithreaded execution
  • [ ] Distributed execution