Popularity
6.9
Declining
Activity
0.0
Stable
21
3
5
Monthly Downloads: 0
Programming language: Haskell
License: MIT License
punkt alternatives and similar packages
Based on the "Natural Language Processing" category.
Alternatively, view punkt alternatives based on common mentions on social networks and blogs.
-
minimorph
English spelling functions with an emphasis on simplicity. Originally by https://github.com/kowey. -
sentiwordnet-parser
Parser for the [SentiWordNet](http://sentiwordnet.isti.cnr.it/) tab-separated file
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
Promo
getstream.io

Do you think we are missing an alternative of punkt or a related project?
Popular Comparisons
README
punkt
Multilingual unsupervised sentence tokenization with Punkt.
Usage
Note that abbreviations are detected at run time without the aid of a pre-built abbreviation list:
import Data.Text (Text, pack)
import NLP.Punkt (split_sentences)
corpus :: Text
corpus = pack "Look, Ma! The quick brown Mr. T. rex swallowed the lazy dog. \
\It really did!"
main :: IO ()
main = mapM_ print (split_sentences corpus)
yields:
"Look, Ma!"
"The quick brown Mr. T. rex swallowed the lazy dog."
"It really did!"
References
Kiss, Tibor, and Jan Strunk. "Unsupervised multilingual sentence boundary detection." Computational Linguistics 32.4 (2006): 485-525.
TODO
- parallelize
- modularize tokenization
- custom tokenization rules
- needs to go fasterer