All Versions
19
Latest Version
Avg Release Cycle
171 days
Latest Release
1462 days ago
Changelog History
Page 1
Changelog History
Page 1
-
v0.6.2 Changes
April 18, 2020- โ Add the monad transformer
ScraperT
.
- โ Add the monad transformer
-
v0.6.1 Changes
January 19, 2020- ๐ Support GHC 8.8.
-
v0.6.0 Changes
February 18, 2019๐ฅ Breaking Changes
anySelector
now captures text nodes. This causes different results when used
with a plural scraper (e.g.chroots
). Usage with a singular scraper (e.g.
chroot
) should be unaffected.- The dependency on
curl
has been replaced withhttp-client
and
http-client-tls
. This has the following observable changes.scrapeURLWithOpts
is removed.- The
Config
type used withscrapeURLWithConfig
no longer contains a list
of curl options. Instead it now takes aMaybe Manager
fromhttp-client
. - The
Decoder
function type now takes in aResponse
type from
http-client
. scrapeURL
will now throw an exception if there is a problem connecting to
a URL.
Other Changes
- โ Remove
Ord
constraint from public APIs. - โ Add
atDepth
operator which allows for selecting nodes at a specified depth
in relation to another node (#21). - ๐ Fix issue selecting malformed HTML where
"a" // "c"
would not match
<a><b><c></c></a></b>
. - โ Add
textSelector
for selecting text nodes. - โ Add
SerialScraper
type and associated primitives (#48).
-
v0.5.1 Changes
April 25, 2017 -
v0.5.0 Changes
February 05, 2017- ๐ฆ Split
scalpel
into two packages:scalpel
andscalpel-core
. The latter
๐ does not provide networking support and does not depend on curl.
- ๐ฆ Split
-
v0.4.1 Changes
January 19, 2017- โ Added
notP
attribute predicate.
- โ Added
-
v0.4.0 Changes
October 17, 2016- โ Add the
chroot
tricks (#23 and #25) to README.md and added examples.- Fix backtracking that occurs when using
guard
andchroot
. - Fix bug where the same tag may appear in the result set multiple times.
- Fix backtracking that occurs when using
- ๐ Performance optimizations when using the (//) operator.
- ๐ Make Scraper an instance of MonadFail. Practically this means that failed
pattern matches in<-
expressions within a do block will evaluate to mzero
instead of throwing an error and bringing down the entire script. - Pluralized scrapers will now return the empty list instead mzero when there
are no matches. - โ Add the
position
scraper which provides the index of the current sub-tree
within the context of achroots
's do-block.
- โ Add the
-
v0.3.1 Changes
May 28, 2016- โ Added the
innerHTML
andinnerHTMLs
scraper. - โ Added the
match
function which allows for the creation of arbitrary
attribute predicates. - ๐ Fixed build breakage with GHC 8.0.1.
- โ Added the
-
v0.3.0 Changes
January 31, 2016- โ Added benchmarks and many optimizations.
- ๐ The
select
method is removed from the public API. - Many methods now have a constraint that the string type parametrizing
๐ท TagSoup's tag type now must be order-able. - โ Added
scrapeUrlWithConfig
that will hopefully put an end to multiplying
scrapeUrlWith*
methods. - 0๏ธโฃ The default behaviour of the
scrapeUrl*
methods is to attempt to infer the
character encoding from theContent-Type
header.
-
v0.3.0.1 Changes
January 31, 2016- ๐ Make tag and attribute matching case-insensitive.