HTTP library specifically designed for crawling the web. Built-in caching and per-domain queueing
published version 0.7.7, 5 years agoNode.js module that recursively crawls a website's sitemap and returns a stream of URLs
published version 0.11.0, 5 years agoThe simple and fast crawling framework. So you can focus on scraping.
published version 0.4.1, 5 years agoReadable stream of the Body of every object in an S3 bucket.
published version 1.3.2, 5 years ago