keywords:crawl - npm search

node-spider

Generic web crawler powered by Node.js

flesler

published version 1.4.1, 8 years ago5 dependents licensed under $BSD-2-Clause

280

read-art

Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.

tjatse

published version 0.5.6, 7 years ago2 dependents licensed under $MIT

215

images-downloader

A Node.js module for downloading a single image or multiple images to disk from a given Url (checking if url exist and detecting image type)

tekdreams

published version 1.0.3, 8 years ago1 dependents licensed under $MIT

219

manga-lib

A library for scraping manga from various websites.

zcrossoverz

published version 1.1.2, 2 years ago0 dependents licensed under $MIT

200

spider-node

爬虫工具,根据传入的配置和规则爬取数据

weoil

published version 1.1.4-0, 5 years ago0 dependents licensed under $ISC

200

bas

Behaviour Assertion Sheets: CSS-like declarative syntax for client-side integration testing and quality assurance.

cgiffard

published version 0.1.1, 8 years ago4 dependents licensed under $BSD-2-Clause

192

filespy

Spy on files

aleclarson

published version 1.2.4, 4 years ago1 dependents licensed under $MIT

189

sajari-website

Website extensions for the Sajari API. Automatically index site content, add user profiles, render search and recommendations, etc.

wwalser

published version 0.12.0, 2 years ago1 dependents licensed under $MIT

178

js-spider

JSpider 3 is a Chrome DevTools crawler framework that includes full crawler support. JSpider 3 是在 Chrome Devtools 中进行爬虫的爬虫框架, 这个框架包括了完整的爬虫支持。

konghayao

published version 3.2.3, 3 years ago0 dependents licensed under $Apache-2.0

180

huntsman

Super configurable async web spider

missinglink

published version 0.3.0, 9 years ago0 dependents licensed under $MIT

155

SyphonX is a tool that extracts data from HTML data, transforming it into JSON of any shape or size. It combines the power of CSS Selectors and jQuery, Regular Expressions, and Javascript into a declarative template format to elegantly solve the simplest