Search results

1000+ packages found

The fast, flexible & elegant library for parsing and manipulating HTML and XML.

published version 1.0.0, 9 months ago; 17673 dependents; licensed under MIT
44,294,428

A specification compliant robots.txt parser with wildcard (*) matching support.

published version 3.0.1, 2 years ago; 79 dependents; licensed under MIT
4,857,342

Browserless scraper module

published version 5.0.1, 9 months ago; 15 dependents; licensed under GPL-3.0-or-later
403,070

JavaScript SDK for Firecrawl API

published version 1.24.0, 8 days ago; 25 dependents; licensed under MIT
269,207

Apify API client for JavaScript

published version 2.12.3, 16 hours ago; 32 dependents; licensed under Apache-2.0
230,327

Node.js scraper module for Open Graph and Twitter Card info

published version 6.10.0, 3 days ago; 72 dependents; licensed under MIT
286,811

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 27 dependents; licensed under Apache-2.0
178,389

A library to easily scrape metadata from an article on the web using Open Graph, JSON+LD, regular HTML metadata, and a series of fallbacks.

published version 5.46.11, 21 days ago; 74 dependents; licensed under MIT
167,607

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 4 dependents; licensed under Apache-2.0
131,466

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 1 dependent; licensed under Apache-2.0
129,271

A lazy way to download images from DuckDuckGo search results in bulk

published version 0.1.11, 3 years ago; 0 dependents; licensed under MIT
127,797

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 6 dependents; licensed under Apache-2.0
150,815

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 7 dependents; licensed under Apache-2.0
145,688

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 3 dependents; licensed under Apache-2.0
134,366

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 1 dependent; licensed under Apache-2.0
128,957

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 5 dependents; licensed under Apache-2.0
132,406

Templates for the crawlee projects

published version 3.13.2, 17 days ago; 1 dependent; licensed under Apache-2.0
129,409

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 50 dependents; licensed under Apache-2.0
128,472

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.2, 17 days ago; 1 dependent; licensed under Apache-2.0
126,648

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.4.0, 24 days ago; 49 dependents; licensed under Apache-2.0
75,680