pdf-to-html - pdf2htmlEX shell wrapper for Node.js
https://github.com/alanhk89/pdftohtml
This is a customized package with additional functionality, inherited from @alanhk89's pdftohtml (thanks to the original author).
pdftohtml provides access to pdf2htmlEX via shell in Node.js programs.
Requirements
Quick Setup
- Create a directory for your Node.js app
- Run
npm init -y
- Create app.js in that directory
- Download https://drive.google.com/drive/folders/10ar12wziNA76tGf0KaSLV_DCcljX8ORs?usp=sharing
- Extract the downloaded zip file, then copy pdf2htmlEX.exe and the data folder into your Node.js app directory.
- Run
npm i @dsardar099/pdf-to-html
- You are now ready to write code that converts PDF files to HTML files.
See the documentation example for more information.
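The Quick Setup steps above depend on pdf2htmlEX.exe and the data folder sitting in the app directory. A minimal sketch of a pre-flight check follows; `findMissingSetupFiles` is a hypothetical helper, not part of @dsardar099/pdf-to-html, and it assumes the file names listed in the steps above.

```javascript
var fs = require("fs");
var path = require("path");

// Hypothetical helper (not part of the package): returns the names of the
// Quick Setup items that are missing from the given app directory.
function findMissingSetupFiles(appDir) {
  var required = ["pdf2htmlEX.exe", "data"];
  return required.filter(function (name) {
    return !fs.existsSync(path.join(appDir, name));
  });
}

// Check the current working directory before attempting a conversion.
var missing = findMissingSetupFiles(process.cwd());
if (missing.length > 0) {
  console.error("Missing from app directory: " + missing.join(", "));
}
```

Running this at the top of app.js may give a clearer message than a failed shell call would.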
If you have a Docker environment set up, you can run pdf2htmlEX via Docker instead:
alias pdf2htmlEX="docker run -ti --rm -v ~/pdf:/pdf iapain/pdf2htmlex pdf2htmlEX"
~/pdf on the host computer will be used as the volume.
Installation
via yarn:
yarn add @dsardar099/pdf-to-html
via npm:
npm install @dsardar099/pdf-to-html
Usage
var pdftohtml = require("@dsardar099/pdf-to-html");
var converter = new pdftohtml("test/pdfs/sample.pdf", "sample.html");
// See presets (ipad, default)
// Feel free to create custom presets
// see https://github.com/alanhk89/pdftohtml/blob/master/lib/presets/ipad.js
// convert() returns promise
converter
.convert("ipad")
.then(function () {
console.log("Success");
})
.catch(function (err) {
console.error("Conversion error: " + err);
});
// If you would like to disable printing of the converted html output,
// just call converter.disablePrinting()
converter
.convert()
.then(function () {
converter.disablePrinting();
console.log("Success");
})
.catch(function (err) {
console.error("Conversion error: " + err);
});
// If you would like to make the converted html output content-editable,
// just call converter.enableContentEditable()
converter
.convert()
.then(function () {
converter.enableContentEditable();
console.log("Success");
})
.catch(function (err) {
console.error("Conversion error: " + err);
});
// If you would like to tap into progress then create
// progress handler
converter.progress(function (ret) {
console.log((ret.current * 100.0) / ret.total + " %");
});
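The progress handler above divides by ret.total directly, which would print NaN if a total of 0 were ever reported. A small sketch of a more defensive formatter follows; `formatProgress` is a hypothetical helper, and it assumes only the `{ current, total }` payload shape used in the example above.

```javascript
// Hypothetical helper: turns a progress payload of the shape
// { current, total } into a percentage string with one decimal place.
function formatProgress(ret) {
  // Guard against a missing payload or a zero total.
  if (!ret || !ret.total) {
    return "0 %";
  }
  // Cap at 100 in case current ever exceeds total.
  var percent = Math.min(100, (ret.current * 100.0) / ret.total);
  return percent.toFixed(1) + " %";
}

// Usage with the converter from the example above:
// converter.progress(function (ret) {
//   console.log(formatProgress(ret));
// });
```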
Command line usage
yarn global add @dsardar099/pdf-to-html
@dsardar099/pdf-to-html sample.pdf
You may optionally provide your own filename and preset
@dsardar099/pdf-to-html sample.pdf sample.html ipad
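To illustrate how the three positional arguments could map onto the converter inputs, here is a rough sketch. `parseCliArgs` is hypothetical, and both the derived default output name and the fallback to the "default" preset are assumptions for illustration, not confirmed behavior of the actual command.

```javascript
var path = require("path");

// Hypothetical sketch of mapping CLI arguments to converter inputs.
// Assumptions: the output name defaults to the input base name with an
// .html extension, and the preset defaults to "default".
function parseCliArgs(args) {
  if (args.length < 1) {
    throw new Error("Usage: <input.pdf> [output.html] [preset]");
  }
  var input = args[0];
  var output = args[1] || path.basename(input, path.extname(input)) + ".html";
  var preset = args[2] || "default";
  return { input: input, output: output, preset: preset };
}

// Example: parseCliArgs(process.argv.slice(2)) could then feed
// new pdftohtml(parsed.input, parsed.output).convert(parsed.preset).
```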
Tests
yarn test
NodeJS Support
This library supports Node.js v6+. Anything below v6 may still work but is not tested.
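If you want an app to warn when it runs on an untested Node.js version, a check along these lines may help. `isSupportedNode` is a hypothetical helper, not something the package provides.

```javascript
// Hypothetical helper: returns true when a version string such as "6.17.1"
// (with or without a leading "v") has a major version of 6 or higher.
function isSupportedNode(version) {
  var major = parseInt(String(version).replace(/^v/, "").split(".")[0], 10);
  return major >= 6;
}

// process.versions.node holds the running version without the "v" prefix.
if (!isSupportedNode(process.versions.node)) {
  console.warn("Node.js " + process.versions.node + " is below v6 and untested.");
}
```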