@dsardar099/pdf-to-html

1.0.6 • Public • Published

pdf-to-html - pdf2htmlEx shell wrapper for Node.js

This is a customized package with more functionality inherited from https://github.com/alanhk89/pdftohtml

Thanks to @alanhk89

pdftohtml provides access to pdf2htmlEX via shell in node.js programs.

Requirements

Quick Setup

  1. Create a directory for you node js app
  2. Do npm init -y
  3. crate app.js in that directory
  4. Download https://drive.google.com/drive/folders/10ar12wziNA76tGf0KaSLV_DCcljX8ORs?usp=sharing
  5. Extract the downloaded zip file and copy pdf2htmlEX.exe and data folder. Paste those 2 thing to your node js app directory.
  6. Do npm i @dsardar099/pdf-to-html
  7. You are ready to go for writing code to convert PDF files to HTML files.

See documentation example for more information.

If you've docker env setup, just install it via docker

alias pdf2htmlEX="docker run -ti --rm -v ~/pdf:/pdf iapain/pdf2htmlex pdf2htmlEX"

~/pdf on host computer will be used as volume

Installation

via yarn:

yarn add @dsardar099/pdf-to-html

via npm:

npm install @dsardar099/pdf-to-html

Usage

var pdftohtml = require("@dsardar099/pdf-to-html");
var converter = new pdftohtml("test/pdfs/sample.pdf", "sample.html");

// See presets (ipad, default)
// Feel free to create custom presets
// see https://github.com/alanhk89/pdftohtml/blob/master/lib/presets/ipad.js
// convert() returns promise
converter
  .convert("ipad")
  .then(function () {
    console.log("Success");
  })
  .catch(function (err) {
    console.error("Conversion error: " + err);
  });

// If you would like to disable printing of the converted html output,
// just call converter.disablePrinting()
converter
  .convert()
  .then(function () {
    converter.disablePrinting();
    console.log("Success");
  })
  .catch(function (err) {
    console.error("Conversion error: " + err);
  });

// If you would like to enable enableContentEditable of the converted html output,
// just call converter.enableContentEditable()
converter
  .convert()
  .then(function () {
    converter.enableContentEditable();
    console.log("Success");
  })
  .catch(function (err) {
    console.error("Conversion error: " + err);
  });

// If you would like to tap into progress then create
// progress handler
converter.progress(function (ret) {
  console.log((ret.current * 100.0) / ret.total + " %");
});

Command line usage

yarn global add @dsardar099/pdf-to-html
@dsardar099/pdf-to-html sample.pdf

You may optionally provide your own filename and preset

@dsardar099/pdf-to-html sample.pdf sample.html ipad

Tests

$ yarn test

NodeJS Support

This library support nodejs v6+. Anything below v6 may still work but not tested.

Dependents (0)

Package Sidebar

Install

npm i @dsardar099/pdf-to-html

Weekly Downloads

10

Version

1.0.6

License

ISC

Unpacked Size

10.5 kB

Total Files

9

Last publish

Collaborators

  • dsardar099