@mkas3/pdf-table-parser
1.2.18 • Public • Published

🐸 PDF Table Parser

Simplified parsing of tables from PDF

❔ Why?

PDF Table Parser is a library based on florpor's pdf-table-extractor, with built-in types. I couldn't find any ready-made library for parsing tables from PDF files, so I rewrote the library's source code in modern TypeScript, exported all types, and made a few small changes.

🚀 Install

Using npm

npm install --save @mkas3/pdf-table-parser

Using yarn

yarn add @mkas3/pdf-table-parser

Using pnpm

pnpm add @mkas3/pdf-table-parser

Once the package is installed, you can load the library with either import or require:

import { extractPdfTable } from '@mkas3/pdf-table-parser';

Example

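import fs from 'fs';
import { extractPdfTable } from '@mkas3/pdf-table-parser';

// Read the PDF file into memory.
const file = fs.readFileSync('example.pdf');

// Extract the tables and print the result as JSON.
extractPdfTable(file)
  .then((res) => {
    console.log(JSON.stringify(res));
  })
  .catch((err) => {
    console.error(err);
  });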
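
The API section lists the first argument as an ArrayBuffer, while fs.readFileSync returns a Node.js Buffer. Whether the library accepts a Buffer directly is not stated here, so the following is only a minimal sketch of one way to get a standalone ArrayBuffer if that turns out to be needed. The helper name toArrayBuffer is hypothetical and not part of the package.

```typescript
// Hypothetical helper (not part of @mkas3/pdf-table-parser).
// Copies the bytes of any Uint8Array (a Node.js Buffer is one) into a
// fresh ArrayBuffer of exactly the same length. Copying avoids handing
// over a shared memory pool that may be larger than the file itself.
function toArrayBuffer(bytes: Uint8Array): ArrayBuffer {
  const out = new ArrayBuffer(bytes.byteLength);
  new Uint8Array(out).set(bytes);
  return out;
}

// Assumed usage with the example above:
//   const file = fs.readFileSync('example.pdf');
//   extractPdfTable(toArrayBuffer(file)).then(...);
```

The copy costs one extra allocation per file, which is usually negligible next to the parsing work.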

API

extractPdfTable(buffer, options)

  • buffer <[ArrayBuffer]> PDF file buffer.
  • options <[Object]>
    • maxEdgesPerPage <?[number]> Maximum number of edges to process per page. If defined and the number of identified edges exceeds this setting, tables will not be processed for the current page.
    • progressFunc <?[function(Object)]> Callback invoked after each page is processed, with the current result object.
  • returns: <[Promise]<[Object]>>
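
To illustrate how the two options fit together, here is a small sketch. The ExtractOptions and Extractor types are assumptions derived from the list above, and the package's own exported types may be named or shaped differently. The helper extractWithProgress is hypothetical. It takes the extractor as a parameter so the sketch does not depend on the package being installed.

```typescript
// Shapes assumed from the API list above; not the package's actual types.
interface ExtractOptions {
  maxEdgesPerPage?: number;
  progressFunc?: (result: object) => void;
}

type Extractor = (
  buffer: ArrayBuffer,
  options?: ExtractOptions,
) => Promise<object>;

// Hypothetical helper: runs an extractor with an edge limit and counts
// how many times the progress callback fires.
async function extractWithProgress(
  extract: Extractor,
  buffer: ArrayBuffer,
  maxEdgesPerPage: number,
): Promise<{ result: object; progressCalls: number }> {
  let progressCalls = 0;
  const result = await extract(buffer, {
    maxEdgesPerPage,
    progressFunc: () => {
      progressCalls += 1;
    },
  });
  return { result, progressCalls };
}
```

Assuming extractPdfTable matches the signature listed above, it could be passed as the first argument, for example extractWithProgress(extractPdfTable, buffer, 5000). The limit of 5000 is an arbitrary illustrative value, not a recommended setting.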

License

BSD License

Package details

  • Dependencies: 1
  • Dev dependencies: 9
  • Weekly downloads: 13
  • Version: 1.2.18
  • License: MIT
  • Unpacked size: 163 kB
  • Total files: 7
  • Collaborators: mkas3