@mkas3/pdf-table-parser
TypeScript icon, indicating that this package has built-in type declarations

1.2.18 • Public • Published

🐸 PDF Table Parser

Simplified parsing of tables from PDF

❔ Why?

PDF Table Parser is a library based on florpor's pdf-table-extractor with built-in types. I couldn't find any ready-made library for parsing tables from pdf, so I rewrote the source code of the library to modern TypeScript, output all types and slightly changed it.

🚀 Install

Using npm

npm install --save @mkas3/pdf-table-parser

Using yarn

yarn add @mkas3/pdf-table-parser

Using pnpm

pnpm add @mkas3/pdf-table-parser

Once the package is installed, you can import the library using import or require approach:

import { extractPdfTable } from '@mkas3/pdf-table-parser';

Example

Example

import fs from 'fs';
import extractPdfTable from 'pdf-table-extractor-ts';

const file = fs.readFileSync('example.pdf');

extractPdfTable(file).then(res => {
  console.log(JSON.stringify(res));
});

API

extractPdfTable(buffer, options)

  • buffer <[ArrayBuffer]> pdf file buffer.
  • options <[Object]>
    • maxEdgesPerPage <?[number]> maximum number of edges to process per page. if defined and number of identified edges surpasses the setting tables will not be processes for the current page.
    • progressFunc <?[function(Object)]> callback to call after each page is processes with the current result object.
  • returns: <[Promise]<[Object]>>

License

BSD License

/@mkas3/pdf-table-parser/

    Package Sidebar

    Install

    npm i @mkas3/pdf-table-parser

    Weekly Downloads

    1

    Version

    1.2.18

    License

    MIT

    Unpacked Size

    163 kB

    Total Files

    7

    Last publish

    Collaborators

    • mkas3