Developer-friendly & type-safe Typescript SDK specifically catered to leverage @abbyy-sdk/document-ai API.
Document AI API: A modern, simple, and easy-to-integrate OCR and document processing API service
The SDK can be installed with either npm, pnpm, bun or yarn package managers.
npm add @abbyy-sdk/document-ai
pnpm add @abbyy-sdk/document-ai
bun add @abbyy-sdk/document-ai
yarn add @abbyy-sdk/document-ai zod
# Note that Yarn does not install peer dependencies automatically. You will need
# to install zod as shown above.
[!NOTE] This package is published with CommonJS and ES Modules (ESM) support.
For supported JavaScript runtimes, please consult RUNTIMES.md.
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
This SDK supports the following security scheme globally:
Name | Type | Scheme | Environment Variable |
---|---|---|---|
apiKeyAuth |
http | HTTP Bearer | DOCUMENTAI_API_KEY_AUTH |
To authenticate with the API the apiKeyAuth
parameter must be set when initializing the SDK client instance. For example:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
Available methods
- beginFieldExtraction - Begin Air Waybill Field Extraction
- getExtractedFields - Get Air Waybill Fields
- beginFieldExtraction - Begin Arrival Notice Field Extraction
- getExtractedFields - Get Arrival Notice Fields
- beginFieldExtraction - Begin Bank Statement Field Extraction
- getExtractedFields - Get Bank Statement Fields
- beginFieldExtraction - Begin Basic Contract Field Extraction
- getExtractedFields - Get Basic Contract Fields
- beginFieldExtraction - Begin Bill Of Lading Field Extraction
- getExtractedFields - Get Bill Of Lading Fields
- beginFieldExtraction - Begin Brokerage Statement Field Extraction
- getExtractedFields - Get Brokerage Statement Fields
- beginFieldExtraction - Begin Certificate Of Origin Field Extraction
- getExtractedFields - Get Certificate Of Origin Fields
- beginFieldExtraction - Begin Commercial Invoice Field Extraction
- getExtractedFields - Get Commercial Invoice Fields
- beginFieldExtraction - Begin Customs Declaration Field Extraction
- getExtractedFields - Get Customs Declaration Fields
- beginFieldExtraction - Begin Dangerous Goods Declaration Field Extraction
- getExtractedFields - Get Dangerous Goods Declaration Fields
- beginFieldExtraction - Begin Delivery Note Field Extraction
- getExtractedFields - Get Delivery Note Fields
- beginConversion - Begin Document Conversion
- getConversion - Get Conversion
- downloadConvertedDocument - Download Converted Document
- beginFieldExtraction - Begin Hotel Invoice Field Extraction
- getExtractedFields - Get Hotel Invoice Fields
- beginTextExtraction - Begin Image To Text Extraction
- getExtractedText - Get Extracted Text
- beginFieldExtraction - Begin International Consignment Note Field Extraction
- getExtractedFields - Get International Consignment Note Fields
- beginFieldExtraction - Begin Invoice Field Extraction
- getExtractedFields - Get Invoice Fields
- beginFieldExtraction - Begin Packing List Field Extraction
- getExtractedFields - Get Packing List Fields
- beginFieldExtraction - Begin Personal Earnings Statement Field Extraction
- getExtractedFields - Get Personal Earnings Statement Fields
- beginFieldExtraction - Begin Purchase Order Field Extraction
- getExtractedFields - Get Purchase Order Fields
- beginFieldExtraction - Begin Receipt Field Extraction
- getExtractedFields - Get Receipt Fields
- beginFieldExtraction - Begin Remittance Advice Field Extraction
- getExtractedFields - Get Remittance Advice Fields
- beginFieldExtraction - Begin Sea Waybill Field Extraction
- getExtractedFields - Get Sea Waybill Fields
- beginFieldExtraction - Begin Taxi Receipt Field Extraction
- getExtractedFields - Get Taxi Receipt Fields
- beginFieldExtraction - Begin US Form 1040 Field Extraction
- getExtractedFields - Get US Form 1040 Fields
- beginFieldExtraction - Begin US Form W2 Field Extraction
- getExtractedFields - Get US Form W2 Fields
- beginFieldExtraction - Begin Utility Bill Field Extraction
- getExtractedFields - Get Utility Bill Fields
All the methods listed above are available as standalone functions. These functions are ideal for use in applications running in the browser, serverless runtimes or other environments where application bundle size is a primary concern. When using a bundler to build your application, all unused functionality will be either excluded from the final bundle or tree-shaken away.
To read more about standalone functions, check FUNCTIONS.md.
Available standalone functions
-
documentsDelete
- Delete Document -
documentsList
- List Documents -
modelsAirWaybillBeginFieldExtraction
- Begin Air Waybill Field Extraction -
modelsAirWaybillGetExtractedFields
- Get Air Waybill Fields -
modelsArrivalNoticeBeginFieldExtraction
- Begin Arrival Notice Field Extraction -
modelsArrivalNoticeGetExtractedFields
- Get Arrival Notice Fields -
modelsBankStatementBeginFieldExtraction
- Begin Bank Statement Field Extraction -
modelsBankStatementGetExtractedFields
- Get Bank Statement Fields -
modelsBasicContractBeginFieldExtraction
- Begin Basic Contract Field Extraction -
modelsBasicContractGetExtractedFields
- Get Basic Contract Fields -
modelsBillOfLadingBeginFieldExtraction
- Begin Bill Of Lading Field Extraction -
modelsBillOfLadingGetExtractedFields
- Get Bill Of Lading Fields -
modelsBrokerageStatementBeginFieldExtraction
- Begin Brokerage Statement Field Extraction -
modelsBrokerageStatementGetExtractedFields
- Get Brokerage Statement Fields -
modelsCertificateOfOriginBeginFieldExtraction
- Begin Certificate Of Origin Field Extraction -
modelsCertificateOfOriginGetExtractedFields
- Get Certificate Of Origin Fields -
modelsCommercialInvoiceBeginFieldExtraction
- Begin Commercial Invoice Field Extraction -
modelsCommercialInvoiceGetExtractedFields
- Get Commercial Invoice Fields -
modelsCustomsDeclarationBeginFieldExtraction
- Begin Customs Declaration Field Extraction -
modelsCustomsDeclarationGetExtractedFields
- Get Customs Declaration Fields -
modelsDangerousGoodsDeclarationBeginFieldExtraction
- Begin Dangerous Goods Declaration Field Extraction -
modelsDangerousGoodsDeclarationGetExtractedFields
- Get Dangerous Goods Declaration Fields -
modelsDeliveryNoteBeginFieldExtraction
- Begin Delivery Note Field Extraction -
modelsDeliveryNoteGetExtractedFields
- Get Delivery Note Fields -
modelsDocumentConversionBeginConversion
- Begin Document Conversion -
modelsDocumentConversionDownloadConvertedDocument
- Download Converted Document -
modelsDocumentConversionGetConversion
- Get Conversion -
modelsHotelInvoiceBeginFieldExtraction
- Begin Hotel Invoice Field Extraction -
modelsHotelInvoiceGetExtractedFields
- Get Hotel Invoice Fields -
modelsImageToTextBeginTextExtraction
- Begin Image To Text Extraction -
modelsImageToTextGetExtractedText
- Get Extracted Text -
modelsInternationalConsignmentNoteBeginFieldExtraction
- Begin International Consignment Note Field Extraction -
modelsInternationalConsignmentNoteGetExtractedFields
- Get International Consignment Note Fields -
modelsInvoiceBeginFieldExtraction
- Begin Invoice Field Extraction -
modelsInvoiceGetExtractedFields
- Get Invoice Fields -
modelsPackingListBeginFieldExtraction
- Begin Packing List Field Extraction -
modelsPackingListGetExtractedFields
- Get Packing List Fields -
modelsPersonalEarningsStatementBeginFieldExtraction
- Begin Personal Earnings Statement Field Extraction -
modelsPersonalEarningsStatementGetExtractedFields
- Get Personal Earnings Statement Fields -
modelsPurchaseOrderBeginFieldExtraction
- Begin Purchase Order Field Extraction -
modelsPurchaseOrderGetExtractedFields
- Get Purchase Order Fields -
modelsReceiptBeginFieldExtraction
- Begin Receipt Field Extraction -
modelsReceiptGetExtractedFields
- Get Receipt Fields -
modelsRemittanceAdviceBeginFieldExtraction
- Begin Remittance Advice Field Extraction -
modelsRemittanceAdviceGetExtractedFields
- Get Remittance Advice Fields -
modelsSeaWaybillBeginFieldExtraction
- Begin Sea Waybill Field Extraction -
modelsSeaWaybillGetExtractedFields
- Get Sea Waybill Fields -
modelsTaxiReceiptBeginFieldExtraction
- Begin Taxi Receipt Field Extraction -
modelsTaxiReceiptGetExtractedFields
- Get Taxi Receipt Fields -
modelsUsForm1040BeginFieldExtraction
- Begin US Form 1040 Field Extraction -
modelsUsForm1040GetExtractedFields
- Get US Form 1040 Fields -
modelsUsFormW2BeginFieldExtraction
- Begin US Form W2 Field Extraction -
modelsUsFormW2GetExtractedFields
- Get US Form W2 Fields -
modelsUtilityBillBeginFieldExtraction
- Begin Utility Bill Field Extraction -
modelsUtilityBillGetExtractedFields
- Get Utility Bill Fields
Some of the endpoints in this SDK support pagination. To use pagination, you
make your SDK calls as usual, but the returned response object will also be an
async iterable that can be consumed using the for await...of
syntax.
Here's an example of one such pagination call:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
Some of the endpoints in this SDK support retries. If you use the SDK without any configuration, it will fall back to the default retry strategy provided by the API. However, the default retry strategy can be overridden on a per-operation basis, or across the entire SDK.
To change the default retry strategy for a single API call, simply provide a retryConfig object to the call:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
}, {
retries: {
strategy: "backoff",
backoff: {
initialInterval: 1,
maxInterval: 50,
exponent: 1.1,
maxElapsedTime: 100,
},
retryConnectionErrors: false,
},
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
If you'd like to override the default retry strategy for all operations that support retries, you can provide a retryConfig at SDK initialization:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
retryConfig: {
strategy: "backoff",
backoff: {
initialInterval: 1,
maxInterval: 50,
exponent: 1.1,
maxElapsedTime: 100,
},
retryConnectionErrors: false,
},
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
Some methods specify known errors which can be thrown. All the known errors are enumerated in the models/errors/errors.ts
module. The known errors for a method are documented under the Errors tables in SDK docs. For example, the list
method may throw the following errors:
Error Type | Status Code | Content Type |
---|---|---|
errors.BadRequestError | 400 | application/json |
errors.UnauthorizedError | 401 | application/json |
errors.TooManyRequestsError | 429 | application/json |
errors.InternalServerError | 500 | application/json |
errors.APIError | 4XX, 5XX | */* |
If the method throws an error and it is not captured by the known errors, it will default to throwing a APIError
.
import { DocumentAi } from "@abbyy-sdk/document-ai";
import {
BadRequestError,
InternalServerError,
SDKValidationError,
TooManyRequestsError,
UnauthorizedError,
} from "@abbyy-sdk/document-ai/models/errors";
const documentAi = new DocumentAi({
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
let result;
try {
result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
} catch (err) {
switch (true) {
// The server response does not match the expected SDK schema
case (err instanceof SDKValidationError): {
// Pretty-print will provide a human-readable multi-line error message
console.error(err.pretty());
// Raw value may also be inspected
console.error(err.rawValue);
return;
}
case (err instanceof BadRequestError): {
// Handle err.data$: BadRequestErrorData
console.error(err);
return;
}
case (err instanceof UnauthorizedError): {
// Handle err.data$: UnauthorizedErrorData
console.error(err);
return;
}
case (err instanceof TooManyRequestsError): {
// Handle err.data$: TooManyRequestsErrorData
console.error(err);
return;
}
case (err instanceof InternalServerError): {
// Handle err.data$: InternalServerErrorData
console.error(err);
return;
}
default: {
// Other errors such as network errors, see HTTPClientErrors for more details
throw err;
}
}
}
}
run();
Validation errors can also occur when either method arguments or data returned from the server do not match the expected format. The SDKValidationError
that is thrown as a result will capture the raw value that failed validation in an attribute called rawValue
. Additionally, a pretty()
method is available on this error that can be used to log a nicely formatted multi-line string since validation errors can list many issues and the plain error string may be difficult read when debugging.
In some rare cases, the SDK can fail to get a response from the server or even make the request due to unexpected circumstances such as network conditions. These types of errors are captured in the models/errors/httpclienterrors.ts
module:
HTTP Client Error | Description |
---|---|
RequestAbortedError | HTTP request was aborted by the client |
RequestTimeoutError | HTTP request timed out due to an AbortSignal signal |
ConnectionError | HTTP client was unable to make a request to a server |
InvalidRequestError | Any input used to create a request is invalid |
UnexpectedClientError | Unrecognised or unexpected error |
The default server can be overridden globally by passing a URL to the serverURL: string
optional parameter when initializing the SDK client instance. For example:
import { DocumentAi } from "@abbyy-sdk/document-ai";
const documentAi = new DocumentAi({
serverURL: "https://api.abbyy.com/document-ai",
apiKeyAuth: process.env["DOCUMENTAI_API_KEY_AUTH"] ?? "",
});
async function run() {
const result = await documentAi.documents.list({
cursor: "xyz",
});
for await (const page of result) {
// Handle the page
console.log(page);
}
}
run();
The TypeScript SDK makes API calls using an HTTPClient
that wraps the native
Fetch API. This
client is a thin wrapper around fetch
and provides the ability to attach hooks
around the request lifecycle that can be used to modify the request or handle
errors and response.
The HTTPClient
constructor takes an optional fetcher
argument that can be
used to integrate a third-party HTTP client or when writing tests to mock out
the HTTP client and feed in fixtures.
The following example shows how to use the "beforeRequest"
hook to to add a
custom header and a timeout to requests and how to use the "requestError"
hook
to log errors:
import { DocumentAi } from "@abbyy-sdk/document-ai";
import { HTTPClient } from "@abbyy-sdk/document-ai/lib/http";
const httpClient = new HTTPClient({
// fetcher takes a function that has the same signature as native `fetch`.
fetcher: (request) => {
return fetch(request);
}
});
httpClient.addHook("beforeRequest", (request) => {
const nextRequest = new Request(request, {
signal: request.signal || AbortSignal.timeout(5000)
});
nextRequest.headers.set("x-custom-header", "custom value");
return nextRequest;
});
httpClient.addHook("requestError", (error, request) => {
console.group("Request Error");
console.log("Reason:", `${error}`);
console.log("Endpoint:", `${request.method} ${request.url}`);
console.groupEnd();
});
const sdk = new DocumentAi({ httpClient });
You can setup your SDK to emit debug logs for SDK requests and responses.
You can pass a logger that matches console
's interface as an SDK option.
[!WARNING] Beware that debug logging will reveal secrets, like API tokens in headers, in log messages printed to a console or files. It's recommended to use this feature only during local development and not in production.
import { DocumentAi } from "@abbyy-sdk/document-ai";
const sdk = new DocumentAi({ debugLogger: console });
You can also enable a default debug logger by setting an environment variable DOCUMENTAI_DEBUG
to true.