markitdown-ts

markitdown-ts is a TypeScript library designed for converting various file formats to Markdown. This makes it suitable for indexing, text analysis, and other applications that benefit from structured text. It is a TypeScript implementation of the original markitdown Python library.

It supports:

[x] PDF
[x] Word (.docx)
[x] Excel (.xlsx)
[x] Images (EXIF metadata extraction and optional LLM-based description)
[x] Audio (EXIF metadata extraction only)
[x] HTML
[x] Text-based formats (plain text, .csv, .xml, .rss, .atom)
[x] Jupyter Notebooks (.ipynb)
[x] Bing Search Result Pages (SERP)
[x] ZIP files (recursively iterates over contents)
[ ] PowerPoint

[!NOTE]

Speech Recognition for audio converter has not been implemented yet. I'm happy to accept contributions for this feature.

Installation

Install markitdown-ts using your preferred package manager:

pnpm add markitdown-ts

Usage

import { MarkItDown } from "markitdown-ts";

const markitdown = new MarkItDown();
try {
  const result = await markitdown.convert("path/to/your/file.pdf");
  if (result) {
    console.log(result.text_content);
  }
} catch (error) {
  console.error("Conversion failed:", error);
}

Pass additional options as needed for specific functionality.

YouTube Transcript Support

When converting YouTube files, you can pass the enableYoutubeTranscript and the youtubeTranscriptLanguage option to control the transcript extraction. By default it will use "en" if the youtubeTranscriptLanguage is not provided.

const markitdown = new MarkItDown();
const result = await markitdown.convert("https://www.youtube.com/watch?v=V2qZ_lgxTzg", {
  enableYoutubeTranscript: true,
  youtubeTranscriptLanguage: "en"
});

LLM Image Description Support

To enable LLM functionality, you need to configure a model and client in the options for the image converter. You can use the @ai-sdk/openai to get an LLM client.

import { openai } from "@ai-sdk/openai";

const markitdown = new MarkItDown();
const result = await markitdown.convert("test.jpg", {
  llmModel: openai("gpt-4o-mini"),
  llmPrompt: "Write a detailed description of this image"
});

API

The library uses a single function convert for all conversions, with the options and the response type defined as such:

export interface DocumentConverter {
  convert(local_path: string, options: ConverterOptions): Promise<ConverterResult>;
}

export type ConverterResult =
  | {
      title: string | null;
      text_content: string;
    }
  | null
  | undefined;

export type ConverterOption = {
  file_extension?: string;
  url?: string;
  fetch?: typeof fetch;
  enableYoutubeTranscript?: boolean; // false by default
  youtubeTranscriptLanguage?: string; // "en" by default
  llmModel: string;
  llmPrompt?: string;
  styleMap?: string | Array<string>;
  _parent_converters?: DocumentConverter[];
  cleanup_extracted?: boolean;
};

Examples

Check out the examples folder.

markitdown-ts

markitdown-ts

Installation

Usage

YouTube Transcript Support

LLM Image Description Support

API

Examples

License

Readme

Keywords

Package Sidebar

Install

Repository

Homepage

Weekly Downloads

Version

License

Unpacked Size

Total Files

Last publish

Collaborators

markitdown-ts

markitdown-ts

Installation

Usage

YouTube Transcript Support

LLM Image Description Support

API

Examples

License

Readme

Keywords

Package Sidebar

Install

Repository

Homepage

DownloadsWeekly Downloads

Version

License

Unpacked Size

Total Files

Last publish

Collaborators

Weekly Downloads