A simple Node.js library to transcribe YouTube video captions.
yt-transcribe
allows you to fetch and transcribe captions from YouTube videos programmatically. It uses node-fetch
to make HTTP requests, xmldom
to parse XML data and he
to decode special characters.
You can install the library using npm:
npm install yt-transcribe
Here's an example of how to use `yt-transcribe` to get the transcript of a YouTube video:
import { transcribe } from 'yt-transcribe';
const videoUrl = 'https://www.youtube.com/watch?v=dQw4w9WgXcQ'; // Replace with a real video URL
transcribe(videoUrl).then(result => {
if (result.error) {
console.error('Error transcribing video:', result.error);
} else {
console.log('Transcript:', result.transcript);
}
});
Another simplified example:
import { transcribe } from 'yt-transcribe';
const videoUrl = 'https://www.youtube.com/watch?v=dQw4w9WgXcQ'; // Replace with a real video URL
const { transcript } = await transcribe(videoUrl);
console.log('Transcript:', transcript);
Fetches and transcribes the captions from a YouTube video.
videoUrl
(string): The URL of the YouTube video.
- A
Promise
that resolves to an object containing the transcript or an error message.
{
"transcript": "The transcribed text"
}
If an error occurs, the object will contain an error
field with the error message.
Contributions are welcome! Please open an issue or submit a pull request.
- Fork the repository.
- Create a new branch for your feature or bugfix.
- Make your changes.
- Commit your changes with a meaningful commit message.
- Push your changes to your fork.
- Open a pull request to the main repository.
This project is licensed under the MIT License. See the LICENSE file for details.
Kevin Klatt - klattkev@gmail.com