Search results
107 packages found
Sort by: Default
- Default
- Most downloaded this week
- Most downloaded this month
- Most dependents
- Recently published
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
- audio
- javascript
- youtube
- typescript
- sdk
- ffmpeg
- speech
- subtitles
- srt
- webvtt
- speech-to-text
- transcription
- stt
- asr
- View more
A client for Amazon Transcribe using the websocket interface
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text and text-to-speech using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- embedded systems
- open source
- zipformer
- asr
- tts
- stt
- c++
- onnxruntime
- onnx
- View more
Make your app understand language. Summarize conversations, categorize articles, and more.
- nlp
- language
- natural language processing
- oneai
- one ai
- ai
- one
- natural language understanding
- natural language
- text
- text processing
- text classification
- text analysis
- language ai
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
React hook for Cheetah Web SDK
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
The 134,000+ words and their pronunciations in the CMU pronouncing dictionary
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
- speech-to-text
- speech
- transcription
- voice-recognition
- into-text
- multi-language
- speech to text
- speak and text
- text
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Transposer connector is a PeerTube language tool plugin to transcribe and translate with Whisper
Convert AWS transcription JSON to srt
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription
- openai
- whisper
- audio
- transcription
- speech-to-text
- openai-whisper
- whisper-js
- audio-processing
- ai-transcription
- python-wrapper
- typescript
- nodejs
- speech-recognition
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
NodeJS wrapper for Deepgram
Leopard Speech-to-Text engine for web browsers (via WebAssembly)