Local Speech-to-Text MCP Server by SmartLittleApps

Local Speech-to-Text

Provider

SmartLittleApps

Classification

community

Est. Downloads

Our estimate of how many times this server has been downloaded across the MCP ecosystem (not on any single platform). The estimate comes from an algorithm fed with publicly available data, social signals, and other inputs.

730

Released On

Jun 3, 2025

Popularity Ranking

Our estimate of where this MCP server implementation ranks on the global usage leaderboard.

#3,974 (#5,457 this week)

Provides local speech-to-text transcription using whisper.cpp with automatic audio format conversion, intelligent chunking for long files, speaker diarization, and multiple output formats for private transcription workflows without cloud dependencies.

GitHub Repo (5 stars)

Related Servers

ElevenLabs

Mamerto Fabian

Integrates with ElevenLabs API to provide text-to-speech capabilities for generating high-quality audio from text...

Classification

community

Est Downloads (All Time)

17.1k

Release Date

Dec 21, 2024

macOS Say

Barton Rhodes

Leverages macOS 'say' command for customizable text-to-speech functionality, enabling dynamic voice output.

Classification

community

Est Downloads (All Time)

2.6k

Release Date

Jan 4, 2025

Text To Speech (Windows)

ExpressionsBot

Integrates with Windows speech services to enable text-to-speech and speech-to-text capabilities using native system...

Classification

community

Est Downloads (All Time)

730

Release Date

Jan 18, 2025

TTS Say

Hiroki Daichi

Integrates with OpenAI's API and local sound playback to convert text into audible speech, enabling voice output for...

Classification

community

Est Downloads (All Time)

876

Release Date

Feb 1, 2025

Zonos TTS

PhialsBasement

Integrates with Zonos TTS API to generate expressive, multi-language speech output for AI applications using...

Classification

community

Est Downloads (All Time)

218

Release Date

Feb 15, 2025

ElevenLabs Text-to-Speech

Sebastian Georgi

Integrates ElevenLabs' text-to-speech capabilities for high-quality, customizable voice output in interactions,...

Classification

community

Est Downloads (All Time)

146

Release Date

Feb 25, 2025

Voice Recorder (Whisper)

DefiBax

Integrates with OpenAI's Whisper model to provide voice recording and transcription capabilities for applications...

Classification

community

Est Downloads (All Time)

876

Release Date

Mar 1, 2025

Speech Interface (Faster Whisper)

Max Novich

Integrates voice interaction capabilities using faster-whisper and PyAudio for speech recognition and synthesis,...

Classification

community

Est Downloads (All Time)

13.4k

Release Date

Mar 4, 2025

Kokoro TTS

Giannis Anni

Integrates with the Kokoro TTS engine to provide customizable text-to-speech capabilities, supporting cross-platform...

Classification

community

Est Downloads (All Time)

1.5k

Release Date

Mar 6, 2025

Kokoro Speech

hammeiam

Provides text-to-speech capabilities using the Kokoro TTS model, enabling natural-sounding voice output with...

Classification

community

Est Downloads (All Time)

866

Release Date

Mar 21, 2025

AivisSpeech

Kentaro Kuribayashi

Enables AI systems to generate and play speech audio from text input through the AivisSpeech API, with configurable...

Classification

community

Est Downloads (All Time)

—

Release Date

Mar 15, 2025

OpenAI TTS

Yuichi Nakamura

Enables high-quality voice generation from text using OpenAI's TTS API with customizable voices, formats, and speech...

Classification

community

Est Downloads (All Time)

610

Release Date

Mar 23, 2025

Transcripter

Zentala

Enables audio transcription and analysis through TypeScript and Express, providing tools for searching, summarizing,...

Classification

community

Est Downloads (All Time)

—

Release Date

Mar 23, 2025

Say (Text-to-Speech)

blacktop

Provides text-to-speech capabilities through both native system voices and ElevenLabs integration, enabling...

Classification

community

Est Downloads (All Time)

5.4k

Release Date

Mar 24, 2025

Kokoro TTS

mberg

Converts text to speech using the Kokoro TTS engine with configurable voices, speeds, and languages, supporting both...

Classification

community

Est Downloads (All Time)

10.2k

Release Date

Mar 24, 2025

Audio Transcriber (OpenAI Whisper)

Ichigo3766

Provides speech-to-text transcription capabilities using OpenAI's Whisper API with configurable language settings and...

Classification

community

Est Downloads (All Time)

247

Release Date

Mar 25, 2025

Video Digest

R-lz

Transcribes and analyzes video content from sources like YouTube using multiple transcription services with automatic...

Classification

community

Est Downloads (All Time)

3.9k

Release Date

Apr 3, 2025

Typecast AI

Neosapience

Bridges to Typecast AI text-to-speech service, enabling high-quality voice synthesis with customizable emotional...

Classification

official

Est Downloads (All Time)

292

Release Date

Apr 3, 2025

Blabber (OpenAI TTS)

Pink Pixel

Converts text into natural-sounding speech with multiple voice options, audio formats, and automatic playback...

Classification

community

Est Downloads (All Time)

155

Release Date

Apr 7, 2025

Rime Text-to-Speech

Matthew Dailey

Text-to-speech server that converts text into spoken audio through Rime's API, streaming with optimized buffering for...

Classification

community

Est Downloads (All Time)

888

Release Date

Apr 8, 2025

ElevenLabs

Integrates with ElevenLabs to provide high-quality text-to-speech, voice cloning, and conversational capabilities...

Classification

official

Est Downloads (All Time)

66k

Release Date

Apr 7, 2025

Edge Text-to-Speech

yuiseki

Provides a bridge to Microsoft's Edge Text-to-Speech service for converting text into natural-sounding speech across...

Classification

community

Est Downloads (All Time)

730

Release Date

Apr 20, 2025

Mobvoi TTS

Mobvoi

Enables AI to generate natural-sounding speech and clone voices using Mobvoi's text-to-speech APIs with customizable...

Classification

official

Est Downloads (All Time)

—

Release Date

May 9, 2025

VOICEVOX

Yuki Kobayashi

Enables AI to generate natural-sounding Japanese voice audio from text through integration with the VOICEVOX engine,...

Classification

community

Est Downloads (All Time)

146

Release Date

May 20, 2025

SiliconFlow Voice Transcription

AIO-2030

Provides voice transcription capabilities by processing audio files with the FunAudioLLM/SenseVoiceSmall model,...

Classification

community

Est Downloads (All Time)

—

Release Date

May 22, 2025

Whissle

vmehta14

Integrates with Whissle's speech processing API to enable speech-to-text transcription with timestamps and speaker...

Classification

community

Est Downloads (All Time)

292

Release Date

May 27, 2025

Chatterbox TTS

digitarald

Converts text to speech using either high-quality Chatterbox TTS neural models or macOS's built-in 'say' command with...

Classification

community

Est Downloads (All Time)

1.2k

Release Date

Jun 7, 2025

Voice MCP

mbailey

Enables two-way voice conversations through multiple transport methods including local microphone recording and...

Classification

community

Est Downloads (All Time)

14.1k

Release Date

Jun 9, 2025

AivisSpeech

shinshin86

Integrates with AivisSpeech engine to provide Japanese text-to-speech synthesis with customizable voice parameters...

Classification

community

Est Downloads (All Time)

3.6k

Release Date

Jun 30, 2025

Fish Audio

Daichi Okazaki

Integrates with Fish Audio's API to generate high-quality speech from text with configurable voice models, audio...

Classification

community

Est Downloads (All Time)

1.2k

Release Date

Jul 4, 2025

Video & Audio Text Extraction

Seazhang

Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X,...

Classification

community

Est Downloads (All Time)

4.6k

Release Date

Jul 23, 2025

Voice Gen (Minimax AI)

mylxsw

Converts text to high-quality speech audio using Minimax AI API with automatic S3 storage, organized directory...

Classification

community

Est Downloads (All Time)

146

Release Date

Sep 7, 2025

Voice Interface

shantur

Provides browser-based voice input/output capabilities for conversations, featuring real-time speech-to-text...

Classification

community

Est Downloads (All Time)

—

Release Date

Sep 21, 2025

VOICEPEAK

k2wanko

Integrates with VOICEPEAK engine to generate natural-sounding Japanese speech with customizable narrators, emotions,...

Classification

community

Est Downloads (All Time)

985

Release Date

Oct 5, 2025

Video Extraction Plus

takereshui

Extracts text from video platforms and audio files using multiple ASR services including OpenAI Whisper, Bilibili's...

Classification

community

Est Downloads (All Time)

876

Release Date

Nov 23, 2025

Speaker Diarization

snailbrainx

Real-time speaker diarization with automatic enrollment, transcription, and speaker tracking for multi-party...

Classification

community

Est Downloads (All Time)

146

Release Date

Nov 10, 2025