Connect Claude, Cursor, Windsurf & AI Assistants to Your Speak AI Workspace
83 tools, 5 resources, 3 prompts, and 26 CLI commands. Transcribe, analyze, search, and manage media at scale from any MCP-compatible AI assistant or your terminal. Free and open source.
Or pay as you go from $1.50/hr. No subscription required. See pricing.
https://mcp.speakai.co/mcp
Connect your AI — choose your platform
Remote URL connection works with no installation. Or install locally via npm. The auto-setup wizard (speakai-mcp init) detects installed MCP clients and configures them automatically.
Claude (Web)claude.ai browser
- Open claude.ai and go to Settings → Integrations.
- Click Add Integration and paste:
https://mcp.speakai.co/mcp - Authenticate with your Speak AI account and click Save. All 83 tools are now available in Claude.
Claude DesktopMac & Windows app
- Go to Settings → Developer → Edit Config and add:
{ "mcpServers": { "speakai": { "command": "npx", "args": ["-y", "@speakai/mcp-server"], "env": { "SPEAKAI_API_KEY": "YOUR_API_KEY" } } } } - Replace
YOUR_API_KEYwith your Speak AI API key. - Save and restart Claude Desktop.
ChatGPTOpenAI web & app
- Go to ChatGPT Settings → Integrations (Plus or Team required).
- Click Connect to MCP Server and enter:
https://mcp.speakai.co/mcp - Authenticate with your Speak AI credentials. Tools appear instantly.
CursorAI-first code editor
- Go to Cursor Settings → MCP and click Add MCP Server.
- Set the URL to:
https://mcp.speakai.co/mcp - Add your Speak AI API key and save. Tools appear in the AI panel.
VS Codewith GitHub Copilot
- Open VS Code Settings (JSON) and add:
{ "github.copilot.mcpServers": { "speakai": { "url": "https://mcp.speakai.co/mcp" } } } - Reload VS Code and open GitHub Copilot Chat. Speak AI tools appear automatically.
Claude Code CLITerminal · Official Plugin
- Recommended — install from the official marketplace. Type inside Claude Code:
/plugin install speakai@claude-plugins-official
- Activate the plugin:
/reload-plugins
- Run the
getting-startedskill to connect your Speak AI API key. Get your key from app.speakai.co/developers. - Alternative — manual HTTP transport:
claude mcp add speakai \ --transport http \ --url https://mcp.speakai.co/mcp
Any MCP Client (JSON)Windsurf, Zed, Codeium…
- Find MCP / external tools settings in your client and add:
{ "servers": { "speakai": { "type": "http", "url": "https://mcp.speakai.co/mcp" } } } - Use HTTP/SSE transport and authenticate with your Speak AI API key.
- Not sure how? Email us — we’ll walk you through it.
npm (Local Install)Auto-setup for all clients
- Install globally:
npm install -g @speakai/mcp-server - Run the auto-setup wizard:
speakai-mcp init - It detects Claude Desktop, Cursor, Windsurf, VS Code and configures them automatically.
- Set your API key:
speakai-mcp config set-key
83 tools across 12 categories
Every Speak AI capability is available as an MCP tool. Your AI assistant can chain tools together for multi-step workflows: upload a recording, wait for processing, pull the transcript, extract action items, and export to PDF in a single conversation.
14 tools
Media
Upload from URL or local file, transcribe, get insights, export captions, update metadata, reanalyze with latest models. Upload-and-analyze in one call.
12 tools
Magic Prompt / AI Chat
Ask AI questions about any media, folder, or your entire workspace. Get chat history, manage favorites, export answers, and view usage statistics.
11 tools
Folders & Views
Create, clone, and organize folders. Save custom views with filters and sorting. Manage your media library structure programmatically.
10 tools
Recorder & Survey
Create embeddable recorders and surveys. Manage questions, branding, permissions. List submissions and generate shareable public URLs.
5 tools
Automations
Create and manage automation rules. Toggle automations on or off. Trigger workflows based on events in your workspace.
4 tools
Clips
Create highlight clips from time ranges across media files. Update, tag, and share clips. Pull the best moments from long recordings.
4 tools
Custom Fields
Define and manage custom metadata fields. Batch update fields across multiple media items for structured data workflows.
4 tools
Webhooks
Create webhook endpoints for real-time event notifications. Update, list, and manage webhooks for event-driven integrations.
4 tools
Meeting Assistant
Schedule the AI assistant to join meetings. List events, remove from active meetings, or cancel scheduled sessions. Works with Zoom, Google Meet, Teams.
4 tools
Media Embed
Create embeddable player widgets for your website. Update settings, check status, and get iframe URLs for any media file.
4 tools
Text Notes
Create text notes for AI analysis. Get NLP insights on any text. Re-run analysis with latest models. Update note content to trigger re-analysis.
5 tools
Exports, Search & Stats
Export as PDF, DOCX, SRT, VTT, TXT, CSV, or Markdown. Batch export with optional merge. Deep search across transcripts and insights. Workspace statistics and supported languages.
26 CLI commands for scripting and automation
Everything the MCP server can do, the CLI can do from your terminal. Upload files, pull transcripts, search across your library, and pipe results to other tools. Every command supports --json for scripting.
Media management
upload, list-media, get-transcript, get-insights, status, export, update, delete, favorites, captions, reanalyze
AI and search
ask lets you query media, folders, or your whole workspace with AI. chat-history lists past conversations. search does full-text search across all transcripts and insights.
Organization and more
list-folders, create-folder, clips, clip, stats, languages, schedule-meeting, create-text. Plus config and init for setup.
speakai-mcp upload ./interview.mp3 -n “Q1 Interview” –wait
# Get plain-text transcript
speakai-mcp transcript abc123 –plain > meeting.txt
# Ask AI about a specific recording
speakai-mcp ask “What were the action items?” -m abc123
# Search all transcripts
speakai-mcp search “pricing concerns” –from 2026-01-01
# Export as PDF with speaker names
speakai-mcp export abc123 -f pdf –speakers
# List videos as JSON for piping
speakai-mcp ls –type video –json | jq ‘.mediaList[].name’
What you can do
Example workflows that combine multiple tools. Your AI assistant chains these automatically in a single conversation.
Transcribe and analyze a meeting
“Upload and transcribe this recording.” The assistant calls upload_and_analyze, waits for processing, and returns the full transcript with speaker labels, sentiment, action items, and key topics.
Research across your library
“What themes came up across all our customer interviews this month?” The assistant searches your media, retrieves insights from each file, and synthesizes patterns across 12+ recordings in one response.
Join a meeting and summarize
“Join my 2pm Zoom call and send me a summary with action items.” The assistant schedules the meeting bot, then pulls the transcript and insights after the meeting ends.
Build a weekly brief
“Prepare a brief from all meetings in the last week.” The assistant lists recent media, pulls insights from each, and creates a consolidated brief with decisions, action items grouped by owner, and follow-ups.
Create highlight clips
“Find the best quotes about pricing from the last 5 interviews and create clips.” The assistant searches transcripts, identifies time ranges, and creates shareable clips from each recording.
Batch export for a report
“Export all recordings from the Q1 Research folder as PDFs with speaker names.” The assistant lists folder contents and runs batch export, merging results into downloadable files.
How Speak AI compares
Most transcription tools offer 5–15 MCP tools focused on meeting notes. Speak AI provides a complete media intelligence platform with 83 tools, a full CLI, and capabilities no competitor offers.
| Capability | Speak AI | Typical Meeting MCP |
|---|---|---|
| MCP tools | 83 | 5–15 |
| CLI commands | 26 | None |
| MCP resources | 5 | 0–1 |
| Built-in prompts | 3 | 0 |
| NLP analytics (sentiment, keywords, entities) | Yes, built in | No |
| Multi-model AI Chat | Claude, Gemini, GPT | Single model or none |
| Cross-file analysis | Search and query across entire library | Single meeting only |
| Upload from URL or local file | Both | URL only or none |
| Export formats | PDF, DOCX, SRT, VTT, TXT, CSV, Markdown | Text only |
| Embeddable recorder and surveys | Yes, 10 tools | No |
| Automations and webhooks | Yes, 9 tools | No |
| Meeting bot scheduling | Yes, 4 tools | Limited |
| Languages | 70+ | 10–30 |
| Open source | MIT license | Varies |
Resources and built-in prompts
MCP resources provide direct data access without tool calls. Built-in prompts run multi-step workflows with a single command.
5 MCP Resources
Direct access to your media library, folders, supported languages, and per-file transcripts and insights. Clients can read these URIs without making explicit tool calls.
analyze-meeting prompt
Upload a recording and get a complete analysis: transcript, insights, action items, and key takeaways. One prompt runs the full pipeline.
research-across-media prompt
Search for themes, patterns, or topics across multiple recordings or your entire library. Specify a topic and optional folder to scope the research.
meeting-brief prompt
Prepare a brief from recent meetings. Pulls transcripts, extracts decisions, and summarizes open items. Defaults to the last 7 days.
Why Speak AI built the most comprehensive MCP server for media intelligence
MCP (Model Context Protocol) is the open standard that lets AI assistants like Claude and ChatGPT connect to external tools and data. Instead of copying and pasting transcripts into a chat window, you give your AI assistant direct access to your entire Speak AI workspace. It can upload files, pull transcripts, run NLP analysis, search across recordings, create clips, and export results, all through natural conversation.
Most transcription platforms that offer MCP servers provide a handful of tools focused on meeting notes. Speak AI takes a fundamentally different approach. With 83 MCP tools across 12 categories, plus 26 CLI commands, the Speak AI MCP server covers every capability of the platform: media management, AI-powered analysis, folder organization, embeddable recorders and surveys, automations, webhooks, meeting bot scheduling, custom fields, and multi-format exports.
The CLI changes how teams work with media data
The 26 CLI commands turn Speak AI into a scriptable media intelligence engine. Upload recordings from a shell script. Pull transcripts and pipe them to other tools. Search across your entire library from the command line. Every command supports --json output for integration with jq, Python scripts, or CI/CD pipelines. Teams use the CLI for batch processing, automated reporting, and building custom workflows that would be impractical through a web interface.
Built for researchers, not just meetings
Meeting transcription is table stakes. Speak AI is built for qualitative researchers, consulting firms, market research teams, and anyone who needs to analyze conversations at scale. The MCP server lets you query across dozens or hundreds of recordings: “What pricing concerns came up in customer interviews this quarter?” or “Compare sentiment across focus groups by region.” Cross-file analysis is the differentiator that single-meeting tools cannot match.
NLP analytics built into every tool
When you upload a recording through the MCP server, Speak AI does not just transcribe it. Every file gets automatic NLP analysis: sentiment scoring, keyword extraction, topic detection, theme identification, and named entity recognition. These structured insights are available as JSON through the get_media_insights tool. Your AI assistant can use them to compare sentiment across recordings, find trending topics, or pull specific data points without re-processing the audio.
Open source and extensible
The Speak AI MCP server is open source under the MIT license. View the full source on GitHub, install from npm, and extend it for your own workflows. The server authenticates using your personal API key and only accesses data in your Speak AI workspace. All communication uses HTTPS encryption. Full API documentation is available at docs.speakai.co.
Frequently asked questions
What is an MCP server?
MCP (Model Context Protocol) is an open standard that lets AI assistants connect to external tools and data. The Speak AI MCP server gives AI assistants like Claude, ChatGPT, Cursor, Windsurf, and VS Code direct access to 83 transcription, analysis, and media management tools in your Speak AI workspace.
Which AI assistants work with the Speak AI MCP server?
Claude (web, Desktop, and Code), ChatGPT, Cursor, Windsurf, VS Code with MCP extensions, and any AI tool that supports the MCP protocol. The auto-setup wizard (speakai-mcp init) detects installed clients and configures them automatically.
Do I need to be a developer to use this?
No. The remote connector setup for Claude.ai and ChatGPT requires no coding or installation. Add the remote MCP URL in settings and authenticate with your API key. The npm install is for local setups and the CLI.
What is the CLI and how is it different from the MCP server?
The CLI provides 26 commands you run directly in your terminal: speakai-mcp upload, speakai-mcp ask, speakai-mcp search, and more. The MCP server provides 83 tools that AI assistants call during conversation. Both are included in the same npm package.
Is my data secure?
Yes. The server authenticates using your personal API key and only accesses data in your Speak AI workspace. All communication uses HTTPS encryption. The server is open source under the MIT license, so you can audit the code on GitHub.
How much does it cost?
The MCP server and CLI are free and open source. You need a Speak AI account to use them. API access is available on all paid plans, and you get full access during the free 7-day trial with no credit card required. See pricing for plan details.
What languages are supported?
Speak AI supports transcription in over 70 languages with automatic language detection. All languages work through both the MCP server and CLI. Speaker diarization, timestamps, and NLP analytics are available across all supported languages.
Can I use both the MCP server and CLI together?
Yes, and this is the recommended approach for power users. Use the MCP server when you want your AI assistant to orchestrate complex workflows through conversation. Use the CLI for quick tasks, batch operations, and scripting. Both share the same API key and access the same workspace data.
Start using Speak AI from your AI assistant or terminal
83 MCP tools, 26 CLI commands, and the most comprehensive media intelligence integration available for AI assistants. Get started in 2 minutes.
Get started free
Create an account, grab your API key, and connect your AI assistant. Full access during the 7-day trial. No credit card required.
Need help connecting?
Our team will walk you through setup in 15 minutes. Or email us and we’ll respond same day.