Qualitative Research

Decrease the time and cost to transcribe, analyze, and publish research.

No more transcribing by hand. No more expensive transcribers. Instantly transcribe and extract insights from your audio and video files. Keep all your data together in one intuitive application and media player.

Speak is built for Research

3 Easy Steps

Upload

Import your files or record live in a secure portal.

Analyze

Understand the meaning, not just the words.

Export

The possibilities are instant and ongoing.

Audio and video to text instantly

Language identification

Speak automatically detects languages and is capable of accurately analyzing multi-lingual audio and video.

Automated transcription

Speak give you the ability to easily convert speech to text in 10 languages. With high-quality audio and video, Speak can immediately deliver a time-stamped transcript with up to 98% accuracy.

Speaker identification

Speak labels and timestamps speakers so you can easily understand who spoke when.

Captioning

With Speak, you can easily export your audio and video files into three popular subtitle formats: WebVTT, TTML, or SRT.

Automatic Punctuation

Speak automatically punctuates transcriptions like commas, question marks, and periods using our machine learning models.

Translation (Coming Soon)

Immediately translate the transcription and insights into more than 7 languages.

Embed Transcript Player

Once your transcription and insights are returned, you can immediately embed or create custom interactive media players to share both publicly and privately.

Searchable Media

Because we transcribe and analyze media for you, you can search directly through the media. No more scrolling through audio and video thumbnails.

Personalized Vocabulary

Increase the accuracy of your transcription by adding custom vocabulary. You'll have to make a manual request now but we'll be adding lists for you on the front-end soon in premium plans so you can easily add words you use!

Transcript Editor

Once your transcription and insights are returned, you can edit both directly within the platform. Clean up any inaccuracies and export in a wide range of formats!

Integrations & APIs

We are adding a comprehensive range of integrations and APIs for you access our powerful speech-to-text in multiple ways. Find us in Zapier to connect with thousands of application and request access to our APIs!

Team Management

Collaborate and share media, transcripts, and insights with your team! Manage different roles. Improve team productivity and output.

Coming Soon: Android & iOS Apps

In addition to our already live web app, you will soon be able to record audio right from your phone. At any moment, you're only a few taps away from unlocking the full potential of recording your research.

Capture Your Voice

When it feels right, record audio notes. Once you're done, instantly send the audio to the web app for analysis and transcription. Export the transcription in multiple formats.

Go On A Journey

Don't worry about being offline or losing valuable insights! Capture audio notes locally on your phone at no cost. This is beautiful for when you want to disconnect, roam, enjoy nature and heal like we are supposed to and still do your research.

Generate Metadata

Our platform will automatically generate insights from your audio including keywords, topics, brands, locations, people and more. Soon, we will even help you automate link-generation so you don't have to manually link ever again.

FAQs

Here are some of the most frequent questions and answers amazing researchers like you ask us.

How accurate is the automated transcription?

With good audio quality and a clear articulate speaker, you can get an 85% to 98% accurate transcription. Poor audio quality, industry-specific terms, and accents can reduce accuracy and speaker identification. Speak will analyze the file and clean up telephony audio or noisy recordings. We continue to improve our technology and increase our automated analysis accuracy.

What files types do you take?

Speak is built for ease-of-use. We are capable of analyzing most popular video files including MP4, QuickTime, FLV, WebM and AVI. We also support mainstream audio files including MP3, FLAC, AAC and WAV.

What is the going rate for speech-to-text?

As speech recognition grows, several companies have built speech-to-text technology. Most automated transcription companies range from $0.10 USD to $2.00 USD per minute. We are competitively priced and unlike transcription companies, analyze video or audio which provides additional value through export options. This includes valuable insights like topics, keywords, and brands using our machine learning algorithms. Soon, you will be able to access our automated analysis at any time with our intuitive web and mobile application.

How do we receive our transcription?

When you create an account, you can easily upload audio and video files through a web interface. As soon as your transcription is done, you will get an interactive media player. You can navigate your file and edit the media there, or export to a Word Doc (.doc), PDF (.pdf), SRT and VTT.

How long does the automated transcription take?

Although it can range depending on how optimized your audio and video files are and how busy our servers are, Speak aims to deliver a 1:1 ratio. A 10-minute video should take 10 minutes to get back after upload. Audio is often much quicker.

How do we get billed?

We currently have monthly and annual plans and a pay-as-you-go system that allows upload audio and video at anytime. Upload your file, payment is subtracted from your balance or allotted hours, or charged to your credit card and placed in the audio or video folder.

"This is a complete paradigm shift for how we do research."

Trauma Researcher at London Health Sciences Foundation

Capture. Analyze. Excel.

Automated speech-to-text to help you reduce the time, cost, and frustration of transcribing and managing media.

Decrease the time and cost to transcribe, analyze, and publish research.

Speak is built for Research

Faster Publications

Knowledge Mobilization

Nonprofit Growth

Grant Opportunities

Graduate Students

Community Sentiment

3 Easy Steps

Upload

Analyze

Export

Audio and video to text instantly

Language identification

Automated transcription

Speaker identification

Captioning

Automatic Punctuation

Translation (Coming Soon)

Embed Transcript Player

Searchable Media

Personalized Vocabulary

Transcript Editor

Integrations & APIs

Team Management

Coming Soon: Android & iOS Apps

Capture Your Voice

Go On A Journey

Generate Metadata

FAQs

Here are some of the most frequent questions and answers amazing researchers like you ask us.

"This is a complete paradigm shift for how we do research."

Trauma Researcher at London Health Sciences Foundation

Capture. Analyze. Excel.

Save 99% of your time and costs!