Transcription apps—apps that report and transcribe the textual content of a dialog—have confirmed to be a particularly helpful method for many people to take notes. This is applicable not solely to journalists, who, in fact, typically report their interviews. For instance, when you find yourself caring for a sick member of the family, it is rather helpful to have a recorded and transcribed transcript of your dialog with the physician. After which whenever you cope with the insurance coverage firm consultant, properly,” Naff mentioned.
There are two varieties of transcription companies accessible on the web at present: one makes use of a man-made intelligence engine and the opposite makes use of human transcribers. The latter technique is normally way more correct, but in addition considerably costlier. In consequence, most individuals use AI-powered companies to interpret and transcribe audio—and admittedly, as AI companies enhance, so do transcriptions. Here’s a checklist of some accessible AI-powered transcription companies.
One factor to remember is that the standard of the transcription offered by these apps can fluctuate enormously relying not solely on the AI engine the app makes use of, but in addition on the standard of your audio file. If there are numerous voices talking on the similar time, if there may be a number of background noise, if the accents of the audio system are unfamiliar to the synthetic intelligence instrument, all this may result in a lower within the accuracy of the transcription. So, it is a good suggestion to check out the transcription service with a typical file to see how properly it really works.
And take into consideration which utility is likely to be essentially the most cost-effective for you. Should you solely have to obtain recordsdata sometimes, it is best to make use of the free model or one of many pay-as-you-go companies. Should you obtain content material usually, a month-to-month or annual subscription could also be best for you.
Otter was some of the standard transcription companies—properly, that’s, till August 2022, when it introduced a discount within the stage of companies offered beneath two of its plans and raised the worth of their month-to-month plan.
That mentioned, Otter gives a fairly spectacular array of companies, together with the power to simply report Zoom and Google Meet conferences, in addition to arrange transcriptions into folders and contacts into teams. There’s additionally a separate AI function that helps with content material discovery, and every transcription consists of an AI-generated abstract, together with a listing of actions and a top level view.
As talked about, there have been numerous modifications within the firm’s pricing and options. For instance, free customers not have entry to all of their previous transcriptions – solely the final 25 (the remaining can be archived). You should utilize as much as 300 minutes of transcription per thirty days, with a most of half-hour per dialog, and import as much as three audio or video recordsdata.
Paying prospects utilizing Otter’s Professional plan ($16.99 per thirty days or $110.04 per yr) as soon as had a month-to-month restrict of 6,000 minutes of transcribed audio and a most of 4 hours of discuss time; these days they’re given 1200 minutes and 90 minutes to speak; however all their conversations can be found and so they can import 10 audio or video conversations per thirty days.
Otter’s marketing strategy ($30 per thirty days or $240 per yr) nonetheless consists of 6,000 minutes per thirty days/4 hours of discuss time, in addition to different options.
Temi is a primary transcription service owned by the identical firm as Rev. Actually, whenever you first go there, you may seemingly be informed to attempt Rev first. When you get previous that, Temi gives options like the power to view and edit your transcriptions, decelerate playback, and export your recordsdata to textual content (Microsoft Phrase, PDF) or closed captioning recordsdata (SRT, VTT). Its cell apps for Android and iOS will let you report audio; You possibly can then transcribe it for 25 cents per audio minute, or add your personal recordings for a similar value. New customers obtain the primary 45 minutes freed from cost.
The Reverend has been round for some time; till lately, it was accessible primarily to those that wanted human transcription companies. The corporate then launched Rev Max, an AI-powered transcription service that provides 20 hours of automated transcription and Zoom transcripts for $29.99 per thirty days. (Should you cross the 20-hour mark, you may be charged 25 cents per minute earlier than the beginning of the following month.) You additionally get a 5 % low cost on any human transcription companies. There’s a 14-day free trial, however you need to insert a bank card to obtain it.
MeetGeek calls itself an “AI-powered assembly assistant.” In different phrases, its focus is on transcribing conferences (although it may be used for different audio as properly). It has a free model that permits you to create transcripts from audio and video sources—you possibly can report 5 hours of audio per thirty days and save three months’ price of transcripts and one month’s price of audio. For $19 per thirty days or $180 per yr, the Professional model provides you 20 hours of transcription per thirty days, one yr of transcript storage, and 6 months of video storage. There are additionally Enterprise and Enterprise variations. New customers get a 14-day trial of the marketing strategy, which prices $39 per thirty days or $372 per yr and provides you 100 hours of transcription per thirty days, limitless transcript storage, and 12 months of video storage.
Trint’s web site clearly exhibits that the corporate gives its AI transcription companies to inventive customers; one of many headlines on the entrance web page reads: “Our DNA is a Storytelling.” In accordance with Trint, he can transcribe in additional than 40 completely different languages. The Starter 300 plan ($80 per thirty days or $624 per yr) permits you to transcribe as much as 300 minutes per thirty days and make three translations per thirty days, report audio from a cell app (iPhone or Android), and edit and share transcripts. The Superior Plan ($100/month or $720/yr) provides 1,200 minutes of transcription, 20 translations, and the power to automate workflows. The seven-day free trial permits you to attempt the superior plan.
Sonix gives computerized translations into greater than 49 languages. It consists of the standard potential to edit your transcripts, a word-by-word timestamp, and the power to load transcripts from different applications and sew them into new ones. Like many trendy transcription companies, it has added some synthetic intelligence options resembling computerized subtitles and summaries. You possibly can export your transcripts to DOCX, TXT, and PDF codecs, and export subtitles to SRT and VTT codecs. Sonix begins with an ordinary pay-as-you-go plan that prices $10 per audio hour (prorated to the minute). There’s additionally a Premium subscription plan ($5 per audio hour, plus $22 per thirty days or $198 per yr), which provides quite a lot of options and 100GB of storage. New customers obtain 30 free minutes of transcription.
Whereas MeetGeek focuses on assembly transcriptions, Alice payments itself as a transcription service for journalists. Different companies retailer your transcripts (some with closing dates, some with out) and will let you edit them on-line, however Alice does not; as an alternative, it sends the audio file and transcript to your e mail deal with and provides them to your Google Drive or Dropbox. It is also straightforward to make use of; simply faucet wherever on the app in your cellphone to launch it and swipe to pause it. Alice pays as you go: $9.99 for one or two hours of audio; $99.80 for 20 hours; or $299 for 100 hours. You get your first 60 minutes free and may use it with the iOS app or on-line. There isn’t a Android utility.
When you’ve got an Android cellphone, one of many best methods to get an honest transcription is to make use of the free Google Recorder app. (When you’ve got a Pixel, you might have already got it; in any other case, you possibly can obtain Recorder from the Play Retailer and see if it really works along with your cellphone.) To start out recording, simply press the large purple button. To pause, press it once more. Smaller buttons on either side will let you delete or save a recording. Above the button is the audio playback time, and above it are two buttons for audio and transcript. To view the textual content, click on “Transcript”. You possibly can edit the textual content, search it (it is Google, in spite of everything), and share the audio or transcript. When you’ve got a Pixel 6 or later, you possibly can allow completely different labels for various audio system.
Whisper by OpenAI is an open supply transcription undertaking that’s straightforward to make use of, particularly when you choose to retailer transcriptions within the cloud. Eat Mac app accessible it makes it straightforward to put in and use in case you are not accustomed to Python and developer instruments; if sure, then a lot the higher. Should you’re utilizing the Mac desktop app, the free model gives a number of ranges of transcription (the slower the higher); The Professional model prices $6.99 per thirty days or $24.99 per yr (with a seven-day free trial) and allows you to do issues like transcribe podcasts and YouTube URLs. (Will OpenAI be in hassle because of educating your software program utilizing YouTube that is one other query.)
Replace April 11, 2024 4:42 pm ET: This text was initially printed on August 24, 2022. Since then, a number of posts have been up to date, together with details about Otter’s marketing strategy, and posts have been added for Rev Max, Alice, Google Recorder, and Whisper, and a put up has been added for Scribie. was eliminated.