Transcribe Recordings With MacSpeech Scribe
In recent years, dictation software has become a firmly entrenched reality. It is perfectly possible to sit at your computer wearing a headset and speak to the computer and have it transcribe, with astonishing accuracy, the words that you speak. But what if you are not sitting at your computer? What if you have an idea that requires later transcription, and all you have with you is some sort of recording device? The promise of MacSpeech Scribe is a solution to that problem.
MacSpeech Scribe (from the makers of MacSpeech Dictate, the speech recognition application) does not pretend to have the human ability to recognize just anyone’s speech. You have to train it, and the speech that it recognizes is yours, and yours alone, the result of your deliberately dictating into a particular digital recording device. So you’re not going to be using MacSpeech Scribe to transcribe a teacher’s lecture, let alone a debate.
Nevertheless, there are good reasons why the capability to transcribe one’s own speech from a recording might be preferable to real-time dictation. As I’ve already suggested, you might not have a computer with you at the moment you’d like to dictate something. Also, there are significant psychological and even physical differences between dictating directly to your computer and speaking into a recording device. I find that something about the computer sitting there waiting for me, the necessity of wearing the headset, the importance of maintaining strict silence, and other factors combine to make me extremely nervous and tongue-tied. I feel more relaxed talking into a digital recorder. I feel I have time to collect my thoughts. Also, I
can clean up the digital file a little with a sound editor program before I hand it over to MacSpeech Scribe, so I’m less nervous about errors than I am when the computer is listening to me.
Another reason why MacSpeech Scribe might be more congenial than MacSpeech Dictate is that the user interface is simpler. MacSpeech Dictate allows you to dictate directly into any application. The price of that power is that you then have to use your voice and some floating windows to make any corrections; you must not make corrections directly by typing, because then you would be acting behind the program’s back, as it were, and it would not know what edits you had made to the dictated material. MacSpeech Scribe, on the other hand, is far simpler, both when you are doing your original training, and when you are transcribing an actual sound file. The folks at MacSpeech, which was recently bought by Nuance, the company from which MacSpeech licensed the speech recognition engine used in Dragon NaturallySpeaking for PC, reduced the interface to a single extremely simple window. I find it quick and easy to make the very few corrections that might be necessary when MacSpeech Scribe transcribes a recording into text.
As a demonstration of the sort of thing that MacSpeech Scribe can do, I dictated almost the entirety of the first draft of this article using MacSpeech Scribe and a digital recording device, the Zoom H2. To give you a sense of what the experience is like, I’ve uploaded a portion of the actual recording of myself speaking the original first draft, just making it up out of my head and saying it to the H2, along with MacSpeech Scribe’s transcription of that section of the recording, without any edits or changes. You can compare
the two and see how accurately the program is able to interpret the recording. I think the results speak for themselves.
Training MacSpeech Scribe is simple. You speak to your device, enough to make a recording of at least 2 minutes in length; then you hand that recording over to MacSpeech Scribe. The program transcribes the first 15 seconds of the recording, and you run through the transcription phrase by phrase, either accepting or correcting each phrase. The program then starts over and transcribes the first 90 seconds of the recording, and you do the same thing. This is enough for MacSpeech Scribe to generate an initial voice profile for you; you can give it more recorded material for additional training and additional resulting accuracy.
Transcribing is equally simple. You hand your recording over to MacSpeech Scribe. It presents the text result very quickly (much more quickly than it was spoken originally; I’m not sure how that magic is performed), in a window with two panes. When you click on any part of the text in the first pane, possible corrections appear in the second pane. If the correction you want isn’t there, you can edit a correction that is there. You then click a button to enter that correction in place of the original interpretation. The accuracy seems very high, especially for non-technical subjects. Vocabulary can be added manually, a word or phrase at a time, or by giving MacSpeech Scribe a text file to analyze.
Despite all this simplicity, the program has some bugs. For example, there’s a checkbox to stop MacSpeech Scribe from checking online for a new version of the program every time it starts up, but your setting here is forgotten. And I several times got mysterious error dialogs about not being able to find a needed file or folder, and had to quit the program and start it up again.
My biggest complaint is about the manual and online help. Nothing tells you what punctuation you’re allowed to say, a serious omission. Beyond that, I confess, I have a dog in this fight: I wrote the original manual and online help for MacSpeech Dictate, and these have been edited badly to create the help for MacSpeech Scribe. Thus the Scribe manual starts out with some material that’s true of Dictate but false and irrelevant for Scribe, and a careless global replacement turned my sentence “Dictate, don’t talk” into “Scribe, don’t talk.” I wasn’t paid or credited for this reuse of my work, and considering the nature of the result, perhaps that’s just as well.
Still, I find it astonishing that a program like MacSpeech Scribe is even possible. You’re up and running, with the program trained and ready to go, in just a few minutes; after that, you have your own personal transcription secretary and you’re ready to dictate the Great American Novel while you’re out for a walk in the woods.
MacSpeech Scribe costs $149.99, and requires an Intel-based Mac running Mac OS X 10.6 Snow Leopard. Audio files must be WAV, AIFF, or AAC, and should be as high quality as possible; you can dictate into your computer or into a digital recorder (including, according to the manual, an iPhone).
If you buy MacSpeech Scribe, is there any reason (besides controlling your computer) to buy MacSpeech Dictate?
Along with controlling your computer, MacSpeech Dictate is for ad-hoc dictation, where you can't predict ahead of time what you'd want to say. So, for instance, you wouldn't use Scribe to dictate email messages, since you wouldn't know what you wanted to say until you'd read them and started the reply, whereas Dictate would be useful there.
It's also about your workflow. If you're away from your Mac, and want to capture ideas, it's convenient just to speak your thoughts. So, how do you turn that speech to text? Well, as an example, the iPhone has a really good microphone, so it's a pretty great tool for "voice productivity". Get the Dragon Dictation app for dictating brief notes that you need transcribed right then. Or use the Voice Memos app to record your speech -- and move the resulting audio file to your Mac (via e-mail or via iTunes) and let MacSpeech Scribe transcribe it. HTH.
Yes, that does. Thanks!
Also, does MacSpeech Scribe handle more than one voice (assuming you train it)?
Another great Neuburg article.
I intend to abandon Dictate for Scribe. Especially to be able to use my iPhone as a dictating machine.
But I was puzzled by the mechanics of transferring files. Emailing a dictated text is no problem, but Matt writes about editing audio files:
"Also, I can clean up the digital file a little with a sound editor program before I hand it over to MacSpeech Scribe . . ." Audio files still puzzle me after some 20mac years.Which sound program? In Voice Memo I find the Trim Memo file, but one can't really edit the audio with that command.
And Jay G writes about moving an audio file to my Mac. Does he mean at text file? Where does one find the audio file?
Sorry to so much like a Newbie.
I can tell you how I edit sound files - with Amadeus Pro, which is my sound file editor of choice (I've reviewed it in TidBITS). I'm not clear on how you'd move the sound file from an iPhone to the Mac, though; that's why I didn't give any details about that. I'll try to figure it out...!
The Voice Memos program works on iPhone or iPod touch. There are some other good audio recording apps for iPhone/iPod touch/iPad, but we haven't tested these. They need to produce audio files in .wav, .mp4, .m4a, .m4v, .aif, or .aiff.
For the iPod touch, you'll need a 2nd generation or later iPod touch with iPhone OS 3.x...which will automatically install the Voice Memos app. You'll also need an input mic, such the Apple Earphones with Remote and Mic.
When you sync your iPhone or iPod touch with your Mac, it will sync Voice Memos to iTunes. From iTunes, you can drag them or save them to the desktop.
Or, you can click the "Share" button in Voice Memos, and choose "Email"...and email yourself the file. I prefer this method.
MacSpeech Scribe has an audio player built in, and you can use it to trim the audio file, as well as play it back.
I bought MacSpeech Dictate last year thinking it had voice recorder capability like the old iListen / iDictate apps had before. I should have researched before I purchased of course. Turns out MacSpeech put that functionality into recently released Scribe. MacSpeech have now been bought by Nuance and I was hoping that there would be an attractive crossgrade price. Nope. Frustrating that Mac users are required to pay such a premium when you compare to Dragon Systems cost about 200, has USB mic or digital recorder input and uses the same voice recognition engine as the MacSpeech products. Sigh
I know this thread is for Scribe but this might be useful. MacSpeech Medical is a crossgrade for about 500.00. I had Dictate "learn" medical terms from a list of terms I found on the net (http://www.medword.com/medterms_aTEXT.html). A bit tedious to to work through the alphabet but works great for me.