
How to use the Mac OS X speech-to-text API

September 5, 2014

I have a program that receives a mono audio stream over TCP/IP. I am wondering whether the speech-recognition API in Mac OS X would be able to do the speech-to-text conversion for me.

(I don't mind saving the audio to a .wav file first and reading it back, as opposed to doing the conversion on the fly.)

I have read the official docs online, but they are a bit confusing, and I couldn't find any good examples on this topic.

Also, should I do it in Cocoa/Carbon/Java or Objective-C?

Can someone please shed some light?

Thanks.

There are a number of examples that get copied under /Developer/Examples/Speech/Recognition when you install Xcode.

The Cocoa class for speech recognition is NSSpeechRecognizer.
I've not used it, but as far as I know speech recognition requires you to build a grammar so that the engine picks from a fixed set of choices, rather than allowing you to pass free-form input. This is all explained in the examples referred to above.
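
For the save-to-.wav route mentioned in the question, here is a minimal Java sketch (Java being one of the languages asked about) that wraps raw PCM bytes in a WAV container using the standard javax.sound.sampled classes. The class name PcmToWav, the method writeWav, the 16 kHz sample rate and the 16-bit signed little-endian mono layout are all assumptions for illustration; the real values have to match whatever the sender puts on the TCP stream. This only covers getting the audio into a file, not the recognition step itself.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;

// Hypothetical helper: turns a buffer of raw mono PCM into a .wav file.
class PcmToWav {

    // Writes the PCM bytes to `out` as WAV and returns the number of bytes written.
    static int writeWav(byte[] pcm, float sampleRate, File out) {
        // Assumed layout: 16-bit samples, 1 channel, signed, little-endian.
        AudioFormat format = new AudioFormat(sampleRate, 16, 1, true, false);
        // AudioInputStream expects its length in sample frames, not bytes.
        long frames = pcm.length / format.getFrameSize();
        try (AudioInputStream in = new AudioInputStream(
                new ByteArrayInputStream(pcm), format, frames)) {
            return AudioSystem.write(in, AudioFileFormat.Type.WAVE, out);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Stand-in for data read from the socket: one second of silence at 16 kHz.
        byte[] pcm = new byte[32000];
        File out = new File("capture.wav");
        int written = writeWav(pcm, 16000f, out);
        System.out.println("wrote " + written + " bytes to " + out.getPath());
    }
}
```

In a real program the byte array would be filled from the socket's InputStream instead of being allocated empty, and the sample rate passed to writeWav would come from whatever the sending side documents.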
