我是黑客使用iOS 10内置语音识别的一个小项目。我有使用设备的麦克风的工作结果,我的讲话非常准确地识别。
我的问题是,对于每个可用的部分转录,都会调用识别任务回调,我希望它能够检测停止说话的,并调用isFinal属性设置为true的回调。这是不可能的-应用程序正在无限期地监听。
SFSpeechRecognizer是否有能力检测句子的结尾?
这是我的代码-它是基于在互联网上发现的例子,它主要是一个样板,需要从麦克风源识别。我修改了它,增加了识别taskHint。我还将shouldReportPartialResults设置为false,但它似乎被忽略了。
func startRecording() {
if recognitionTask != nil {
recognitionTask?.cancel()
recognitionTask = nil
}
let audioSession = AVAudioSession.sharedInstance()
do {
try audioSession.setCategory(AVAudioSessionCategoryRecord)
try audioSession.setMode(AVAudioSessionModeMeasurement)
try audioSession.setActive(true, with: .notifyOthersOnDeactivation)
} catch {
print("audioSession properties weren't set because of an error.")
}
recognitionRequest = SFSpeechAudioBufferRecognitionRequest()
recognitionRequest?.shouldReportPartialResults = false
recognitionRequest?.taskHint = .search
guard let inputNode = audioEngine.inputNode else {
fatalError("Audio engine has no input node")
}
guard let recognitionRequest = recognitionRequest else {
fatalError("Unable to create an SFSpeechAudioBufferRecognitionRequest object")
}
recognitionRequest.shouldReportPartialResults = true
recognitionTask = speechRecognizer?.recognitionTask(with: recognitionRequest, resultHandler: { (result, error) in
var isFinal = false
if result != nil {
print("RECOGNIZED \(result?.bestTranscription.formattedString)")
self.transcriptLabel.text = result?.bestTranscription.formattedString
isFinal = (result?.isFinal)!
}
if error != nil || isFinal {
self.state = .Idle
self.audioEngine.stop()
inputNode.removeTap(onBus: 0)
self.recognitionRequest = nil
self.recognitionTask = nil
self.micButton.isEnabled = true
self.say(text: "OK. Let me see.")
}
})
let recordingFormat = inputNode.outputFormat(forBus: 0)
inputNode.installTap(onBus: 0, bufferSize: 1024, format: recordingFormat) { (buffer, when) in
self.recognitionRequest?.append(buffer)
}
audioEngine.prepare()
do {
try audioEngine.start()
} catch {
print("audioEngine couldn't start because of an error.")
}
transcriptLabel.text = "Say something, I'm listening!"
state = .Listening
}发布于 2017-03-21 11:21:12
当用户按预期停止说话时,isFinal标志似乎不成立。我想这是苹果想要的行为,因为“用户停止交谈”事件是一个未定义的事件。
我相信,要达到你的目标,最简单的方法是做以下工作:
audio session计时器var timer = NSTimer.scheduledTimerWithTimeInterval(2, target: self, selector: "didFinishTalk", userInfo: nil, repeats: false)
recognitionTask中获得新的转录时,请使定时器失效并重新启动。
timer.invalidate() timer = NSTimer.scheduledTimerWithTimeInterval(2, target: self, selector: "didFinishTalk", userInfo: nil, repeats: false)发布于 2018-09-03 08:33:11
根据我在iOS10上的测试,当shouldReportPartialResults设置为false时,您必须等待60秒才能得到结果。
发布于 2018-04-24 14:47:24
我正在使用语音短信,目前在一个应用程序,它是好的工作对我。我的recognitionTask块如下:
recognitionTask = speechRecognizer?.recognitionTask(with: recognitionRequest, resultHandler: { (result, error) in
var isFinal = false
if let result = result, result.isFinal {
print("Result: \(result.bestTranscription.formattedString)")
isFinal = result.isFinal
completion(result.bestTranscription.formattedString, nil)
}
if error != nil || isFinal {
self.audioEngine.stop()
inputNode.removeTap(onBus: 0)
self.recognitionRequest = nil
self.recognitionTask = nil
completion(nil, error)
}
})https://stackoverflow.com/questions/42530634
复制相似问题