Q: Google Speech API transcription response is repeated multiple times

Stack Overflow user

Asked on 2018-07-25 13:09:31

Answers: 1 | Views: 532 | Followers: 0 | Votes: 0

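I am using the latest version (0.35.0) of Google's Python library, and I get results like the ones below: the words from the first transcription result are repeated in the second result, and so on until the end. This was not the case in the previous version (0.34.0).

Source code: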

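# Imports assumed: the diarization and alternative-language fields
# belong to the v1p1beta1 API surface of google-cloud-speech.
from google.cloud import speech_v1p1beta1 as speech
from google.cloud.speech_v1p1beta1 import enums

config = speech.types.RecognitionConfig(
    encoding=enums.RecognitionConfig.AudioEncoding.FLAC,
    sample_rate_hertz=48000,
    language_code='en-US',
    alternative_language_codes=['en-IN'],
    # max_alternatives=10,
    profanity_filter=True,
    enable_word_time_offsets=True,
    enable_word_confidence=True,
    enable_automatic_punctuation=True,
    enable_speaker_diarization=True,
    diarization_speaker_count=5,
    # model="video",
    use_enhanced=True)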

Result:

results {
    alternatives {
        transcript: "start"
        confidence: 0.632519185543
        words {
            start_time {}
            end_time {
                seconds: 5
                nanos: 900000000
            }
            word: "start"
            confidence: 0.655210196972
            speaker_tag: 1
        }
    }
}

.....
.....
.....

results {
    alternatives {
        transcript: "end"
        confidence: 0.632519185543
        words {
            start_time {}
            end_time {
                seconds: 5
                nanos: 900000000
            }
            word: "start"
            confidence: 0.655210196972
            speaker_tag: 1
        }
        words {
            start_time {
                seconds: 129
                nanos: 300000000
            }
            end_time {
                seconds: 130
                nanos: 400000000
            }
            word: "end"
            confidence: 0.624447464943
            speaker_tag: 1
        }

    }
}

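Questions:

Why am I getting multiple results in the response?
Why are words repeated across all the result sets? Previously, each result set contained only the words spoken within its own time frame.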

python

google-cloud-speech

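1 Answer

Stack Overflow user

Answered on 2019-01-09 00:40:14

It looks like Google notes something similar in their documentation:

Note: When this is true, we send all the words from the beginning of the audio for the top alternative in every consecutive response. This is done in order to improve our speaker tags as our models learn to identify the speakers in the conversation over time.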

https://cloud.google.com/speech-to-text/docs/reference/rpc/google.cloud.speech.v1p1beta1#google.cloud.speech.v1p1beta1.RecognitionConfig
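The quoted note appears to describe `enable_speaker_diarization`, which the question's config sets to True. If that is the cause, one way to avoid counting words twice is to read the word list only from the last result, since it is cumulative. The sketch below illustrates the idea under that assumption. The `Word`, `Alternative`, and `Result` classes and both helper functions are hypothetical stand-ins that only mirror the field names in the output above, so the example runs without the client library or credentials.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


# Hypothetical stand-ins for the API's response messages; only the field
# names (alternatives, transcript, words, word, speaker_tag) follow the
# output shown in the question.
@dataclass
class Word:
    word: str
    speaker_tag: int


@dataclass
class Alternative:
    transcript: str
    words: List[Word] = field(default_factory=list)


@dataclass
class Result:
    alternatives: List[Alternative] = field(default_factory=list)


def words_from_last_result(results: List[Result]) -> List[Word]:
    """Return the words of the final result's top alternative.

    Assumes diarization is on, so this list is cumulative and already
    covers the audio from the beginning.
    """
    if not results or not results[-1].alternatives:
        return []
    return list(results[-1].alternatives[0].words)


def group_by_speaker(words: List[Word]) -> List[Tuple[int, str]]:
    """Merge consecutive words with the same speaker_tag into (tag, text)."""
    segments: List[Tuple[int, str]] = []
    for w in words:
        if segments and segments[-1][0] == w.speaker_tag:
            tag, text = segments[-1]
            segments[-1] = (tag, text + " " + w.word)
        else:
            segments.append((w.speaker_tag, w.word))
    return segments


# Sample data shaped like the first and last results in the question.
results = [
    Result([Alternative("start", [Word("start", 1)])]),
    Result([Alternative("end", [Word("start", 1), Word("end", 1)])]),
]

all_words = words_from_last_result(results)
print(group_by_speaker(all_words))  # [(1, 'start end')]
```

In the output shown in the question, the per-result `transcript` field still holds only that result's own text ("start", then "end"), so a plain transcript can still be assembled result by result; only the `words` list accumulates.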

Votes: 0

Original page content provided by Stack Overflow.

Original link:

https://stackoverflow.com/questions/51519720
