google-cloud-platform - 谷歌语音 API 单一话语

Question

Google Speech API 是如何SingleUtterance工作的？根据文档，这是谷歌确定说话者何时说出单一话语的方式。我明白它的作用，但我想知道怎么做？API 是否只是等待一段时间的“无声”音频？如果是这样，无声音频持续多长时间会触发话语的结束？

它是否有其他类型的 AI 算法来帮助确定某人何时停止说话？

谢谢

score 2 · Accepted Answer

I don't think details are exposed, in my opinion detection of audio ending is a decision of the API. Instead, the it offers the way to identify when such decision has been made.

In normal conditions the stream will continue to listen and process audio until either the stream is closed directly, or the stream's limit length has been exceeded. In such situation single_utterance is not required to be set.

When you require it (voice commands, for example) and set single_utterance=true, the API decides when to finish recognition and sends to your client the END_OF_SINGLE_UTTERANCE event and cease recognition.

google-cloud-platform - 谷歌语音 API 单一话语

1 回答 1

Related

Reference