IBM Watson Speech to Text using WebSockets

debugcn 投稿 Dev

Zaid Amir

I am trying to use the Watson Developer Cloud java SDK to transcribe large audio files. I tried the Sessionless method and it works fine, however when I try the WebSockets method things become unreliable.

Most of the time the method will just return with no SpeechResult passed to the delegates; rarely it works, but it only transcribes the first couple of seconds.

This is what my code looks like:

static SpeechResults transcript = null;
private static String SpeechToText(String audioFile) throws FileNotFoundException {
        SpeechToText service = new SpeechToText();
        service.setUsernameAndPassword("<!!USERNAME!!>", "<!!PASSWORD!!>");
        service.setEndPoint("https://stream.watsonplatform.net/speech-to-text/api");

        RecognizeOptions options = new RecognizeOptions();
        options.contentType("audio/ogg;codecs=opus");
        options.continuous(Boolean.TRUE);
        options.inactivityTimeout(-1);
        options.model(Models.GetModelName(Models.SpeechModelEnums.ArabicBroadband));
        options.timestamps(Boolean.TRUE);
        options.wordAlternativesThreshold(0.5);
        options.wordConfidence(Boolean.TRUE);

        options.interimResults(Boolean.FALSE);

        File audio = new File(audioFile);

        //This is my sessionless call
        //SpeechResults transcript = service.recognize(audio, options);


        service.recognizeUsingWebSockets(new FileInputStream(audio),  options, new BaseRecognizeDelegate()
        {
                @Override
                public void onMessage(SpeechResults speechResults){
                System.out.println(speechResults);                
                }
            }
        );

        return "";//transcript.toString();
    }

I have continuous enabled. I tried fiddling with interimResults but that did not work.

What am I doing wrong?

German Attanasio

The issue you are mentioning was fixed in the 3.0.0-RC1 version.
I've answered a similar question and added a code snippet that recognizes an audio file using WebSockets.

Starting from the 3.0.0-RC1 there is a WebSocket example in the README.

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集2021-05-29

コメントを追加

サインイン

分類Dev

Related 関連記事

記事

IBM Watson Speech to Text using WebSockets

IBM Watson Speech to Text using WebSockets

IBM Watson Speech to Text Only Returning First Word With Java SDK

ibm-watsonサービスC＃を使用したSpeech-to-Text

IBM Watson Speech To Text：SwiftSDKを使用してテキストを書き写すことができません

IBM Watson Speech-to-Text Python、「DetailedResponse」オブジェクトには属性「getResult」がありません

IBM Watson Text-to-Speechは、カスタム単語の後に文末点を発音します

Watson speech to text live stream C# code example

Error in jumps in IBM Watson

Using the AT&T Speech to Text API With Python

Using Chrome text-to-speech in a chrome extension

〜7mbより大きいファイルは、「応答が受信されていません」をスローします。IBM Watson Speech-To-Text asynccreateJob呼び出しで

IBM Watson IAMトークンは、すべてのサービスに適していますか、それとも各サービスに固有ですか（Speech-to-Textなど）？

IBM watson image recognition : time taken for training

IBM Speech to Textに最適なサウンド形式はどれですか？

Watson Speech to Textの精度を向上させるにはどうすればよいですか？

Watson Speech to Text SDKの出力全体をPythonで受信するにはどうすればよいですか？

Watson Speech to Text : Windows 10 でユーザー名とパスワードを設定する方法

Speech to Text api / library

iOS Text To Speech API

Annyang converting speech to text

Speech to text for single word

Text to speech android code

100MBを超える長い音声を使用したSpeechto Text Ibm Watson C＃

IBM Text to Speech: ドイツ語のテキストで英語の単語を正しく発音するには?

IBM Watson Assistant-ウムラウト

Is there a way to access IBM cloud watson personality insights service now(19.12.2020)?

IBM Watson VisualRecognitionを使用した顔認識

writing a text file in line by line from the speech recognition method using audio

IBM Watsonでcom.ibm.watson.developer_cloud.service.exception.NotFoundExceptionを解決する方法は？

Calling SpeechAPI for text to speech on Azure