IBM Waston Speech to text APIのキーワードスポッティング機能を使用するにはどうすればよいですか？

debugcn 投稿 Dev

Sourav Bhattacharjee

IBM Watson Speech to text APIを使用して、オーディオファイルをテキストに変換しています。すべての機能が正常に機能しています。しかし、キーワードスポッティング機能を使用できません。出力は、発見されたキーワードに関する情報を提供していません。

これが私のコードです：

SpeechToText service = new SpeechToText();
    service.setUsernameAndPassword("*********", "********");
    //SpeechModel model =service.getModel("en-US_NarrowbandModel");


    service.setEndPoint("https://stream.watsonplatform.net/speech-to-text/api");

    String[] keys= {"abuse","bullying","parents","physical","assaulting"};
    RecognizeOptions options = new RecognizeOptions().contentType("audio/wav").model("en-US_NarrowbandModel").continuous(true).inactivityTimeout(500).keywords(keys).keywordsThreshold(0.7);


    File audio = new File("C:\\Users\\AudioFiles\\me.wav");

    SpeechResults transcript = service.recognize(audio, options);
    //Speech t1 = service.recognize(audio, options);
    System.out.println(transcript);

発見されたキーワードをトランスクリプトとともに出力として取得するための特別な機能はありますか？

ドイツのAttanasio

これはJavaSDKで修正されましたv3.2.0。必ず最新バージョン（4.2.1）jar：java-sdk-4.2.1-jar-with-dependencies.jarをダウンロードするか、Gradle / Mavenを更新して最新バージョンをプルしてください。

以下のコードは、質問のコードに基づいています。

SpeechToText service = new SpeechToText();
service.setUsernameAndPassword("USERNAME", "PASSWORD");

File audio = new File("C:\\Users\\AudioFiles\\me.wav");    

RecognizeOptions options = new RecognizeOptions().Builder()
  .contentType("audio/wav)
  .inactivityTimeout(500)
  .keywords({"abuse", "bullying", "parents", "physical", "assaulting"})
  .keywordsThreshold(0.5)
  .build();

  SpeechResults transcript = service.recognize(audio, options).execute();
  System.out.println(transcript);

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集2021-07-11

コメントを追加

サインイン

分類Dev

Related 関連記事

記事

IBM Waston Speech to text APIのキーワードスポッティング機能を使用するにはどうすればよいですか？

IBM Waston Speech to text APIのキーワードスポッティング機能を使用するにはどうすればよいですか？

IBM Speech to Textに最適なサウンド形式はどれですか？

IBM Watson Speech to Text using WebSockets

IBM Text to Speech: ドイツ語のテキストで英語の単語を正しく発音するには?

ibm-watsonサービスC＃を使用したSpeech-to-Text

IBM Watson Speech to Text Only Returning First Word With Java SDK

Speech to Text api / library

iOS Text To Speech API

IBM Watson Speech To Text：SwiftSDKを使用してテキストを書き写すことができません

Bluemixサービスの会話、Speech-To-Text、Text-To-SpeechをAndroidに統合するにはどうすればよいですか？

IBM Watson Text-to-Speechは、カスタム単語の後に文末点を発音します

IBM Watson IAMトークンは、すべてのサービスに適していますか、それとも各サービスに固有ですか（Speech-to-Textなど）？

AzureのText-To-Speechを使用して、ライブテキスト読み上げの代わりにオーディオファイルを作成するにはどうすればよいですか？（C＃Unity SDK）

Microsoft Text to Speechエンジンに音声を追加するにはどうすればよいですか？

Speech-to-text large audio files [Microsoft Speech API]

Watson Speech to Text SDKの出力全体をPythonで受信するにはどうすればよいですか？

IBM Watson Speech-to-Text Python、「DetailedResponse」オブジェクトには属性「getResult」がありません

Google Cloud Text-to-Speech Interface Confusion（mp3ファイルをダウンロードするにはどうすればよいですか？）

Watson Speech to Textの精度を向上させるにはどうすればよいですか？

Google Speech-to-Text APIで複数のstreamingRecognizeリクエストを続行するにはどうすればよいですか？

Siri Kit (Speech to Text) で TTS (Text to Speech) iOS を無効にする

Using the AT&T Speech to Text API With Python

プログラムの実行時にMicrosoftAzure Speech To Textで文字起こしを開始するにはどうすればよいですか？（Unity、C＃）

〜7mbより大きいファイルは、「応答が受信されていません」をスローします。IBM Watson Speech-To-Text asynccreateJob呼び出しで

Annyang converting speech to text

Speech to text for single word

Text to speech android code

ExtJS 4.2.1で要素（Text、Label、displayField）のテキストでワードラップを作成するにはどうすればよいですか？

Speech to Text オーディオ形式

Text2Speechエラー、ブラウザにURLを直接入力してオーディオを再生するにはどうすればよいですか？