黒岩眞吾

クロイワシンゴ (Shingo Kuroiwa)

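Basic Information

Affiliation: Professor, Graduate School of Engineering, Chiba University

Degree: Doctorate (Electronic Engineering, Graduate School of Electro-Communications, The University of Electro-Communications)

Researcher number: 20333510
J-GLOBAL ID: 200901017262764603
researchmap member ID: 1000356498

External link: http://www.ailab.tj.chiba-u.jp/~kuroiwa/

Research Keywords

Research Areas

Career

October 2007 - Present

Professor, Graduate School of Engineering, Chiba University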

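Awards

April 2017

2017 University of Electro-Communications Alumni Association Award, for major contributions to society through the practical application of speech recognition systems and the development of communication support devices for people with aphasia, Meguro-kai (General Incorporated Association)

黒岩眞吾
March 2017

Chiba Area Industry-Academia-Government Collaboration Open Forum 2016, Chiba University President's Award (Excellence Award), "Language training system for people with aphasia" using robots and tablets, Chiba University

黒岩眞吾
1997

5th (FY1997) Technical Development Award, Acoustical Society of Japan

黒岩眞吾, 中村誠, 山本誠一, 酒寄信一, 武笠貴史, 藤岡雅宣, 阿部信子
1997

President's Commendation (Business Improvement), Kokusai Denshin Denwa Co., Ltd.

黒岩眞吾, 中村誠, 山本誠一, 酒寄信一, 武笠貴史, 藤岡雅宣, 阿部信子
1997

FY1996 IEICE Academic Encouragement Award, Institute of Electronics, Information and Communication Engineers (IEICE)

山本誠一, 武田一哉, 井ノ上直己, 黒岩眞吾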

Papers

136

Food Recognition Using Large-scale Pre-trained Speech Models

Satoshi Naito, Masafumi Nishimura, Masafumi Nishida, Yasuo Horiuchi, Shingo Kuroiwa

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 119-120 2024

Obese and overweight individuals are at high risk for chronic diseases such as sleep apnea and diabetes. Therefore, it is necessary to track eating behavior to determine the causes of obesity; however, it is time- and labor-intensive to follow the lives of specific individuals and observe their eating behavior. Thus, a method to automatically monitor eating behavior should be considered. As one approach to monitoring methods, we propose a method for convenient recognition of food category for food intake sounds recorded by microphones (below the ear microphone, throat microphone and acoustic microphone), which is less burdensome to the body and better from the viewpoint of privacy protection. Furthermore, a comparison of MFB and large-scale pre-trained speech models (wav2vec2.0, wavLM, and HuBERT) showed the effectiveness of large-scale pre-trained speech models in the food recognition task.
Text-Dependent Speaker Verification Using SSI-DNN Trained on Short Utterance

Kentaro Kameda, Satoru Tsuge, Shingo Kuroiwa, Yasuo Horiuchi, Masafumi Nishida

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 808-810 2024

To enhance speaker verification for short utterances, we have developed a Same Speaker Identification Deep Neural Network (SSI-DNN). This network identifies whether two utterances are uttered by the same speaker with greater accuracy by focusing on the same texts. In this paper, we extend the detection target of the SSI-DNN from monosyllabic utterances to word utterances to improve the speaker recognition performance. Experimental results showed that the SSI-DNN trained on word utterances achieved an EER of 0.1% to 2.8%. These results indicated that the SSI-DNN outperformed the x-vector-based speaker verification method, which is a representative speaker verification method.
Template-Based Speech Recognition Using Pre-trained Large Speech Models for Voice-Activated Shower Control

Takumi Uehara, Shingo Kuroiwa, Yasuo Horiuchi, Masafumi Nishida, Satoru Tsuge

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 141-143 2024

Hands-free control of shower settings, such as temperature, is highly desirable because it improves convenience when both hands are occupied or the eyes are closed. In this paper, we propose a speaker-dependent, template-based isolated word recognition system using pre-trained large speech models (LSMs) to realize voice-activated shower control with a single microphone. Specifically, we examine the performance of three LSMs (wav2vec2.0, HuBERT, WavLM) as well as conventional MFCC as features. Additionally, we investigate speech enhancement using a Convolutional Recurrent Neural Network (CRN) to improve robustness against shower noise. Our experiments on recognizing 30 words at SNRs ranging from -5 dB to 20 dB demonstrate that HuBERT achieves the highest recognition accuracy (77.8 to 95.6%). CRN-based enhancement, on the other hand, improved recognition accuracy only under the -5 dB condition, and even there accuracy reached only 80.8%.
Emotion-Dependent Speaker Verification Based on Score Integration

Hibiki Takayama, Masafumi Nishida, Satoru Tsuge, Shingo Kuroiwa

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 805-807 2024

Recent advances in AI technology have brought not only many benefits but also considerable risks due to malicious use of the technology. One key example is spoofing of speaker verification systems through speech synthesis and voice conversion. To tackle this challenge, we previously proposed a two-step matching method for robust speaker verification, in which a user specifies an emotion to the system in advance and is accepted only when speaking with the specified emotion. This previous method reduced the false acceptance rate but increased the false rejection rate. To overcome this problem, we propose in this work a novel method that integrates speaker and emotion verification scores. Experiments revealed that the proposed method reduces the equal error rate compared with the conventional method by assigning optimal weights to the speaker and emotion information contained in the speech.
Utterance-style-dependent Speaker Verification by Utilizing Emotions

Hibiki Takayama, Masafumi Nishida, Satoru Tsuge, Shingo Kuroiwa, Masafumi Nishimura

2023 IEEE 12th Global Conference on Consumer Electronics (GCCE) October 10, 2023

MISC

588

Data collection and evaluation of AURORA-2 Japanese corpus (co-authored)

Proceedings of IEEE Automatic speech recognition and understanding workshop (ASRU2003) 619-623 2003
Researches on the emotion measurement system (co-authored)

IEEE International Conference on System, Man & Cybernetics 2003 (SMC2003) 1666-1672 2003
Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end (co-authored)

Proc. of NLPKE 2003 585-590 2003
Super-Function based Japanese-English machine translation (co-authored)

Proc. of NLPKE2003 555-560 2003
An acoustic model adaptation using HMM-based speech synthesis (co-authored)

Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLPKE2003) 368-373 2003
Missing Feature Theory applied to Robust Speech Recognition over IP Network (co-authored)

Proc. of Eurospeech 2003 3081-3084 2003
Evaluation of ETSI Advanced DSR Front-end and Bias Removal Method on the Japanese Newspaper Article Sentences Speech Corpus (co-authored)

Proc. of Eurospeech 2003 2145-2148 2003
Integration of Noise Reduction Algorithms for Aurora2 Task (co-authored)

Proc. of Eurospeech 2003 1769-1772 2003
Blind Equalization Techniques for ETSI Standard DSR Front-end (co-authored)

Proceedings of 2003 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2003) 1 392-395 2003
Information retrieval using relevance feedback with support vector machines (co-authored)

IPSJ Journal 44(1) 59-67 2003
Activity report of the SLP working group on the evaluation of noisy speech recognition

中村哲, 武田一哉, 黒岩眞吾, 山田武志, 北岡教英, 山本一公, 西浦敬信, 藤本雅清, 水町光徳

IPSJ SIG Technical Reports, Spoken Language Processing (SLP) 2002(65) 65-69 July 12, 2002

This paper reports the current status of the working group on the evaluation of noisy speech recognition, established in October 2001 within the IPSJ Special Interest Group on Spoken Language Processing (SLP). In addition to defining evaluation methods and a common corpus for noisy speech recognition, the working group aims to develop algorithms in conjunction with the European ETSI AURORA project on noisy speech recognition algorithms.
A distributed speech recognition method robust to variations in frequency characteristics

柘植覚, 黒岩眞吾

IPSJ SIG Technical Reports, Spoken Language Processing (SLP) 2002(65) 77-84 July 12, 2002

With the growth of mobile phones, wireless mobile environments based on handheld terminals are spreading rapidly. Because such terminals are generally very small, operating them through their built-in input devices is difficult. Voice operation is one way to address this problem. However, the memory, CPU, and other hardware in a handheld terminal are not yet capable of performing all of the processing required for medium- or large-vocabulary speech recognition. Distributed Speech Recognition (DSR) was therefore proposed: the terminal performs acoustic analysis and feature parameter compression and transmits the result to a server, which reconstructs the feature parameters and performs recognition. DSR requires a common format for the data transmitted between terminal and server, and standardization is under way at the European Telecommunications Standards Institute (ETSI). This paper reports the results of continuous speech recognition experiments on a Japanese speech corpus using the ETSI standard DSR front-end. Because this front-end uses vector quantization (VQ) to compress feature parameters, differences in the frequency characteristics of the input system are likely to increase VQ distortion and thereby degrade recognition accuracy. We therefore propose the Bias Removal Method (BRM), which reduces the distortion between feature vectors and the VQ codebook caused by such differences. The experiments showed that using non-quantized features in acoustic model training improves the recognition performance of DSR front-end features, and that the proposed method reduces the degradation in recognition accuracy caused by differences in frequency characteristics.
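Speech recognition experiments using the ETSI standard distributed speech recognition front-end

柘植覚, 黒岩眞吾, 任福継, 北研二

Proceedings of the Meeting of the Acoustical Society of Japan 2002(1) 171-172 March 18, 2002
"Information Retrieval Algorithms" by 北研二, 津田和彦, 獅々堀正幹, Kyoritsu Shuppan, 2002 (<Coffee Break> A book I recommend)

黒岩眞吾

Journal of the Acoustical Society of Japan 59(1) 59-60 2002
Voice authentication technology for telephone services

黒岩眞吾, 柘植覚

IEICE General Conference 2002, March 276-277 2002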
Multi-Lingual Multi-Function Multi-Media Intelligent System (co-authored)

Ren Fuji, Kuroiwa Shingo

Bulletin of Faculty of Engineering 47 21-34 2002
To define the feature function in extracting Japanese-Chinese bilingual word pairs using maximum entropy modeling (co-authored)

Proceedings of IEEE International Conference on SYSTEMS, MAN, AND CYBERNETICS 2002 (SMC2002) TA1E5 2002
Semi-automatic acquisition translation knowledge from parallel corpora (co-authored)

Proceedings of IEEE International Conference on SYSTEMS, MAN, AND CYBERNETICS 2002 (SMC2002) TA1E3 2002
Robust Feature Extraction in a Variety of Input Devices on the Basis of ETSI Standard DSR Front-end (co-authored)

7th International Conference on Spoken Language Processing (ICSLP2002) 2221-2224 2002
Japanese-Chinese Machine Translation System Using SFBMT (co-authored)

ZHAO X.

International Conference on Information-2002, Series of Information & Management Sciences 3 16-21 2002
Machine-Aided English Writing Prototype System (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 22-26 2002
Automatic Map-Generation Based on Verb Frames (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 107-112 2002
Japanese-English Machine Translation Systems for Web Users Using Super-Function (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 366-371 2002
The Influence of Speech Coders for IP telephone on Speech Recognition Performance (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 44-48 2002
An Evaluation of Japanese Speech Recognition Using ETSI Standard DSR Front-end (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 372-376 2002
Latent Semantic Indexing Based on Simple Principal Component Analysis (co-authored)

International Conference on Information-2002, Series of Information & Management Sciences 3 172-177 2002
Some Advances on Multi-Lingual Multi-Function Multi-Media Intelligent System Project (co-authored)

International Journal of Asian Information-Science-Life 1(1) 21-34 2002
A study of phoneme-dependent linear discriminant analysis

柘植覚, 黒岩眞吾, 任福継, 北研二

Proceedings of the Meeting of the Acoustical Society of Japan 2001(2) 177-178 October 1, 2001
Dimensionality reduction of the vector space information retrieval model using Simple PCA

黒岩眞吾, 柘植覚, 田仁宏典, Tai Xiaoying, 獅々堀正幹, 北研二

IPSJ SIG Technical Reports, Natural Language Processing (NL) 2001(69) 61-66 July 16, 2001

The vector space model (VSM) is a representative retrieval model in information retrieval, representing a document collection by a term-by-document matrix. Because documents are represented as vectors based on term frequencies, the vector space is generally sparse and high-dimensional, which increases memory use and retrieval time; in addition, meaningless words in documents act as noise, lowering retrieval accuracy and making it difficult to capture the underlying semantic structure. Latent Semantic Indexing (LSI), which computes similarity in a space whose dimensionality has been reduced by singular value decomposition (SVD), has been proposed to address this, and its effectiveness has been reported. However, matrix decomposition methods such as principal component analysis (PCA) and SVD consume a large amount of computational resources. In this paper we apply Simple Principal Component Analysis (SPCA), a fast data-oriented method that performs approximate principal component analysis with less computation than SVD, to dimensionality reduction of the vector space model. Retrieval experiments on the MEDLINE collection showed that SPCA achieved retrieval performance equal to or better than that of SVD and a significant improvement over the conventional vector space model.
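Dimensionality reduction of the vector space information retrieval model using Simple PCA

黒岩眞吾, 柘植覚, 田仁宏典, TAI Xiaoying, 獅々堀正幹, 北研二

IEICE Technical Report. NLC, Natural Language Understanding and Models of Communication 101(189) 61-66 July 9, 2001

The vector space model (VSM) is a representative retrieval model in information retrieval. Because it represents documents as vectors based on term frequencies, the vector space is generally sparse and high-dimensional, which increases memory use and retrieval time; in addition, meaningless words in documents act as noise and lower retrieval accuracy. Latent Semantic Indexing (LSI), which computes similarity in a space whose dimensionality has been reduced by singular value decomposition (SVD), has been proposed to address this, and its effectiveness has been reported. In this paper we apply Simple Principal Component Analysis (SPCA), which can perform approximate principal component analysis with less computation than SVD, to dimensionality reduction. Retrieval experiments on the MEDLINE collection showed that SPCA achieved retrieval performance equal to or better than that of SVD.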
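An automatic prank call rejection system for Home Country Direct using speech recognition

黒岩眞吾, 内藤正樹, 中村誠, 酒寄信一, 武笠貴史

Transactions of the IEICE D-II, Information and Systems II: Pattern Processing 84(6) 859-867 June 1, 2001

This paper discusses a system that uses speech recognition to automatically reject prank calls from overseas arriving at Home Country Direct, an international telephone service. Home Country Direct lets overseas travelers and others access an international operator in their home country directly and use international telephone services entirely in their native language. Because no charge is required to call the operator, prank calls by local children had become a problem. We therefore developed an "automatic prank call rejection system" that instructs the caller, through an announcement in Japanese, to say a specific word; if the word is repeated correctly the caller is judged to be a legitimate user, and otherwise the call is judged to be a prank. When applied to the commercial service, the system rejected 94.7% of prank calls while falsely rejecting 0.8% of legitimate users. We confirmed that falsely rejected legitimate users eventually repeated the correct word and were connected after calling back several times. The system has been in operation at the KDD International Telephone Center since March 1996 and rejects about 10,000 prank calls per day.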
Dimensionality Reduction Using Non-negative Matrix Factorization for Information Retrieval (co-authored)

Proc. of Natural Language Processing and Knowledge Engineering Mini Symposium, IEEE SYSTEMS, MAN, AND CYBERNETICS 2001 (NLPKE) 960-965 2001
A New Machine Translation Approach using Multiple Translation Engines and Sentence Partitioning (co-authored)

Proc. of Natural Language Processing and Knowledge Engineering Mini Symposium, IEEE Systems, Man, And Cybernetics 2001 (NLPKE) 3 1699-1704 2001
Dimensionality reduction of vector space information retrieval model based on non-negative matrix factorization (co-authored)

Proc. of Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES) 69 367-371 2001
Dimensionality reduction of vector space model based on Simple PCA (co-authored)

Proc. of Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES) 69 362-366 2001
Rapid CODEC Adaptation for Cellular Phone Speech Recognition (co-authored)

Proc. of Eurospeech 2001 857-860 2001
Relevance feedback with support vector machine for information retrieval (co-authored)

Proc. of 19th International Conference on Computer Processing of Oriental Languages (ICCPOL) 35-40 2001
Efficient mixture Gaussian synthesis for decision tree based state tying (co-authored)

Proceedings of 26th International Conference on Acoustics, Speech and Signal Processing (ICASSP2001) 1 493-496 2001
An automatic prank call rejection system for Home Country Direct using speech recognition (co-authored)

Transactions of the IEICE (D-II) J84-D-II(6) 859-867 2001
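Tree-based clustering for mixture Gaussian HMMs

加藤恒夫, 黒岩眞吾, 清水徹, 樋口宜男

Transactions of the IEICE D-II, Information and Systems II: Pattern Processing 83(11) 2128-2136 November 25, 2000

Tree-based clustering is an effective technique for sharing HMM states: it clusters a set of context-dependent models using phonetic context as the splitting criterion. In previous reports it was restricted to single-Gaussian HMMs in order to limit computation. However, because single-Gaussian HMMs are insufficient to represent acoustic features, the resulting topology (the state-sharing structure of the HMMs) is not necessarily appropriate. In addition, obtaining a state-shared model with the desired number of mixture components requires repeatedly doubling the number of mixtures and running embedded training after tree-based clustering, so training takes a long time. This paper proposes a method that extends the tree-based clustering algorithm for single-Gaussian HMMs so that clustering can be performed on mixture Gaussian HMMs. Compared with the conventional method using single-Gaussian HMMs, the proposed method shortened training time to about one third and improved recognition rates by 1 to 2 points in recognition experiments with a syllable typewriter and in continuous word recognition experiments.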
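A threshold setting method for text-prompted speaker verification

内部利明, 黒岩眞吾, 樋口宜男

Transactions of the IEICE D-II, Information and Systems II: Pattern Processing 83(11) 2291-2299 November 25, 2000

Speaker verification requires a threshold, set in advance, for deciding whether the speaker is the claimed person; however, because likelihoods at verification time vary from speaker to speaker, it has been difficult to set an optimal threshold beforehand. One cause of this variation is thought to be that the degree to which a speaker model adapts to the speaker during training (the speaker adaptation degree) differs between speakers. For speaker verification based on hidden Markov models (HMMs), we propose a method that, when a speaker model is trained by adaptation from a speaker-independent model, uses the likelihood gain obtained through adaptation as the speaker adaptation degree and expresses the threshold as a function of it, so that a threshold can be set in advance for each speaker. Evaluation experiments confirmed that the relationship between the speaker adaptation degree and the threshold is stable regardless of the amount of training data. Furthermore, the proposed method reduced the verification error rate by 30%.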
D-14-9 A stock price information guidance system using telephone speech recognition

黒岩眞吾, 加藤恒夫, 内藤正樹, 清水徹, 樋口宜男

Proceedings of the IEICE Society Conference 2000 282-282 September 7, 2000
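Speaker clustering using a telephone speech database of many speakers

加藤恒夫, 黒岩眞吾, 清水徹, 樋口宜男

IEICE Technical Report. SP, Speech 100(136) 1-8 June 15, 2000

Speaker clustering creates sets of acoustically similar speakers; training an acoustic model for each set yields models tuned to that set. Previous reports used speech from a small number of speakers (tens to hundreds) as training data. Using speech from many speakers is expected to improve recognition performance because the amount of data per speaker cluster increases, improving the accuracy of model parameter estimation, and because an acoustic model of a speaker set close to any given speaker becomes available for recognition. In this paper we performed speaker clustering on a telephone speech database of about 1,000 speakers and confirmed that the recognition rate rises as the number of speakers in the training data increases. In addition, to determine the recognition rate attainable when speaker sets are formed ideally, we replaced the conventional likelihood-based method and trained models on data from speakers that gave high recognition rates for the target speaker; this showed that about 60% of the performance gap between the speaker-independent and speaker-dependent models can be recovered.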
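Development of a large-vocabulary general-purpose speech recognition engine

清水徹, 黒岩眞吾, 河井恒, 内部利明, 加藤恒夫, 樋口宜男

Proceedings of the IEICE General Conference 2000(1) 190-190 March 7, 2000
Evaluation of a large-vocabulary general-purpose speech recognition engine

黒岩眞吾, 加藤恒夫, 内部利明, 河井恒, 清水徹, 樋口宜男

Proceedings of the IEICE General Conference 2000(1) 190-190 March 7, 2000
A method for converting CS-ACELP codes into acoustic parameters for speech recognition

内部利明, 黒岩眞吾, 樋口宜男

Proceedings of the IEICE General Conference 2000(1) 195-195 March 7, 2000
A prank call rejection system using connected digit recognition

河井恒, 黒岩眞吾, 清水徹, 樋口宜男, 鈴木信雄, 大野晃生

Proceedings of the IEICE General Conference 2000(1) 197-197 March 7, 2000
A speaker identification method using the likelihood gain obtained during speaker model training

内部利明, 黒岩眞吾, 樋口宜男

Proceedings of the Meeting of the Acoustical Society of Japan 2000(1) 95-96 March 1, 2000
A study of speaker clustering using a telephone speech database of many speakers

加藤恒夫, 黒岩眞吾, 河井恒, 清水徹, 樋口宜男

Proceedings of the Meeting of the Acoustical Society of Japan 2000(1) 107-108 March 1, 2000
Recognition experiments using HMMs for elderly speakers

井ノ上直己, 黒岩眞吾, 橋本和夫, 樋口宜男

IEICE National Conference 2000 193-193 2000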

Presentations

Cross-Lingual Speaker Identification for Japanese-English Bilinguals

Ryotaro Sano, Masafumi Nishida, Satoru Tsuge, Shingo Kuroiwa, Hiroyuki Yoshimura

2023 IEEE 12th Global Conference on Consumer Electronics October 12, 2023
Utterance-Style-Dependent Speaker Verification by Utilizing Emotions

Hibiki Takayama, Masafumi Nishida, Satoru Tsuge, Shingo Kuroiwa

2023 IEEE 12th Global Conference on Consumer Electronics October 11, 2023
Utterance-style-dependent speaker verification assuming emotions

高山響, 西田昌史, 柘植覚, 黒岩眞吾, 西村雅史

150th Meeting of the Acoustical Society of Japan (Autumn 2023) September 27, 2023
Training of a same-speaker identification DNN using word utterances and speaker verification

亀田健太郎, 黒岩眞吾, 堀内靖雄, 柘植覚, 西田昌史

150th Meeting of the Acoustical Society of Japan (Autumn 2023) September 27, 2023
Construction of a database for automatic evaluation and analysis of eating behavior

伴野司, 森野智子, 黒岩眞吾, 西田昌史, 西村雅史

85th National Convention of IPSJ March 2023

Professional Memberships

Works

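ActVoice Smart (picture-card naming training software using speech recognition)

株式会社エスコアール

August 2017 - Present, Software
ハナセル (language training tablet for people with aphasia using speech recognition)

株式会社イントロム

June 2018 - March 2025, Software
リハログ (language training plan creation and recording system)

株式会社イントロム

January 2017 - March 2025, Web Service
ActVoice for Pepper (naming training app for conversational robots)

株式会社ロボキュア

January 2017 - June 2018, Software
CD edition of 「そのまま使える失語症教材１」 (Ready-to-Use Aphasia Teaching Materials 1)

鈴木勉, 宇野園子, 佐藤ゆう子, 朝田真理, 石戸純子, 泉谷聡子, 前川友絵*, 井堀奈美, 鶴田薫, 堀田牧子, four-panel illustrations, 阿部裕実*, 有賀恵子, 小川節子, 須田悦子, 相馬肖美, 寺田奈々, 中嶋基充, 西脇恵子, reading comprehension, supervision: 宇野園子, 100 characters, 片山芳恵, 斎藤敬子, 嶋田真砂美, 栁澤瑶貴, 高山亜希子*, 井上澄香, 上杉由美, 鈴木和子, 村西幸代, 鈴木直哉, 小熊真由, 木村佐知子, 相楽涼子, 治田寛之, 山本弘美, 黒岩眞吾

2018, Teaching material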