GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics, 119-120, 2024
Obese and overweight individuals are at high risk for chronic diseases such as sleep apnea and diabetes. Tracking eating behavior is therefore necessary to determine the causes of obesity; however, following specific individuals and observing their eating behavior is time- and labor-intensive, so a method to monitor eating behavior automatically is needed. As one such approach, we propose a method for conveniently recognizing food categories from food-intake sounds recorded by microphones (a below-the-ear microphone, a throat microphone, and an acoustic microphone), which places little burden on the body and is preferable from the viewpoint of privacy protection. Furthermore, a comparison of Mel filterbank (MFB) features and large-scale pre-trained speech models (wav2vec2.0, WavLM, and HuBERT) showed the effectiveness of the large-scale pre-trained speech models in the food recognition task.
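As a rough illustration of the feature comparison described above (not the authors' implementation), the sketch below extracts an utterance-level embedding from a pre-trained HuBERT model and, for contrast, a mean log-Mel filterbank (MFB) vector; the torchaudio pipeline, mean pooling, and downstream classifier are assumptions.

```python
# Minimal sketch: pre-trained speech-model features vs. MFB features
# for classifying food-intake sound clips. Illustrative only.
import torch
import torchaudio
from torchaudio.pipelines import HUBERT_BASE

bundle = HUBERT_BASE
model = bundle.get_model().eval()

def hubert_embedding(wav_path):
    wav, sr = torchaudio.load(wav_path)
    wav = wav.mean(dim=0, keepdim=True)                       # force mono
    wav = torchaudio.functional.resample(wav, sr, bundle.sample_rate)
    with torch.no_grad():
        feats, _ = model.extract_features(wav)
    return feats[-1].mean(dim=1).squeeze(0)                   # mean-pool last layer

def mfb_embedding(wav_path, n_mels=40):
    wav, sr = torchaudio.load(wav_path)
    mfb = torchaudio.transforms.MelSpectrogram(sample_rate=sr, n_mels=n_mels)(wav)
    return (mfb + 1e-10).log().mean(dim=-1).squeeze(0)        # average log-MFB over time

# Either embedding can feed a simple classifier over food categories,
# e.g. logistic regression fit on labelled chewing/swallowing clips.
```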
GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics, 808-810, 2024
To enhance speaker verification for short utterances, we have developed a Same Speaker Identification Deep Neural Network (SSI-DNN), which determines whether two utterances of the same text were spoken by the same speaker, exploiting the shared text to achieve higher accuracy. In this paper, we extend the target of the SSI-DNN from monosyllabic utterances to word utterances to improve speaker recognition performance. Experimental results showed that the SSI-DNN trained on word utterances achieved equal error rates (EERs) of 0.1% to 2.8%, outperforming the x-vector-based method, a representative approach to speaker verification.
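A minimal sketch of the general idea of same-speaker identification over paired utterances of the same word; the layer sizes, the use of pre-computed utterance embeddings, and the training setup are illustrative assumptions, not the paper's exact SSI-DNN.

```python
# Siamese-style scorer: does this pair of same-text utterances share a speaker?
import torch
import torch.nn as nn

class SameSpeakerNet(nn.Module):
    def __init__(self, feat_dim=256, hidden=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(feat_dim, hidden), nn.ReLU())
        self.head = nn.Sequential(
            nn.Linear(hidden * 2, hidden), nn.ReLU(),
            nn.Linear(hidden, 1))                 # logit: same speaker vs. different

    def forward(self, emb_a, emb_b):
        h = torch.cat([self.encoder(emb_a), self.encoder(emb_b)], dim=-1)
        return self.head(h).squeeze(-1)

# Training pairs are two recordings of the SAME word, labelled 1 if both were
# spoken by one speaker and 0 otherwise; EER is then computed from the scores.
```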
GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics, 141-143, 2024
Hands-free control of shower settings, such as temperature, is highly desirable when both hands are occupied or the eyes are closed. In this paper, we propose a speaker-dependent, template-based isolated word recognition system using pre-trained large speech models (LSMs) to realize voice-activated shower control with a single microphone. Specifically, we examine the performance of three LSMs (wav2vec2.0, HuBERT, and WavLM) as well as conventional MFCCs as features. Additionally, we investigate speech enhancement with a Convolutional Recurrent Network (CRN) to improve robustness against shower noise. Experiments on recognizing 30 words at SNRs ranging from -5 dB to 20 dB demonstrate that HuBERT achieves the highest recognition accuracy (77.8% to 95.6%). The CRN, on the other hand, improved recognition accuracy only under the -5 dB condition, and even then the accuracy reached only 80.8%.
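The sketch below illustrates speaker-dependent, template-based isolated word recognition in general terms: enrolled template feature sequences per word, an alignment cost between sequences, and a nearest-template decision. The frame features, cosine distance, and DTW alignment via librosa are assumptions for illustration, not the paper's configuration.

```python
# Template matching for isolated word recognition: pick the enrolled word
# whose template aligns most cheaply with the test utterance's features.
import numpy as np
import librosa

def dtw_distance(feat_a, feat_b):
    # feat_*: (dim, frames) arrays; cosine frame distance + DTW alignment cost
    cost = 1.0 - (feat_a.T @ feat_b) / (
        np.linalg.norm(feat_a.T, axis=1, keepdims=True)
        * np.linalg.norm(feat_b, axis=0, keepdims=True) + 1e-8)
    acc_cost, _ = librosa.sequence.dtw(C=cost)
    return acc_cost[-1, -1] / (cost.shape[0] + cost.shape[1])

def recognize(test_feat, templates):
    # templates: {word: [feature sequences enrolled by the target speaker]}
    scores = {w: min(dtw_distance(test_feat, t) for t in ts)
              for w, ts in templates.items()}
    return min(scores, key=scores.get)
```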
GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics, 805-807, 2024
Recent advances in AI technology have brought not only many benefits but also considerable risks from malicious use. One key example is spoofing attacks on speaker verification systems using speech synthesis and voice conversion. To tackle this challenge, we previously proposed a two-step matching method for robust speaker verification, in which a user registers an emotion with the system in advance and is accepted only when speaking with that emotion. This method reduced the false acceptance rate but increased the false rejection rate. To overcome this problem, in this work we propose a method that integrates speaker and emotion verification scores. Experiments revealed that, by assigning an optimal weight to the speaker and emotion information contained in the speech, the proposed method reduces the equal error rate compared with the conventional method.
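As a rough illustration of the score-integration idea (not the paper's implementation), the sketch below fuses a speaker-verification score and an emotion-verification score with a weight w and measures the equal error rate; the scoring back-ends and the weight search are assumed for illustration.

```python
# Weighted fusion of speaker and emotion verification scores, plus EER.
import numpy as np

def fuse(spk_scores, emo_scores, w):
    return w * np.asarray(spk_scores) + (1.0 - w) * np.asarray(emo_scores)

def equal_error_rate(scores, labels):
    # labels: 1 = genuine trial, 0 = impostor/spoof trial
    scores, labels = np.asarray(scores), np.asarray(labels)
    thresholds = np.sort(scores)
    far = np.array([(scores[labels == 0] >= t).mean() for t in thresholds])
    frr = np.array([(scores[labels == 1] < t).mean() for t in thresholds])
    i = int(np.argmin(np.abs(far - frr)))
    return (far[i] + frr[i]) / 2.0

# The fusion weight can be chosen on development trials, e.g.
#   w_best = min(np.linspace(0, 1, 101),
#                key=lambda w: equal_error_rate(fuse(s, e, w), y))
```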
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), Lecture Notes in Artificial Intelligence, Vol. 4274, 539+, 2006
Proceedings of The Fourth International Conference on Information and The Fourth Irish Conference on the Mathematical Foundations of Computer Science and Information Technology, 345-348, 2006
Proceedings of The Fourth International Conference on Information and The Fourth Irish Conference on the Mathematical Foundations of Computer Science and Information Technology, 395-398, 2006
Proceedings of The Fourth International Conference on Information and The Fourth Irish Conference on the Mathematical Foundations of Computer Science and Information Technology, 184-188, 2006
Proceedings of The Fourth International Conference on Information and The Fourth Irish Conference on the Mathematical Foundations of Computer Science and Information Technology, 416-419, 2006
Performance degradation caused by environmental factors such as noise and reverberation is unavoidable for current state-of-the-art speech recognition, and many studies have addressed this problem. However, because different tasks and different evaluation data have been used, it has been very difficult to compare the resulting methods. For this reason, a working group on the evaluation of noisy speech recognition was organized in October 2001 under the Special Interest Group on Spoken Language Processing of the Information Processing Society of Japan, and it has created and distributed standard evaluation corpora and a standard back-end. This paper describes the current activities, future plans, and aims of this effort toward a common standardized framework for noisy speech recognition.
With recent advances in information processing technology, research on handling human sensibility by computer, a topic that had rarely been addressed in the information processing field, has become active. For anthropomorphic agents and sensibility robots to behave like humans, they must recognize human emotions and express emotions of their own; the robot "ifbot" is one example of a sensibility robot that recognizes and expresses emotions. We have been studying emotion recognition technology for application to such robots. However, emotion recognition research is still at an early stage, and few language corpora usable for emotion recognition exist. Such corpora must be constructed by hand, yet there is no unified annotation method or data format for emotion information, so the current environment is insufficient for building corpora and advancing this research. We are therefore developing a system that supports the construction of language corpora for sensibility information processing. In this paper, we propose a system that supports the creation of a natural language corpus tagged with emotion information and describe an outline of its development.