堀内靖雄

ホリウチヤスオ (Yasuo Horiuchi)

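Basic Information

Affiliation: Associate Professor, Graduate School of Informatics, Chiba University

Degree: Doctor of Engineering (March 1995, Tokyo Institute of Technology)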

J-GLOBAL ID: 200901021029331583
researchmap member ID: 1000191929

Research Areas

Information and communication / Intelligent informatics

Awards

Papers

Determining the base frequency of the F0 contour generation model for the diverse expression of speech

Yoshiko Arimoto, Yasuo Horiuchi, Sumio Ohno

Acoustical Science and Technology 46(1), January 2025 (peer-reviewed)

A Study of the Factors That Form the Functions Common to "Dialogue Language"

市川熹, 長嶋祐二, 堀内靖雄

The Journal of the Acoustical Society of Japan 80(7) 355-366, July 2024 (peer-reviewed)

Food Recognition Using Large-scale Pre-trained Speech Models

Satoshi Naito, Masafumi Nishimura, Masafumi Nishida, Yasuo Horiuchi, Shingo Kuroiwa

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 119-120, 2024

Obese and overweight individuals are at high risk for chronic conditions such as sleep apnea and diabetes. Tracking eating behavior is therefore necessary to determine the causes of obesity; however, following specific individuals through their daily lives and observing their eating behavior is time- and labor-intensive, so a method for automatically monitoring eating behavior is needed. As one such approach, we propose a method for conveniently recognizing the food category from food intake sounds recorded by microphones (a below-the-ear microphone, a throat microphone, and an acoustic microphone), which places less burden on the body and is preferable from the viewpoint of privacy protection. Furthermore, a comparison of MFB with large-scale pre-trained speech models (wav2vec 2.0, WavLM, and HuBERT) showed the effectiveness of the pre-trained models in the food recognition task.

Text-Dependent Speaker Verification Using SSI-DNN Trained on Short Utterance

Kentaro Kameda, Satoru Tsuge, Shingo Kuroiwa, Yasuo Horiuchi, Masafumi Nishida

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 808-810, 2024

To enhance speaker verification for short utterances, we have developed a Same Speaker Identification Deep Neural Network (SSI-DNN). By focusing on utterances of the same text, this network identifies more accurately whether two utterances were spoken by the same speaker. In this paper, we extend the detection target of the SSI-DNN from monosyllabic utterances to word utterances to improve speaker recognition performance. Experimental results showed that the SSI-DNN trained on word utterances achieved an EER of 0.1% to 2.8%. These results indicate that the SSI-DNN outperformed the x-vector-based method, a representative speaker verification method.

Template-Based Speech Recognition Using Pre-trained Large Speech Models for Voice-Activated Shower Control

Takumi Uehara, Shingo Kuroiwa, Yasuo Horiuchi, Masafumi Nishida, Satoru Tsuge

GCCE 2024 - 2024 IEEE 13th Global Conference on Consumer Electronics 141-143, 2024

Hands-free control of shower settings, such as temperature, is highly desirable because it enhances user convenience when both hands are occupied or the eyes are closed. In this paper, we propose a speaker-dependent, template-based isolated word recognition system using pre-trained large speech models (LSMs) to realize voice-activated shower control with a single microphone. Specifically, we examine the performance of three LSMs (wav2vec 2.0, HuBERT, and WavLM) as well as conventional MFCCs as features. Additionally, we investigate speech enhancement using a Convolutional Recurrent Neural Network (CRN) to improve robustness against shower noise. Our experiments on recognizing 30 words at SNRs ranging from -5 dB to 20 dB demonstrate that HuBERT achieves the highest recognition accuracy (77.8% to 95.6%). Speech enhancement with the CRN, by contrast, improved recognition accuracy only in the -5 dB condition, where accuracy still reached only 80.8%.

Identification of vocal tract state before and after swallowing using acoustic features

Aoi Sugita, Masafumi Nishida, Masafumi Nishimura, Yasuo Horiuchi, Shingo Kuroiwa

2022 IEEE 11th Global Conference on Consumer Electronics (GCCE), October 18, 2022

Same Speaker Identification with Deep Learning and Application to Text-Dependent Speaker Verification

Manaka Takamizawa, Satoru Tsuge, Yasuo Horiuchi, Shingo Kuroiwa

KES-HCIS 149-158, 2022

Text-Dependent Closed-Set Two-Speaker Recognition of a Key Phrase Uttered Synchronously by Two Persons

Toshiyuki Ugawa, Satoru Tsuge, Yasuo Horiuchi, Shingo Kuroiwa

Human Centred Intelligent Systems 405-413, 2021 (peer-reviewed)

Constructing a Highly Accurate Japanese Sign Language Motion Database Including Dialogue

Yuji Nagashima, Keiko Watanabe, Daisuke Hara, Yasuo Horiuchi, Shinji Sako, Akira Ichikawa

Communications in Computer and Information Science 76-81, June 2020 (peer-reviewed)

Analysis of Pauses and Turn-Taking in Japanese Sign Language Dialogue

堀内靖雄, 有本泰子, 黒岩眞吾

Japanese Society for Artificial Intelligence SIG Technical Reports SIG-SLUD-B903-01 1-6, March 2020 (lead author)

Analysis of Acoustic Features Affected by Conditions of the Vocal Tract

Masahiro Koto, Tomoki Hosoyama, Masafumi Nishimura, Masafumi Nishida, Yasuo Horiuchi, Shingo Kuroiwa

Proceedings of 2020 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing 311-314, March 2020 (peer-reviewed)

Analysis of Acoustic Features Affected by Residual Food in the Piriform Fossa Toward Early-Detection of Dysphagia

Tomoki Hosoyama, Masahiro Koto, Masafumi Nishimura, Masafumi Nishida, Yasuo Horiuchi, Shingo Kuroiwa

Innovation in Medicine and Healthcare 171-177, 2020 (peer-reviewed)

Discussion of a Japanese sign language database and its annotation systems with consideration for its use in various areas

Shinji Sako, Yuji Nagashima, Daisuke Hara, Yasuo Horiuchi, Keiko Watanabe, Ritsuko Kikusawa, Naoto Kato, Akira Ichikawa

Proceeding of LingCologne 2019, June 6, 2019 (peer-reviewed)

A Picture-Card Naming Training System for People with Aphasia Using a Communication Robot

黒岩眞吾, 堀内靖雄, 古川大輔, 村西幸代

IEICE Transactions A (Japanese Edition) J102-A(2) 1-5, February 2019 (peer-reviewed)

Construction of a Japanese Sign Language Database with Various Data Types

Keiko Watanabe, Yuji Nagashima, Daisuke Hara, Yasuo Horiuchi, Shinji Sako, Akira Ichikawa

Communications in Computer and Information Science 317-322, 2019 (peer-reviewed)

Constructing a Japanese Sign Language Multi-Dimensional Database

Yuji Nagashima, Daisuke Hara, Shinji Sako, Keiko Watanabe, Yasuo Horiuchi, Ritsuko Kikusawa, Naoto Kato, Akira Ichikawa

The 7th Meeting of Signed and Spoken Language Linguistics (SSLL 2018), September 28, 2018 (peer-reviewed)

A Study of a Tempo Control Model for an Accompaniment System

堀内靖雄, 足立亜里紗, 黒岩眞吾

IPSJ SIG Technical Reports 2018-MUS-118(25) 1-6, February 13, 2018 (lead author)

The Structure of "Dialogue Language" with a Light Mental Burden

市川熹, 堀内靖雄, 長嶋祐二

The Transactions of Human Interface Society 20(2) 191-204, 2018 (peer-reviewed)

We have previously reported experimental results on the prosody of languages used in real-time dialogue, such as speech, sign language, and finger braille. These results were discussed alongside various research findings from inside and outside Japan. Based on them, we examined a structure that enables real-time dialogue with a light mental burden. Furthermore, we propose a model that makes real-time dialogue possible by elucidating the information structures of the various languages used in real-time dialogue. The proposed model can explain various phenomena in real-time dialogue.

Text-Independent Speaker Identification Based on Reducing Intersession Variability of Speech Feature Using PCA Transformation

Wenbin Zhang, Haoze Lu, Yasuo Horiuchi, Satoru Tsuge, Kenji Kita, Shingo Kuroiwa

Journal of Signal Processing 15(4) 275-278, July 2011 (peer-reviewed)

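In text-independent speaker recognition, speech variability and intersession variability strongly affect recognition accuracy. This paper proposes reducing the intersession variability of speech data by applying a PCA transformation. The proposed method reduced the recognition error rate by 42.6% compared with the conventional MFCC-based method and by 27.2% compared with the MFB-PCA-based method.

Evaluating Interpreter's Skill by Measurement of Prosody Recognition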

Tanaka Saori, Nakazono Kaoru, Nishida Masafumi, Horiuchi Yasuo, Ichikawa Akira

Information and Media Technologies 3(2) 375-384, 2008

Sign language is a visual language whose main articulators are the hands, torso, head, and face. For simultaneous interpreters of Japanese Sign Language (JSL) and spoken Japanese, it is very important to recognize not only hand movements but also prosody conveyed by the head, eyes, posture, and facial expression. This is because prosody follows grammatical rules for representing case and modification relations in JSL. The goal of this study is to introduce a test called MPR (Measurement of Prosody Recognition) and to demonstrate that it can serve as an indicator of interpreters' other general skills. For this purpose, we conducted two experiments: the first examines the relationship between an interpreter's experience and the MPR score (Experiment 1), and the second investigates the specific skills that MPR can estimate (Experiment 2). The data in Experiment 1 came from four interpreters with more than one year of interpreting experience and four interpreters with less than one year of experience. The mean MPR accuracy of the more experienced group was higher than that of the less experienced group. The data in Experiment 2 came from three interpreters with high MPR scores and three with low MPR scores. Two hearing subjects and three deaf subjects evaluated their skill in terms of speech or sign interpretation skill, reliability of interpretation, expeditiousness, and subjective sense of accomplishment in a pizza-ordering task. The two experiments indicate that MPR may be useful for estimating whether an interpreter is sufficiently experienced to interpret from sign language to spoken Japanese and whether they can interpret expeditiously without making deaf or hearing clients anxious. Finally, we conclude the paper with a summary and suggestions for future work.

Prosody Rule for Time Structure of Finger Braille

Manabi Miyagi, Yuji Fujimori, Yasuo Horiuchi, Akira Ichikawa

Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2000 (RIAO) 862-869, 2000

New WWW browser for visually impaired people using interactive voice technology

Yasuo Horiuchi, Fujiwara Atsushi, Akira Ichikawa

Sixth European Conference on Speech Communication and Technology (EUROSPEECH), 1999

MISC

559

Motion-Capture Analysis of the Phonemes of Straight-Line Hand Movements in Japanese Sign Language

仲本征矢, 堀内靖雄, 原大介, 黒岩眞吾

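97th Meeting of the Japanese Society for Artificial Intelligence Special Interest Group on Spoken Language Understanding and Dialogue Processing 68-73, March 2023

A Study of a Pitch Adjustment Method for an Accompaniment System Considering Harmony with the Sound of the Singing Voice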

嵯峨良理, 堀内靖雄, 黒岩眞吾

IPSJ SIG Technical Reports (Web) 2020(MUS-128), 2020

An Analysis of Pauses and Turn-Taking in Japanese Sign Language Dialogue

長谷川愛, 堀内靖雄, 黒岩眞吾

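Japanese Society for Artificial Intelligence SIG Technical Reports SIG-SLUD-B803 13-18, March 2019

Development of a Multipurpose, High-Precision 3D Database of Japanese Sign Language (Well-being Information Technology)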

長嶋祐二, 酒向慎司, 渡辺桂子, 原大介, 堀内靖雄, 市川熹

IEICE Technical Report 118(440) 71-75, February 7, 2019

Development of a Multipurpose, High-Precision 3D Database of Japanese Sign Language

長嶋祐二, 酒向慎司, 渡辺桂子, 原大介, 堀内靖雄, 市川熹

Proceedings of the Auditory Research Meeting 49(1) 71-75, February 7, 2019

Academic Society Memberships

Works

Joint Research and Competitive Funding Projects

Cross-Modal Analysis of Speech and Sign Language on the Prosody of Interactive Natural Languages

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, April 2020 - March 2024

堀内靖雄

Research on Building a Multipurpose Japanese Sign Language Database

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, May 2017 - March 2021

長嶋祐二, 原大介, 堀内靖雄, 酒向慎司

Research on Music Generation and Analysis Based on Mathematical Models of Composition, Performance, and Signals

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, April 2017 - March 2020

嵯峨山茂樹, 北原鉄朗, 齋藤康之, 堀玄, 小野順貴, 中村和幸, 堀内靖雄, 齋藤大輔, 饗庭絵里子

A Word-Recall Support Method for People with Aphasia Based on an Analysis of Speech-Language Therapists' Conversation Techniques

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, April 2016 - March 2019

黒岩眞吾, 堀内靖雄, 村西幸代, 古川大輔

Elucidating the Prosodic Functions of Sign Language and Speech as Interactive Natural Languages in Different Modalities

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, April 2015 - March 2019

堀内靖雄
