荒井幸代

アライサチヨ (Sachiyo Arai)

基本情報

所属: 千葉大学大学院工学研究院教授

学位: 博士（工学）(東京工業大学)

連絡先: sachiyofaculty.chiba-u.jp
J-GLOBAL ID: 200901031363146377
researchmap会員ID: 6000002280

外部リンク: http://nexus-lab.tu.chiba-u.ac.jp/

研究キーワード

研究分野

情報通信 / 知能情報学 /

主要な委員歴

もっとみる

受賞

2024年11月

研究奨励賞深層強化学習における軌跡の分類を用いたReward Shaping ～自動運転による緊急回避制御の実現に向けて～計測自動制御学会

國枝武史, 荒井幸代
2023年11月

優秀論文賞鉄道の定刻運行を前提とした省エネルギー運転方策の獲得日本鉄道サイバネティクス協議会

鳥海良太,荒井幸代
2023年9月

奨励賞セミマルコフモデルを導入したイベント駆動型強化学習による空調制御の安定性実現情報処理学会，ソフトウェア科学会，人工知能学会

中条隼人, 荒井幸代
2023年3月

ベストプレゼンテーション賞機関リポジトリデータ移行時の修復プログラム作成とエラー分析電子情報通信学会

齋木匠, 荒井幸代
2022年12月

研究奨励賞階層型多目的強化学習による自律分散型交通信号制御計測自動制御学会

齋木匠,荒井幸代

もっとみる

論文

Coordinated Traffic-Signal Control of Wide Area Network via Hierarchical Reinforcement Learning

Takumi Saiki, Sachiyo Arai

IEEE Access 2025年
Estimation of Different Reward Functions Latent in Trajectory Data

Saito Masaharu, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 403-412 2024年3月20日

In recent years, inverse reinforcement learning has attracted attention as a method for estimating the intention of actions using the trajectories of various action-taking agents, including human flow data. In the context of reinforcement learning, “intention” refers to a reward function. Conventional inverse reinforcement learning assumes that all trajectories are generated from policies learned under a single reward function. However, it is natural to assume that people in a human flow act according to multiple policies. In this study, we introduce an expectation-maximization algorithm to inverse reinforcement learning, and propose a method to estimate different reward functions from the trajectories of human flow. The effectiveness of the proposed method was evaluated through a computer experiment based on human flow data collected from subjects around airport gates.
Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making

Ikenaga Akiko, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 393-402 2024年3月20日

Sequential decision-making under multiple objective functions includes the problem of exhaustively searching for a Pareto-optimal policy and the problem of selecting a policy from the resulting set of Pareto-optimal policies based on the decision maker’s preferences. This paper focuses on the latter problem. In order to select a policy that reflects the decision maker’s preferences, it is necessary to order these policies, which is problematic because the decision-maker’s preferences are generally tacit knowledge. Furthermore, it is difficult to order them quantitatively. For this reason, conventional methods have mainly been used to elicit preferences through dialogue with decision-makers and through one-to-one comparisons. In contrast, this paper proposes a method based on inverse reinforcement learning to estimate the weight of each objective from the decision-making sequence. The estimated weights can be used to quantitatively evaluate the Pareto-optimal policies from the viewpoints of the decision-makers preferences. We applied the proposed method to the multi-objective reinforcement learning benchmark problem and verified its effectiveness as an elicitation method of weights for each objective function.
Neuroevolutionary diversity policy search for multi-objective reinforcement learning.

Dan Zhou, Jiqing Du, Sachiyo Arai

Inf. Sci. 657 119932-119932 2024年2月
On-Time and Energy-Saving Optimization of Train Auto Driving Based on Reinforcement Learning.

Cheng Liu, Sachiyo Arai

SCIS/ISIS 1-6 2024年

もっとみる

MISC

121

機関リポジトリデータ移行時の修復プログラム作成とエラー分析

齋木匠, 小林裕太, 武内八重子, 荒井幸代, 檜垣泰彦

電子情報通信学会技術研究報告(Web) 123(429(LOIS2023 49-66)) 2024年
逆強化学習を用いたショウジョウバエの対捕食者行動における報酬推定とその遺伝基盤の解明

佐藤大気, 佐藤大気, 松田一流, 荒井幸代, 高橋佑磨

日本進化学会大会プログラム・講演要旨集(Web) 25th 2023年
セミマルコフモデルを導入したイベント駆動型強化学習による空調制御の安定性実現

中条隼人, 荒井幸代

電子情報通信学会技術研究報告(Web) 123(190(AI2023 1-36)) 2023年
ガンマダイバージェンスに基づく準最適な軌跡のための逆強化学習

岸川大航, 荒井幸代

人工知能学会全国大会論文集(Web) 37th 2023年
潜在的選好の多様性を導入した歩行者モデル~行動変容メカニズムの説明生成に向けて~

田澤慧樹, 荒井幸代

人工知能学会全国大会論文集(Web) 37th 2023年

もっとみる

書籍等出版物

New frontiers in artificial intelligence : JSAI-isAI 2022 Workshop, JURISIN 2022, and JSAI 2022 International Session, Kyoto, Japan, June 12-17, 2022, revised selected papers

高間, 康史, Yada, Katsutoshi, 佐藤, 健 (知能情報学), 荒井, 幸代

Springer 2023年 (ISBN: 9783031291678)
自動運転技術入門～AI×ロボティクスによる自動車の進化～

荒井幸代 (担当:分担執筆, 範囲:9章深層強化学習)

オーム社 2021年4月
これからの強化学習

荒井幸代 (担当:分担執筆, 範囲:2章、3章)

森北出版 2016年10月 (ISBN: 9784627880313)
電気工学ハンドブック第7版

荒井幸代 (担当:分担執筆, 範囲:07編制御・システム 7章4節強化学習とマルチエージェント)

オーム社 2013年9月 (ISBN: 427421382X)
電気学会技術報告機械学習技術の基礎と応用

荒井幸代 (担当:編者(編著者))

一般社団法人電気学会 2013年1月

もっとみる

講演・口頭発表等

201

Multi-Objective Inverse Reinforcement Learning via Non-Negative Matrix Factorization

Daiko Kishikawa, Sachiyo Arai

10th International Congress on Advanced Applied Informatics / 9th International Conference on Smart Computing and Artificial Intelligence 2021年7月12日
Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

2021年度第35回人工知能学会全国大会 2021年6月
Verification of Autonomous Drone Control System for Gathering Information in Disaster Areas

2021年度第35回人工知能学会全国大会 2021年6月
多目的強化学習を用いた交差点における適応的信号制御

齋木匠, 荒井幸代

令和3年電気学会全国大会 2021年3月
模範的軌道を用いた逆強化学習導入によるモデルフリー制御の実現

今村麟太郎, 荒井幸代

令和3年電気学会全国大会 2021年3月

もっとみる

所属学協会

もっとみる

共同研究・競争的資金等の研究課題

人と自律システム系の多目的性に着目した逆強化学習の展開：危険ゼロと快適最大化

日本学術振興会科学研究費助成事業 2022年4月 - 2025年3月

荒井幸代, 松香敏彦, 小林宏泰, 鈴木智
省エネルギーと輸送品質とを考慮した鉄道システムの知的リアルタイム制御技術

日本学術振興会科学研究費補助金基盤研究（C) 2019年4月 - 2022年3月

宮武昌史
高齢者・障害者などの社会的弱者の技術受容と人間中心設計の臨床的調査研究

日本学術振興会科学研究費補助金基盤研究（B) 2017年4月 - 2020年3月

矢入郁子
センサ協調による廃棄物系バイオマス還元物流の適応的モーダルシフト

日本学術振興会科学研究費補助金挑戦的萌芽研究 2016年4月 - 2019年3月

荒井幸代
レジリエントな都市交通機能を実現する「認知，インフラ，制度」の相互改善型設計

日本学術振興会科学研究費補助金基盤研究（B) 2016年4月 - 2019年3月

荒井幸代
危険行動の誘因推定に基づく潜在的危険箇所の抽出：状況判断改善支援への展開

日本学術振興会科学研究費補助金基盤研究（C) 2013年4月 - 2016年3月

荒井幸代
望ましい交通流の実現に向けた情報ダイナミクスの解析と設計

日本学術振興会科学研究費補助金基盤研究（C) 2010年4月 - 2013年3月

荒井幸代
首都圏に特有の微地形境界斜面に着目したミクロレベル地震防災

日本学術振興会科学研究費補助金基盤研究（B) 2010年4月 - 2013年3月

中井正一
学術生産性と教育効果向上のための創知資本活用ネットワーク基盤Nexusの構築

日本学術振興会科学研究費補助金基盤研究（C) 2007年4月 - 2010年3月

荒井幸代
知識ベース推論による微地形・表層地盤分類を用いた災害危険度指標の構築

日本学術振興会科学研究費補助金基盤研究（B） 2006年4月 - 2008年3月

中井正一
Web情報の信頼性を保証するための利用者間のインセンティブに関する研究

日本学術振興会科学研究費補助金基盤研究（C) 2004年4月 - 2007年3月

荒井幸代
人間中心の(ヒュマンセンタード)セマンティックWeb

日本学術振興会科学研究費補助金基盤研究（A） 2003年4月 - 2006年3月

石田亨

産業財産権

特願G09G 5/00 510 G 9471-5G ビデオテックス端末装置

荒井(丹)幸代, 古郡正美

社会貢献活動

委員

その他

千葉県先端情報技術活用研究会 2006年4月 - 現在
室員

その他

千葉大学附属図書館研究開発室 2005年11月 - 現在
作業部会委員

その他

国立情報学研究所学術コンテンツ運営本部 2005年12月 - 2010年3月
技術顧問

助言・指導

iKuni Inc.(Palo Alto, California) 2000年9月 - 2002年6月
技術顧問

助言・指導

Walt Disney Image Engineering Inc.(LA, California) 2000年7月 - 2002年6月

もっとみる

一覧へ戻る

荒井 幸代

基本情報

研究キーワード

研究分野

主要な委員歴

受賞

論文

MISC

書籍等出版物

講演・口頭発表等

所属学協会

共同研究・競争的資金等の研究課題

産業財産権

社会貢献活動

荒井幸代