Sachiyo Arai (荒井幸代)

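Profile Information

Affiliation: Professor, Graduate School of Engineering, Chiba University

Degree: Doctor of Engineering (Tokyo Institute of Technology)

Contact: sachiyofaculty.chiba-u.jp
J-GLOBAL ID: 200901031363146377
researchmap member ID: 6000002280

External link: http://nexus-lab.tu.chiba-u.ac.jp/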

Research Areas

Informatics / Intelligent informatics

Awards

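November 2024

Research Encouragement Award, "Reward Shaping Using Trajectory Classification in Deep Reinforcement Learning: Toward Emergency Avoidance Control in Autonomous Driving", Society of Instrument and Control Engineers (SICE)

國枝武史, 荒井幸代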
November 2023

Excellent Paper Award, "Acquiring Energy-Saving Driving Policies Under the Premise of On-Time Railway Operation", 日本鉄道サイバネティクス協議会

鳥海良太, 荒井幸代
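September 2023

Encouragement Award, "Achieving Stable Air-Conditioning Control with Event-Driven Reinforcement Learning Using a Semi-Markov Model", Information Processing Society of Japan, Japan Society for Software Science and Technology, Japanese Society for Artificial Intelligence

中条隼人, 荒井幸代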
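March 2023

Best Presentation Award, "Development of a Repair Program and Error Analysis for Institutional Repository Data Migration", Institute of Electronics, Information and Communication Engineers (IEICE)

齋木匠, 荒井幸代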
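December 2022

Research Encouragement Award, "Autonomous Decentralized Traffic Signal Control via Hierarchical Multi-Objective Reinforcement Learning", Society of Instrument and Control Engineers (SICE)

齋木匠, 荒井幸代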
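November 2022

Student Encouragement Award, "Path Planning for UAV Formations with Obstacle Avoidance via Multi-Agent Reinforcement Learning", Japanese Society for Artificial Intelligence

森友輝, 荒井幸代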
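September 2022

Excellent Paper Award, "A Study on Deep Reinforcement Learning in Environments with Sparse Rewards and No Expert", Society of Instrument and Control Engineers (SICE)

荒井幸代, 渡部洋介, 入江大史, 古賀祐一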
July 2021

Competitive Paper Award, "Multi-Objective Inverse Reinforcement Learning via Non-Negative Matrix Factorization", 10th International Congress on Advanced Applied Informatics / 9th International Conference on Smart Computing and Artificial Intelligence

Daiko Kishikawa, Sachiyo Arai
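November 2019

Excellent Presentation Award 2, "Quantitative Evaluation of Trajectories in Inverse Reinforcement Learning Using a Set of Suboptimal Trajectories", SICE Systems and Information Division Conference 2019

千邑峻明, 荒井幸代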
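November 2019

Excellent Presentation Award 1, "Realizing Autonomous Driving That Accounts for Passenger Comfort", SICE Systems and Information Division Conference 2019 (SSI2019)

岸川大航, 荒井幸代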
October 2019

JAWS Young Researcher Award, "Estimating the Weight of Each Objective in Multi-Objective Sequential Decision-Making Problems Under Partial Observation", IEEE Computer Society Japan Chapter

池永晶子, 荒井幸代
October 2019

Best Student Paper Award, "Comfortable Driving by Using Deep Inverse Reinforcement Learning", The 4th IEEE International Conference on Agents

荒井幸代
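September 2019

Excellent Presentation Award, "A Method for Switching Driving Policies Based on the Learning Gradients of Simple Scenes: Toward Autonomous Driving in Complex Urban Scenes", Joint Agent Workshops and Symposium 2019

北村清也, 荒井幸代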
September 2019

Best Presentation Award, "A Study of Heterogeneous Swarm Inverse Reinforcement Learning for Building MAS Models", Joint Agent Workshops and Symposium 2019

浪越圭一, 野田五十樹, 荒井幸代
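September 2019

Research Encouragement Award 2, "Estimating the Weight of Each Objective in Multi-Objective Sequential Decision-Making Problems Under Partial Observation", Joint Agent Workshops and Symposium 2019

池永晶子, 荒井幸代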
September 2019

Research Encouragement Award 1, "Utilizing Regenerative Power in Railway Systems via Deep Reinforcement Learning", Joint Agent Workshops and Symposium 2019

吉田賢央, 荒井幸代
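September 2019

Research Encouragement Award, "Mini-Batch Bayesian Inverse Reinforcement Learning Using Expert Trajectories from Multiple Environments", Joint Agent Workshops and Symposium 2019

中田勇介, 荒井幸代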
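September 2017

Excellent Poster Presentation Award, "Simultaneous Estimation of Rewards and Feature Vectors for Learning the Driving of Expert Drivers", Joint Agent Workshops and Symposium 2017

石川翔太, 荒井幸代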
September 2017

Excellent Poster Presentation Award, "Strategy Extraction from Pedestrian Flow Data Considering Follower Agents", Joint Agent Workshops and Symposium 2017

浪越圭一, 荒井幸代
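September 2017

Excellent Paper Award, "A Multi-Objective Reinforcement Learning Algorithm Based on Nonlinear Scalarization of Expected Reward Vectors", Joint Agent Workshops and Symposium 2017

齋竹良介, 竹木祥太, 荒井幸代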
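September 2016

Excellent Presentation Award, "Extracting Features Useful for Affordance Design Based on Analysis of Intermediate Layers in Deep Learning", Joint Agent Workshops and Symposium 2016

中田勇介, 荒井幸代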
October 2015

Excellent Poster Presentation Award, "Identifying Pedestrian Behavioral Norms from Walking Trajectories", Joint Agent Workshops and Symposium 2015

浪越圭一, 荒井幸代
October 2012

Corporate Award, "Inter-Regional Power Interchange by Reinforcement Learning Agents", Joint Agent Workshops and Symposium 2012

荒井幸代
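June 2012

Excellence Award, "Achieving Equilibrium Convergence Under Multiple Equilibria via Inverse Reinforcement Learning", 26th Annual Conference of the Japanese Society for Artificial Intelligence

荒井幸代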
October 2011

Best Presentation Award, "Emergence of Social Norms in Spatial Games Through Reward Design", Joint Agent Workshops and Symposium 2011

荒井幸代
September 2009

Best Teacher Award, Urban Environment Systems Course, Graduate School of Engineering, Chiba University

荒井幸代
November 2008

Best Teacher Award, Chiba University

荒井幸代
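October 2008

Excellent Paper Award, "Analysis of the Characteristics of the Metastable Phase in the Traffic Congestion Formation Process", Joint Agent Workshops and Symposium 2008

荒井幸代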
October 2007

Excellent Paper Award, "Reinforcement Learning Processes in Multi-Agent Systems in Terms of Information Content", Joint Agent Workshops and Symposium 2007

荒井幸代
September 2007

Best Teacher Award, Urban Environment Systems Course, Graduate School of Engineering, Chiba University

荒井幸代

Papers

Coordinated Traffic-Signal Control of Wide Area Network via Hierarchical Reinforcement Learning

Takumi Saiki, Sachiyo Arai

IEEE Access 2025
Estimation of Different Reward Functions Latent in Trajectory Data

Saito Masaharu, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 403-412, March 20, 2024

In recent years, inverse reinforcement learning has attracted attention as a method for estimating the intention of actions using the trajectories of various action-taking agents, including human flow data. In the context of reinforcement learning, “intention” refers to a reward function. Conventional inverse reinforcement learning assumes that all trajectories are generated from policies learned under a single reward function. However, it is natural to assume that people in a human flow act according to multiple policies. In this study, we introduce an expectation-maximization algorithm to inverse reinforcement learning, and propose a method to estimate different reward functions from the trajectories of human flow. The effectiveness of the proposed method was evaluated through a computer experiment based on human flow data collected from subjects around airport gates.
Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making

Ikenaga Akiko, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 393-402, March 20, 2024

Sequential decision-making under multiple objective functions includes the problem of exhaustively searching for a Pareto-optimal policy and the problem of selecting a policy from the resulting set of Pareto-optimal policies based on the decision-maker’s preferences. This paper focuses on the latter problem. In order to select a policy that reflects the decision-maker’s preferences, it is necessary to order these policies, which is problematic because the decision-maker’s preferences are generally tacit knowledge. Furthermore, it is difficult to order them quantitatively. For this reason, conventional methods have mainly been used to elicit preferences through dialogue with decision-makers and through one-to-one comparisons. In contrast, this paper proposes a method based on inverse reinforcement learning to estimate the weight of each objective from the decision-making sequence. The estimated weights can be used to quantitatively evaluate the Pareto-optimal policies from the viewpoint of the decision-maker’s preferences. We applied the proposed method to the multi-objective reinforcement learning benchmark problem and verified its effectiveness as an elicitation method of weights for each objective function.
Neuroevolutionary diversity policy search for multi-objective reinforcement learning.

Dan Zhou, Jiqing Du, Sachiyo Arai

Inf. Sci. 657 119932-119932, February 2024
On-Time and Energy-Saving Optimization of Train Auto Driving Based on Reinforcement Learning.

Cheng Liu, Sachiyo Arai

SCIS/ISIS 1-6 2024

MISC (121 items)

Development of a Repair Program and Error Analysis for Institutional Repository Data Migration

齋木匠, 小林裕太, 武内八重子, 荒井幸代, 檜垣泰彦

IEICE Technical Report (Web) 123(429(LOIS2023 49-66)) 2024
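Reward Estimation in the Anti-Predator Behavior of Drosophila Using Inverse Reinforcement Learning and Elucidation of Its Genetic Basis

佐藤大気, 佐藤大気, 松田一流, 荒井幸代, 高橋佑磨

Program and Abstracts of the Annual Meeting of the Society of Evolutionary Studies, Japan (Web) 25th 2023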
Achieving Stable Air-Conditioning Control with Event-Driven Reinforcement Learning Using a Semi-Markov Model

中条隼人, 荒井幸代

IEICE Technical Report (Web) 123(190(AI2023 1-36)) 2023
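Inverse Reinforcement Learning for Suboptimal Trajectories Based on Gamma Divergence

岸川大航, 荒井幸代

Proceedings of the Annual Conference of the Japanese Society for Artificial Intelligence (Web) 37th 2023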
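A Pedestrian Model Incorporating Diversity of Latent Preferences: Toward Generating Explanations of Behavior Change Mechanisms

田澤慧樹, 荒井幸代

Proceedings of the Annual Conference of the Japanese Society for Artificial Intelligence (Web) 37th 2023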

Books and Other Publications

New frontiers in artificial intelligence : JSAI-isAI 2022 Workshop, JURISIN 2022, and JSAI 2022 International Session, Kyoto, Japan, June 12-17, 2022, revised selected papers

高間康史, Katsutoshi Yada, 佐藤健, 荒井幸代

Springer 2023 (ISBN: 9783031291678)
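Introduction to Autonomous Driving Technology: The Evolution of Automobiles Through AI × Robotics (自動運転技術入門～AI×ロボティクスによる自動車の進化～)

荒井幸代 (Role: contributor, Scope: Chapter 9, Deep Reinforcement Learning)

Ohmsha, April 2021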
The Future of Reinforcement Learning (これからの強化学習)

荒井幸代 (Role: contributor, Scope: Chapters 2 and 3)

Morikita Publishing, October 2016 (ISBN: 9784627880313)
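Electrical Engineering Handbook, 7th Edition (電気工学ハンドブック第7版)

荒井幸代 (Role: contributor, Scope: Part 07 Control and Systems, Chapter 7, Section 4, Reinforcement Learning and Multi-Agents)

Ohmsha, September 2013 (ISBN: 427421382X)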
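IEEJ Technical Report: Fundamentals and Applications of Machine Learning Technology (電気学会技術報告機械学習技術の基礎と応用)

荒井幸代 (Role: editor)

Institute of Electrical Engineers of Japan, January 2013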

Presentations (201 items)

Multi-Objective Inverse Reinforcement Learning via Non-Negative Matrix Factorization

Daiko Kishikawa, Sachiyo Arai

10th International Congress on Advanced Applied Informatics / 9th International Conference on Smart Computing and Artificial Intelligence, July 12, 2021
Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

35th Annual Conference of the Japanese Society for Artificial Intelligence (2021), June 2021
Verification of Autonomous Drone Control System for Gathering Information in Disaster Areas

35th Annual Conference of the Japanese Society for Artificial Intelligence (2021), June 2021
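Adaptive Signal Control at Intersections Using Multi-Objective Reinforcement Learning

齋木匠, 荒井幸代

2021 Annual Meeting of the Institute of Electrical Engineers of Japan, March 2021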
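Realizing Model-Free Control by Introducing Inverse Reinforcement Learning with Exemplary Trajectories

今村麟太郎, 荒井幸代

2021 Annual Meeting of the Institute of Electrical Engineers of Japan, March 2021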

Research Projects (Joint Research and Competitive Funding)

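Development of Inverse Reinforcement Learning Focused on the Multi-Objective Nature of Human and Autonomous Systems: Zero Danger and Maximum Comfort

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, April 2022 - March 2025

荒井幸代, 松香敏彦, 小林宏泰, 鈴木智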
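Intelligent Real-Time Control Technology for Railway Systems Considering Energy Saving and Transport Quality

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, Scientific Research (C), April 2019 - March 2022

宮武昌史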
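Clinical Survey Research on Technology Acceptance and Human-Centered Design for Socially Vulnerable People Such as the Elderly and People with Disabilities

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, Scientific Research (B), April 2017 - March 2020

矢入郁子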
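Adaptive Modal Shift of Return Logistics for Waste Biomass Through Sensor Cooperation

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, Challenging Exploratory Research, April 2016 - March 2019

荒井幸代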
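Mutually Improving Design of "Cognition, Infrastructure, and Institutions" for Resilient Urban Transportation Functions

Japan Society for the Promotion of Science, Grants-in-Aid for Scientific Research, Scientific Research (B), April 2016 - March 2019

荒井幸代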

Industrial Property Rights

Patent application G09G 5/00 510 G 9471-5G, Videotex Terminal Device

荒井(丹)幸代, 古郡正美

Social Activities

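Committee member

Other

Chiba Prefecture Study Group on the Use of Advanced Information Technology (千葉県先端情報技術活用研究会), April 2006 - present
Staff member

Other

Research and Development Office, Chiba University Library, November 2005 - present
Working group member

Other

Academic Content Operations Headquarters, National Institute of Informatics (国立情報学研究所学術コンテンツ運営本部), December 2005 - March 2010
Technical advisor

Advice and guidance

iKuni Inc. (Palo Alto, California), September 2000 - June 2002
Technical advisor

Advice and guidance

Walt Disney Image Engineering Inc. (LA, California), July 2000 - June 2002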
