荒井幸代

アライサチヨ (Sachiyo Arai)

基本情報

所属: 千葉大学大学院工学研究院教授

学位: 博士（工学）(東京工業大学)

連絡先: sachiyofaculty.chiba-u.jp
J-GLOBAL ID: 200901031363146377
researchmap会員ID: 6000002280

外部リンク: http://nexus-lab.tu.chiba-u.ac.jp/

研究キーワード

研究分野

情報通信 / 知能情報学 /

主要な委員歴

もっとみる

受賞

2024年11月

研究奨励賞深層強化学習における軌跡の分類を用いたReward Shaping ～自動運転による緊急回避制御の実現に向けて～計測自動制御学会

國枝武史, 荒井幸代
2023年11月

優秀論文賞鉄道の定刻運行を前提とした省エネルギー運転方策の獲得日本鉄道サイバネティクス協議会

鳥海良太,荒井幸代
2023年9月

奨励賞セミマルコフモデルを導入したイベント駆動型強化学習による空調制御の安定性実現情報処理学会，ソフトウェア科学会，人工知能学会

中条隼人, 荒井幸代
2023年3月

ベストプレゼンテーション賞機関リポジトリデータ移行時の修復プログラム作成とエラー分析電子情報通信学会

齋木匠, 荒井幸代
2022年12月

研究奨励賞階層型多目的強化学習による自律分散型交通信号制御計測自動制御学会

齋木匠,荒井幸代

もっとみる

論文

Coordinated Traffic-Signal Control of Wide Area Network via Hierarchical Reinforcement Learning

Takumi Saiki, Sachiyo Arai

IEEE Access 2025年
Estimation of Different Reward Functions Latent in Trajectory Data

Saito Masaharu, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 403-412 2024年3月20日

In recent years, inverse reinforcement learning has attracted attention as a method for estimating the intention of actions using the trajectories of various action-taking agents, including human flow data. In the context of reinforcement learning, “intention” refers to a reward function. Conventional inverse reinforcement learning assumes that all trajectories are generated from policies learned under a single reward function. However, it is natural to assume that people in a human flow act according to multiple policies. In this study, we introduce an expectation-maximization algorithm to inverse reinforcement learning, and propose a method to estimate different reward functions from the trajectories of human flow. The effectiveness of the proposed method was evaluated through a computer experiment based on human flow data collected from subjects around airport gates.
Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making

Ikenaga Akiko, Arai Sachiyo

Journal of Advanced Computational Intelligence and Intelligent Informatics 28(2) 393-402 2024年3月20日

Sequential decision-making under multiple objective functions includes the problem of exhaustively searching for a Pareto-optimal policy and the problem of selecting a policy from the resulting set of Pareto-optimal policies based on the decision maker’s preferences. This paper focuses on the latter problem. In order to select a policy that reflects the decision maker’s preferences, it is necessary to order these policies, which is problematic because the decision-maker’s preferences are generally tacit knowledge. Furthermore, it is difficult to order them quantitatively. For this reason, conventional methods have mainly been used to elicit preferences through dialogue with decision-makers and through one-to-one comparisons. In contrast, this paper proposes a method based on inverse reinforcement learning to estimate the weight of each objective from the decision-making sequence. The estimated weights can be used to quantitatively evaluate the Pareto-optimal policies from the viewpoints of the decision-makers preferences. We applied the proposed method to the multi-objective reinforcement learning benchmark problem and verified its effectiveness as an elicitation method of weights for each objective function.
Neuroevolutionary diversity policy search for multi-objective reinforcement learning.

Dan Zhou, Jiqing Du, Sachiyo Arai

Inf. Sci. 657 119932-119932 2024年2月
On-Time and Energy-Saving Optimization of Train Auto Driving Based on Reinforcement Learning.

Cheng Liu, Sachiyo Arai

SCIS/ISIS 1-6 2024年

もっとみる

MISC

121

災害時鉄道利用客の経路統制による駅の滞留緩和に向けた考察

樋野葉子, 荒井幸代

人工知能学会全国大会論文集 28 1-4 2014年
太陽光発電の変動抑制に向けた強化学習エージェントモデル (ソフトウェアエージェントとその応用論文特集)

加賀谷駿, 荒井幸代

電子情報通信学会論文誌. D, 情報・システム = The IEICE transactions on information and systems 96(12) 3000-3008 2013年12月

新たなエネルギー供給と消費の形態として注目されているマイクログリッドは,既存の電力系統からの送電に極力依存せずに,エネルギー供給源と消費施設をもつ小規模なエネルギーネットワークである.供給が不安定な自然エネルギー活用を前提とするため,グリッド内での電圧均衡を実現し,逆潮流による大停電を回避する必要がある.そこで本研究では,マイクログリッドを,複数のエージェントからなるネットワークとしてモデル化し,需給の不安定をエージェント間での電力融通によって吸収することを考える.具体的には,太陽光発電をもつ需要家エージェントの強化学習モデルを提案する.エージェントは,太陽光発電・定常発電・電力需要・需要制御・蓄電の五つのモジュールから構成され,不安定要素である太陽光発電と電力需要の二つを需要制御と蓄電によって安定化する.需要制御に用いる強化学習には,連続値をとる電力制御問題に対応するために,Actor-Criticとタイルコーティングを導入する.実験では,5体のエージェントからなる完全グラフにおいて需給の安定化が学習できることをシミュレーションで示し,電力融通における提案モデルの有効性を示す.
行動干渉状態の検出によるマルチエージェント強化学習法の改善 (システム研究会インテリジェント・システム・シンポジウム(FANシンポジウム))

許海遅, 荒井幸代

電気学会研究会資料. ST 2013(29) 115-120 2013年9月25日
需要家エージェント間の電力融通と需給安定化の実現 (人工知能と知識処理)

加賀谷駿, 荒井幸代

電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 113(113) 13-18 2013年7月4日

太陽光の利用において,不安定な電力供給と需要のバランスをどう維持するかという課題がある.太陽光パネルの導入が進めば,過剰な発電による逆潮流が発生し,大停電を引き起こす危険もある.そこで,本研究では,安定的な需給バランスの実現を目的として,需要側の制御方策を強化学習によって獲得する方法を提案する.具体的には,太陽光発電を持つ需要家の電力利用を管理するエージェントのネットワークを想定する.各エージェントは自律分散的に,電力を融通し合いながら,需給の差を最小化する需要制御方策を強化学習によって獲得する.また,過去の太陽光発電量データを用いた計算機実験によって本モデルの有効性を検証する.
リレー解説強化学習の最近の発展《第6回》逆強化学習によるマルチエージェント系の報酬設定

荒井幸代

計測と制御 52(6) 534-539 2013年6月査読有り招待有り
協調系実現に向けた監視対象の自律的決定モデル

野村一平, 荒井幸代

人工知能学会全国大会論文集 27 1-4 2013年
道路ネットワークの変化に対する経路選択の学習

内田英明, 藤井秀樹, 吉村忍, 荒井幸代

情報処理学会論文誌 53(11) 2409-2418 2012年11月15日

現在，多くの交通流シミュレーションでは定常状態の交通流を評価するモデルが採用されており，過渡的な現象を再現することは困難である．しかし，道路ネットワークの変更をともなう交通施策の評価においてこの過渡的な現象を無視することはできない．そこで本研究では，道路ネットワークが変化する状況下でどちらの現象も扱うことのできる経路選択モデルを新たに提案し，マルチエージェント型交通流シミュレータに実装する．はじめに，基本として用いた強化学習の枠組みに基づくQ-routingアルゴリズムの説明を行い，交通流シミュレーションに適用するため加えたいくつかの改良について説明する．不規則な格子状のネットワークにおいて，渋滞や信号制御に対する本経路選択モデルの基本的な振舞いを検証し，最後に岡山市内の現実のLRT延伸計画を対象としてシミュレーションを行い，延伸実行後に小規模な渋滞が発生する可能性を示す．また，この渋滞現象は運転者の保持する延伸前の走行経験がバイアスとして影響した結果，一時的に生じるもので，十分な時間経過の後定常状態に至ることを示す．This paper describes the impact of changes in road network on driver's behavior. In agent-based traffic simulations, agents typically choose the shortest route, while drivers in a real world choose their routes through their own past experience. Since traffic simulations are used extensively in the evaluation and verification of traffic policy, accurate simulation reproducing the routing behavior of the real world is strongly demanded. Thus, we newly develop a reinforcement learning based routing algorithm, and implement it in Traffic Simulator. Then we explains some improvements in the Q-routing for traffic simulation. Firstly, we perform a preliminary experiment using an irregular grid network with various loads or signal control, and get good performance, i.e. robustness of the improved Q-routing in the cases of heavy traffic with signal control. Second, the simulator with/without the improved Q-routing is applied to simulate the LRT expansion project in Okayama city, and we evaluate transient traffic behaviors after the implementation of the plan. We observe transient congestion phenomenon only in the simulation with the improved Q-routing. Though the detailed analysis of the results, we find that such transient traffic congestions are caused due to the bias of the drivers' past experience.
協調フィルタリングに基づく改良版授業推薦システムの構築と評価

蒋再興, 檜垣泰彦, 荒井幸代

研究報告グループウェアとネットワークサービス（GN） 2012(12) 1-6 2012年5月10日

協調フィルタリングを用いたユーザベースの授業推薦システムのプロトタイプを作成し評価を行なったところ，網羅性，精確性の改善，ユーザインタフェースの改善,時間割による制限への対応が必要であることがわかった．それに対応するため，アイテムベースの推薦を追加し類似度の計算尺度の改善を行なった.また，自分と類似した先輩の履修授業を推薦する先輩推薦と，時間割を考慮した協調推薦の機能を設けた．さらにアイテムベースの相関推薦も併用することで学生の主体性の維持を図った．推薦画面を時間割ベースに変更した．この評価の結果,この改良版プロトタイプは，網羅性と精確性を備えていること，高い主体性の維持が確認できた．The Lecture Recommendation System based on User-Based Collaborative Filtering had been completed. The improvement of comprehensiveness, precision and user interface would be needed. In order to improve the system, Item-basedCollaborative Filtering and Cosine Similarity were introduced. The Senior Recommendation could recommend lectures the seniors who are similar to the user registered. The Collaboration Recommendation based on User-Based Collaborative Filter ing and the Item Recommendation based on Item-Based Item-to-item Collaborative Filtering had been developed to improve the initiative of students. In evaluation of the improved prototype with the form of a timetable, it is confirmed that the system performance was increased.
協調フィルタリングに基づく改良版授業推薦システムの構築と評価

蒋再興, 檜垣泰彦, 荒井幸代

研究報告情報セキュリティ心理学とトラスト（SPT） 2012(12) 1-6 2012年5月10日
協調フィルタリングに基づく改良版授業推薦システムの構築と評価

蒋再興, 檜垣泰彦, 荒井幸代

電子情報通信学会技術研究報告. LOIS, ライフインテリジェンスとオフィス情報システム = IEICE technical report. LOIS, Life intelligence and office information systems 112(35) 67-72 2012年5月10日

協調フィルタリングを用いたユーザベースの授業推薦システムのプロトタイプを作成し評価を行なったところ,網羅性,精確性の改善,ユーザインタフェースの改善,時間割による制限への対応が必要であることがわかった.それに対応するため,アイテムベースの推薦を追加し類似度の計算尺度の改善を行なった.また,自分と類似した先輩の履修授業を推薦する先輩推薦と,時間割を考慮した協調推薦の機能を設けた.さらにアイテムベースの相関推薦も併用することで学生の主体性の維持を図った.推薦画面を時間割ベースに変更した.この評価の結果,この改良版プロトタイプは,網羅性と精確性を備えていること,高い主体性の維持が確認できた.
集団のジレンマを解消する相互監視ネットワーク形成アルゴリズム

荒井幸代

計測自動制御学会システム・情報部門,知能システムシンポジウム講演論文集第39回 1-6 2012年
逆強化学習による複数均衡下での均衡収束の実現 (人工知能学会全国大会(第26回)文化,科学技術と未来) -- (エージェント)

荒井幸代, 鈴木香名子, 大喜多周

人工知能学会全国大会論文集 26 1-4 2012年
ランダムタイリングの逐次改善による状態空間の分割

荒井幸代

計測自動制御学会システム・情報部門学術講演会2011講演論文集 SSI2011 111-116 2011年
ラインナップ最適化に向けた打番の要求機能の定量化法

角井勇哉, 荒井幸代

日本オペレーションズ・リサーチ学会和文論文誌 54 84-108 2011年

野球が契約料や広告料など巨額の金銭を動かすビジネスになっている.この背景では,期待得点値を最大にするラインナップを構成することは野球チームにとって大きな課題である.期待得点値は,野球の攻撃をマルコフ連鎖として捉えた期待得点値算出モデルにより計算可能である.しかし,n名の選手集合から構成され得る全通りのラインナップの期待得点値を計算するためには,O(n^9)の計算量が必要である.そこで,本稿では打番の要求機能の定量化法,および得られた要求機能を用いて,最適ラインナップ構成問題をマッチング問題に定式化する方法を提案する.また,提案法を,1.打番の要求機能の定量化法の妥当性,2.ラインナップ構成法としての評価の2段階から既存手法との比較により議論する.
交通流制御のための情報提供戦略の学習

内田英明, 荒井幸代

全国大会講演論文集 72 243-244 2010年3月8日
社会的ジレンマ解消に向けたステークホルダの特定と利得操作法

今宿誠己, 荒井幸代

全国大会講演論文集 72 557-558 2010年3月8日
報酬設計による空間ゲームの社会規範創出

荒井幸代

Joint Agent Workshops and Symposium 2010 Vol.9(USB版) 8 2010年査読有り
情報提供戦略のQ学習による交通ネットワーク流の制御

内田英明, 荒井幸代

人工知能学会全国大会論文集 24 1-4 2010年
前方情報を考慮した走行ルールによるメタ安定相の発生と特徴の解析

増渕達也, 荒井幸代

電子情報通信学会論文誌. D, 情報・システム = The IEICE transactions on information and systems (Japanese edition) 92(11) 1935-1944 2009年11月1日

高速道路の交通流には渋滞に至る過程で「本来ならば渋滞となる交通密度に至っても,交通流の流量が増加し続けるメタ安定状態」が出現する段階(メタ安定相)を経ることが知られている.メタ安定相の発生メカニズムが解明できれば,これを利用することによって自然渋滞発生の軽減も期待できる.本論文では,自然渋滞の原因が運転者前方の道路情報の不確実性に起因することに着目し,交通流のセルオートマトンモデルとして知られるNagel-Schreckenbergモデルの運転者ルールを拡張したモデルを提案する.具体的には,「先行車の前方に関する情報」として,先行車との距離,及び,先行車の前方の空き具合を運転者に与えるだけの単純な拡張によってメタ安定相が発生することをシミュレーションによって確めた.また,従来の交通流の実験で用いられていた道路長やシミュレーション時間が,メタ安定状態の発生パターンや渋滞発生過程に及ぼす影響に着目して考察する.
大学機関の特性を活かす知識共創システムの提案

西島寛, 荒井幸代, 檜垣泰彦, 土屋俊

情報処理学会論文誌 50(2) 615-625 2009年2月15日

大学組織において，教職員は質の高い教育の提供という共通の目的を持つにもかかわらず，互いが提供できる情報を共有する場は限られている．また，学生が潜在的に必要とする情報を的確にとらえることは難しい．本論文は，大学組織を構成する教員·附属図書館員，学生が，必要な情報や提供できる情報を相互に補完し合うためのシステムを提案する．提案システムは，パスファインダとフォークソノミの2つの部分から構成され，前者は，図書館員と教員が作成し，学生に提示する情報提供部分，後者は，講義シラバスに基づいて学生が創り出し，教職員に向けた情報発信部分である．本提案システムの特徴は，教職員や学生がシステムを利用することにより，データベースが利用目的別に構造化され，さらに必要なデータが付加されてデータベースが豊富になっていく仕組みを実現している点にある．本論文では，フォークソノミを「学生側のニーズ抽出機能」として今後の大学組織にとって重要な部分であるという認識から，この実現方法を中心に説明する．検証実験では，Social TagとSocial Linkを学生に付加してもらい，抽出された学生の潜在的ニーズは，パスファインダに掲載する内容を選定するうえでの貴重な情報源となることが示された．また，千葉大学で実用を開始しているパスファインダとフォークソノミとの統合によって教職員と学生間に情報が流通し，効果的に利用し合うことが可能になることを示す．On university campus, teaching faculty and staff share only a limited amount of the information they can mutually provide although they share the common goal of quality education for students. Neither can they appropriately perceive the kind of information students potentially need. The present paper proposes and verifies a system in which all constituents of university including faculty, librarians and students can share and supplement each other for information necessitate for their life and potentially sharable by them. The proposed system consists of the “Pathfinder” and “Falksonomy” components, with the former targeting at the student body, designed collaboratively by teaching faculty and librarians, and the latter addressing the faculty and staff on behalf of students. The currently proposed system is characterized by the implemented mechanism in which the use by the faculty and students helps structurize in terms of their needs the database, which will eventually enrich the database with sorting by purpose and addition of data. The description in the paper concentrate on the implementation of the “Folksonomy” component as the functionality of extracting needs from students in the belief that it will be more important in future. In the testing experiment, students are asked to tag the data in terms of their preferences. The resulting analysis shows that the revealed data structure could be important input for the selecting of items in the pathfinders. It will be also demonstrated that the integration of the creation of pathfinders augmented by folksonomy will potentially result in better communication in teachers and doctors/nurses communicate more effectively.
大学機関の特性を活かす知識共創システムの提案 (特集組織における情報システム開発)

西島寛, 荒井幸代, 檜垣泰彦

情報処理学会論文誌論文誌ジャーナル 50(2) 615-625 2009年2月
大学機関の特性を活かす知識共創システムの提案 (共著)

情報処理学会論文誌 50(2) 615-625 2009年
前方情報を考慮した走行ルールによるメタ安定相の発生と特徴の解析 (共著)

電子情報通信学会論文誌D J92-D(11) 1935-1944 2009年
Information Theoretic Approach for Measuring Interaction in Multiagent Domain. (共著)

Journal of Advanced Computational Intelligence and Intelligent Informatics 13(6) 649-657 2009年
エレベータ群制御における呼び対応の公平化を目的とした優先度学習法

荒井幸代

Joint Agent Workshops and Symposium JAWS2009 2009年査読有り
A Preliminary Analysis of Interactive Effects between Network Structure and Decision Criteria on the Global Behavior

荒井幸代

ICROS-SICE International Joint Conference 2009, USB 6 2009年査読有り
交通信号機の協調制御に向けた情報共有効果の評価

加本昌也, 荒井幸代

論文集 23 1-4 2009年
複車線交通流に与える運転選択速度と後方視野の影響

内田英明, 荒井幸代

論文集 23 1-4 2009年
最適ラインナップ構成に向けての打順の要求機能分析法

角井勇哉, 荒井幸代

論文集 23 1-4 2009年
エレベータ群制御における呼び優先度決定法

大久保有基, 荒井幸代

論文集 23 1-4 2009年
任意ノードの視点からのコミュニティ抽出法

高橋篤, 荒井幸代

論文集 23 1-4 2009年
交差点の滞留車両減少のための信号制御シミュレーション

福島信太郎, 荒井幸代

日本シミュレーション学会大会発表論文集 : シミュレーション・テクノロジー・コンファレンス・計算電気・電子工学シンポジウム 27 477-480 2008年6月19日
フォークソノミーを用いた講義選択知識の抽出

西島寛, 荒井幸代, 檜垣泰彦, 土屋俊

電子情報通信学会技術研究報告. OIS, オフィスインフォメーションシステム 108(53) 79-84 2008年5月23日

大学における履修講義の選択は,シラバスによって提供される講義内容や必修・選択の区分,開講時期に加えて,学生間の口コミなどによる情報に基づいて決定するのが一般的である.本稿では,学生によるボトムアップな情報を統合することによって,学生間における講義選択知識の抽出をおこなう.具体的には,学生自身が付与する「講義を抽象するソーシャルタグ」と「関連の深い講義間のソーシャルリンク」を用いて講義に関するフォークソノミーを割出し,ソーシャルリンクとソーシャルタグの共起によるネットワークを生成した.これを用いてPageRankによる中心的な講義の抽出とSDC法によるクラスターの抽出をおこない,講義選択知識から学生の観点からなされる分類と履修が推奨される講義の抽出をおこなった.
フォークソノミーを用いた講義選択知識の抽出

西島寛, 荒井幸代, 檜垣泰彦, 土屋俊

情報処理学会研究報告. GN, [グループウェアとネットワークサービス] 68(48) 79-84 2008年5月16日

大学における履修講義の選択は,シラバスによって提供される講義内容や必修・選択の区分,開講時期に加えて,学生間の口コミなどによる情報に基づいて決定するのが一般的である.本稿では,学生によるボトムアップな情報を統合することによって,学生間における講義選択知識の抽出をおこなう.具体的には,学生自身が付与する「講義を抽象するソーシャルタグ」と「関連の深い講義間のソーシャルリンク」を用いて講義に関するフォークソノミーを創出し,ソーシャルリンクとソーシャルタグの共起によるネットワークを生成した.これを用いてPageRankによる中心的な講義の抽出とSDC法によるクラスターの抽出をおこない,講義選択知識から学生の観点からなされる分類と履修が推奨される講義の抽出をおこなった.
Measurement of Underlying Cooperation in Multiagent Reinforcement Learning. (共著)

荒井幸代

Lecture Notes in Computer Science 5357 34-41 2008年
LO-003 グラフ構造に基づくコミュニティ抽出手法(情報システム)

高橋篤, 荒井幸代, 須貝康雄

情報科学技術レターズ 6(6) 477-480 2007年8月22日
最小カットを用いたネットワーククラスタリング手法の考察

高橋篤, 荒井幸代, 須貝康雄

自律分散システム・シンポジウム資料 = SICE Symposium on Decentralized Autonomous Systems 19 295-300 2007年1月29日
ネットワーク生成モデルを用いたコミュニティ抽出法の考察

高澤知也, 荒井幸代, 須貝康雄

自律分散システム・シンポジウム資料 = SICE Symposium on Decentralized Autonomous Systems 19 301-306 2007年1月29日
グラフ構造に基づくコミュニティ抽出手法 (共著)

荒井幸代

情報処理学会/電子情報通信学会情報科学技術レターズ 6 477-483 2007年
マルチエージェントシステムのための階層型モジュール強化学習

田中信行, 荒井幸代, 小圷成一, 平田廣則

電気学会研究会資料. IIC, 産業計測制御研究会 2006(91) 101-106 2006年3月14日
マルチエージェント連続タスクにおける報酬設計の実験的考察－RoboCup Soccer Keepawayタスクを例として－ (共著)

荒井幸代, 田中信行

人工知能学会誌学会論文誌 21(6) 537-546 2006年

In this paper, we discuss guidelines for a reward design problem that defines when and what amount of reward should be given to the agent/s, within the context of reinforcement learning approach. We would like to take keepaway soccer as a standard task of the multiagent domain which requires skilled teamwork. The difficulties of designing reward for this task are due to its features as follows: i) since it belongs to the continuing task which has no explicit goal to achieve, it is hard to tell when reward should be given to the agent/s. ii) since it is a multiagent cooperative task, it is hard to decide what is a fair share of reward for each agent's contribution to achieve the goal. Through some experiments, we show that the reward design have a major effect on the agent's behavior, and introduce the successful reward function that makes agents perform keepaway better and more interesting than the conventional one does. Finally, we explore the relationship between `reward design' and `acquired behaviors' from the viewpoint of teamwork.
Teamwork Formation for Keepaway in Robotics Soccer (Reinforcement Learning Approach). (共著)

Lecture Notes in Computer Science 4088 279-292 2006年
Reward Design for Emerging Cooperative Behavior in Continuing Task Domains

荒井幸代

International Joint Conference on Autonomous Agents and Multiagent Systems Workshops on Adaptation and Learning in Autonomous Agents and Multiagent Systems (H.18.5発表予定) 2006年
エージェント間の相互作用 : 望ましい関係性の創発

生天目章, 荒井幸代, 服部聖彦

計測と制御 = Journal of the Society of Instrument and Control Engineers 44(12) 865-874 2005年12月10日
エージェント間の相互作用：望ましい関係性の創発 (共著)

計測自動制御学会 44(12) 865-874 2005年
Guiding Inference Through Relational Reinforcement Learning. (共著)

Lecture Notes in Computer Science 3625 20-37 2005年
環境モデルを利用した弱-強化学習エージェント

荒井幸代

人工知能学会,ソフトウェア科学学会合同エージェントワークショップ&シンポジウム論文集 371-378 2004年
Experoence-based Reinforcement Learning to Acquire Effective Behavior in a Multi-agent Domain. (共著)

Lecture Notes in Artificial Intelligence 2891 98-109 2003年
Semantic Web Service Architecture using Multi-agent Scenario Description. (共著)

Lecture Notes in Artificial Intelligence 2891 98-109 2003年
セマンティクス導入によるユーザ指向Webサービス連携支援

杉本悠樹, 村上陽平, 荒井幸代

人工知能学会全国大会論文集 17 1-4 2003年

書籍等出版物

New frontiers in artificial intelligence : JSAI-isAI 2022 Workshop, JURISIN 2022, and JSAI 2022 International Session, Kyoto, Japan, June 12-17, 2022, revised selected papers

高間, 康史, Yada, Katsutoshi, 佐藤, 健 (知能情報学), 荒井, 幸代

Springer 2023年 (ISBN: 9783031291678)
自動運転技術入門～AI×ロボティクスによる自動車の進化～

荒井幸代 (担当:分担執筆, 範囲:9章深層強化学習)

オーム社 2021年4月
これからの強化学習

荒井幸代 (担当:分担執筆, 範囲:2章、3章)

森北出版 2016年10月 (ISBN: 9784627880313)
電気工学ハンドブック第7版

荒井幸代 (担当:分担執筆, 範囲:07編制御・システム 7章4節強化学習とマルチエージェント)

オーム社 2013年9月 (ISBN: 427421382X)
電気学会技術報告機械学習技術の基礎と応用

荒井幸代 (担当:編者(編著者))

一般社団法人電気学会 2013年1月

もっとみる

講演・口頭発表等

201

Multi-Objective Inverse Reinforcement Learning via Non-Negative Matrix Factorization

Daiko Kishikawa, Sachiyo Arai

10th International Congress on Advanced Applied Informatics / 9th International Conference on Smart Computing and Artificial Intelligence 2021年7月12日
Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

2021年度第35回人工知能学会全国大会 2021年6月
Verification of Autonomous Drone Control System for Gathering Information in Disaster Areas

2021年度第35回人工知能学会全国大会 2021年6月
多目的強化学習を用いた交差点における適応的信号制御

齋木匠, 荒井幸代

令和3年電気学会全国大会 2021年3月
模範的軌道を用いた逆強化学習導入によるモデルフリー制御の実現

今村麟太郎, 荒井幸代

令和3年電気学会全国大会 2021年3月

もっとみる

所属学協会

もっとみる

共同研究・競争的資金等の研究課題

人と自律システム系の多目的性に着目した逆強化学習の展開：危険ゼロと快適最大化

日本学術振興会科学研究費助成事業 2022年4月 - 2025年3月

荒井幸代, 松香敏彦, 小林宏泰, 鈴木智
省エネルギーと輸送品質とを考慮した鉄道システムの知的リアルタイム制御技術

日本学術振興会科学研究費補助金基盤研究（C) 2019年4月 - 2022年3月

宮武昌史
高齢者・障害者などの社会的弱者の技術受容と人間中心設計の臨床的調査研究

日本学術振興会科学研究費補助金基盤研究（B) 2017年4月 - 2020年3月

矢入郁子
センサ協調による廃棄物系バイオマス還元物流の適応的モーダルシフト

日本学術振興会科学研究費補助金挑戦的萌芽研究 2016年4月 - 2019年3月

荒井幸代
レジリエントな都市交通機能を実現する「認知，インフラ，制度」の相互改善型設計

日本学術振興会科学研究費補助金基盤研究（B) 2016年4月 - 2019年3月

荒井幸代

もっとみる

産業財産権

特願G09G 5/00 510 G 9471-5G ビデオテックス端末装置

荒井(丹)幸代, 古郡正美

社会貢献活動

委員

その他

千葉県先端情報技術活用研究会 2006年4月 - 現在
室員

その他

千葉大学附属図書館研究開発室 2005年11月 - 現在
作業部会委員

その他

国立情報学研究所学術コンテンツ運営本部 2005年12月 - 2010年3月
技術顧問

助言・指導

iKuni Inc.(Palo Alto, California) 2000年9月 - 2002年6月
技術顧問

助言・指導

Walt Disney Image Engineering Inc.(LA, California) 2000年7月 - 2002年6月

もっとみる

一覧へ戻る

荒井 幸代

基本情報

研究キーワード

研究分野

主要な委員歴

受賞

論文

MISC

書籍等出版物

講演・口頭発表等

所属学協会

共同研究・競争的資金等の研究課題

産業財産権

社会貢献活動

荒井幸代