TRAIL

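TRAIL (Tokyo Robotics and AI Lab) is a subgroup of the Matsuo-Iwasawa Laboratory at the University of Tokyo. We pursue research and development centered on robot learning, with the goal of realizing intelligence in the real world.

Topics

Recruiting circle members for the 2025 academic year (for undergraduates)

We have started recruiting circle members for the 2025 academic year.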

TRAIL Admin, HSR

2025/3/29

Recruiting circle members for the 2024 academic year (for undergraduates)

We have started recruiting circle members for the 2024 academic year.

TRAIL Admin, HSR

2024/3/10

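Presented at RSJ2023 on integrating a robot system using foundation models

At RSJ2023, we gave a presentation titled "Integration of a Robot System for Realizing Diverse Tasks via Natural Language Using Foundation Models."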

辻知香葉, 小武海大, 和田輝, 綱島颯志, 生駒創, 白坂翠萌, 保呂蒼威, 大見謝恒和, 池田悠也, 松嶋達也, 松尾豊, 岩澤有祐, HSR

2023/9/13

We won an award at RoboCup@Home 2023.

辻知香葉, 小武海大, 和田輝, 綱島颯志, 生駒創, 白坂翠萌, 保呂蒼威, 大見謝恒和, 池田悠也, 松嶋達也, 岩澤有祐, HSR

2023/7/23

We won an award at RoboCup Japan Open 2023.

辻知香葉, 小武海大, 和田輝, 綱島颯志, 生駒創, 白坂翠萌, 保呂蒼威, 大見謝恒和, 池田悠也, 松嶋達也, 岩澤有祐, HSR

Last updated 2023/8/9

Projects

Continual learning for robot navigation

Improving policy adaptation for robot learning

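Non-ideal fixed points in self-repairing 3D printers

This study aims to prove mathematically that a printer with two non-ideal timing pulleys, one on the x-axis and one on the y-axis, cannot achieve self-repair without feedback.

Continuous-time Lagrangian relaxation for systems with delays

We aim to find useful heuristic representations of the dual variables, which can help in finding initial estimates for other problems.

Developing robot learning algorithms that use offline data

We are developing algorithms that use the logs accumulated during robot operation to learn control.

Creating an adaptive pruning robot platform for sustainable viticulture through robot learning that leverages simulation and intuitive teleoperation (JST SICORP DEMETER)

In this project, research teams from Japan, France, and Germany collaborate internationally to develop innovative robot technology for sustainable agriculture. The project also aims to broaden applicability to diverse fields by drawing on robot learning and teleoperation technology.

Selected publications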

松嶋達也, 野口裕貴, 有馬純平, 青木俊樹, 沖田祐樹, 池田悠也, 石本幸暉, 谷口尚平, Yuki Yamashita, 瀬戸翔一, 顧世翔, 岩澤有祐, 松尾豊

August 2022 Advanced Robotics

World robot challenge 2020 – partner robot: a data-driven approach for room tidying with mobile manipulator

Tidying up a household environment using a mobile manipulator poses various challenges in robotics, such as adaptation to large real-world environmental variations, and safe and robust deployment in the presence of humans. The Partner Robot Challenge in World Robot Challenge (WRC) 2020, a global competition held in September 2021, benchmarked tidying tasks in real home environments, and, importantly, tested for full system performances. For this challenge, we developed an entire household service robot system, which leverages a data-driven approach to adapt to numerous edge cases that occur during the execution, instead of classical manual pre-programmed solutions. In this paper, we describe the core ingredients of the proposed robot system, including visual recognition, object manipulation, and motion planning. Our robot system won the second prize, verifying the effectiveness and potential of data-driven robot systems for mobile manipulation in home environments.

古田拓毅, 松嶋達也, Tadashi Kozuno, 松尾豊, Sergey Levine, Ofir Nachum, 顧世翔

March 2021 ICML 2021

Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning

Progress in deep reinforcement learning (RL) research is largely enabled by benchmark task environments. However, analyzing the nature of those environments is often overlooked. In particular, we still do not have agreeable ways to measure the difficulty or solvability of a task, given that each has fundamentally different actions, observations, dynamics, rewards, and can be tackled with diverse RL algorithms. In this work, we propose policy information capacity (PIC) – the mutual information between policy parameters and episodic return – and policy-optimal information capacity (POIC) – between policy parameters and episodic optimality – as two environment-agnostic, algorithm-agnostic quantitative metrics for task difficulty. Evaluating our metrics across toy environments as well as continuous control benchmark tasks from OpenAI Gym and DeepMind Control Suite, we empirically demonstrate that these information-theoretic metrics have higher correlations with normalized task solvability scores than a variety of alternatives. Lastly, we show that these metrics can also be used for fast and compute-efficient optimizations of key design parameters such as reward shaping, policy architectures, and MDP properties for better solvability by RL algorithms without ever running full RL experiments.

松嶋達也, 古田拓毅, 松尾豊, Ofir Nachum, 顧世翔

January 2021 ICLR 2021