Skip to content

TianxingChen/RoboScholar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RoboScholar: A Comprehensive Paper List of Embodied AI and Robotics Research

It's RoboScholar Project Here, started by Tianxing Chen.
Related Information:

Lumina Embodied AI Community

Embodied-AI-Guide

Sections

Recent Random Papers

Open-Vocabulary 3D Articulated Objects Modeling https://arxiv.org/pdf/2507.02747

  • [] [arXiv 25] LEMON: Learning 3D Human-Object Interaction Relation from 2D Images, arXiv

  • [] [arXiv 25] Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation, arXiv

  • [] [RSS 25] Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation, arXiv

  • [] [arXiv 24] GRAPE: Generalizing Robot Policy via Preference Alignment, arXiv

  • [] [arXiv 25] GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill, arXiv

  • [] [arXiv 24] Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers, arXiv

1. Diffusion Model for Planning, Policy, and RL

  • [] [arXiv 24] Surgical Robot Transformer: Imitation Learning for Surgical Tasks, website

6. Generative Model for Embodied

  • [] [arXiv 24] Generative Image as Action Models, website

  • [] [arXiv 24] Genie: Generative Interactive Environments, website

9. Pose Estimation and Tracking

  • [] [CVPR 24 (Highlight)] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects, website

  • [] [CVPR 23 (Highlight)] GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts, website

  • [] [arXiv 23] GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects, website

  • [] [arXiv 24] ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics, website

  • [] [ICCV 23] AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose, website

  • [] [CVPR 23] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects, website

  • [] [arXiv 24] WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild, website

TO READ

  1. Stabilizing Transformers for Reinforcement Learning

    • Summary: 本文提出了Gated Transformer-XL (GTrXL),一种改进的Transformer架构,用于解决标准Transformer在强化学习中的优化难题。通过引入层归一化和门控机制,GTrXL在部分可观察性环境中取得了优于LSTM的性能。
    • 链接
  2. CoBERL: Contrastive BERT for Reinforcement Learning

    • Summary: 文章介绍了CoBERL,它结合了对比损失和Transformer架构,通过双向掩码预测和对比学习方法提高强化学习中的数据效率和性能。
    • 链接
  3. Adaptive Transformers in RL

    • Summary: 该研究探索了在强化学习中使用具有自适应注意力跨度的Transformer模型,发现这种方法能够提高模型在需要长期依赖的环境中的性能。
    • 链接
  4. Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation

    • Summary: 本文提出了Actor-Learner Distillation (ALD)方法,通过从大型学习者模型向小型执行者模型进行知识蒸馏,以提高Transformer在强化学习中的样本效率。
    • 链接
  5. Deep Transformer Q-Networks for Partially Observable Reinforcement Learning

    • Summary: 介绍了Deep Transformer Q-Networks (DTQN),这是一种新型的强化学习架构,使用Transformer的自注意力机制来处理部分可观察性任务,并在多个挑战性环境中展示了有效性。
    • 链接
  6. CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer

    • Summary: CtrlFormer是一种新型的Transformer架构,专注于通过学习可迁移的状态表示来提高视觉控制任务的样本效率,特别强调了在跨任务迁移学习方面的优势。
    • 链接

Sapiens: Foundation for Human Vision Models: https://about.meta.com/realitylabs/codecavatars/sapiens General Flow as Foundation Affordance for Scalable Robot Learning https://general-flow.github.io/

About

RoboScholar: A Comprehensive Paper List of Embodied AI and Robotics Research

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published