🔬 Robotics research
Frontline robotics research — arXiv papers, patents, research institutes. Daily updates; filter by category, year, citation count.
50 papers
Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers
Shuhong Zheng, Michael Oechsle, Erik Sandström +3 more
2026
Robotic Strawberry Harvesting with Robust Vision and Deep Reinforcement Learning based Sim-to-Real Control
Al Bashir, Shao-Yang Chang, Partho Ghose +3 more
2026
Point Tracking Improves World Action Models
Jiarui Guan, Wenshuai Zhao, Yue Pei +3 more
2026
Instrumentation for Imitation Learning: Enhancing Training Datasets for Clothes Hanger Insertion
Remko Proesmans, Thomas Lips, Francis wyffels
2026
SFG-ROS: A Resource-Aware Framework for Dense Multi-Agent Perception
Constantin Blessing, Elias Geiger, Jakob Häringer +2 more
2026
Direct Dynamic Retargeting for Humanoid Imitation Learning from Videos
Constant Roux, Ludovic De Matteïs, Armand Jordana +4 more
2026
Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking
Ming Yang, Tao Yu, Feng Li +1 more
2026
Vision-Based Agile Landing on Turbulent Waters
Dimosthenis Angelis, Leonard Bauersfeld, Davide Scaramuzza +1 more
2026
How Many Training Samples Are Needed for the Inverse Kinematics Solutions by Artificial Neural Networks
Dong-Won Lim
2026
TactileReflex: Noise-Statistics-Driven Vision-Tactile Reflex Control for Force-Sensitive Manipulation
Ziyan Feng, Yulong Fu, Zheng Li +6 more
2026
ComPose: When to Trust Hands for Object Pose Tracking
Jisu Shin, Junoh Lee, JunGyu Lee +5 more
2026
Semantically Structured Mixture-of-Experts for Compositional Robotic Manipulation
Chengyu Deng, Guanqi Chen, Yizhou Chen +4 more
2026
Joint Target-Less Intrinsic and Extrinsic Camera-LiDAR Calibration using Deep Point Correspondences
Simon Bultmann, Daniele Cattaneo, Abhinav Valada
2026
Droneulator: A Portable UAV Simulator for Agricultural Workflows with RotorPy and Godot 4
Jacob Swindell, Michael Lowen, Marija Popovic +1 more
2026
Multi-Floor Exploration for Ground Robots via an Incremental Reachable Graph and Structural Priors
Zhiwen Zhu, Jiaqi Chen, Xiangyi Huang +2 more
2026
Sparse Compositional Flow Matching by geometric assembly from motion primitives
Yan Tang, Yuanbo Tang, Tingyu Cao +2 more
2026
ChainFlow-VLA: Causal Flow Planning with Vision-Language Models
Xiyang Wang, Xinlin Wang, Tingguang Zhou +7 more
2026
6G Communication Networks Enabling Embodied Agents: Architecture and Prototype
Lipeng Dai, Luping Xiang, Kun Yang
2026
Turning Adaptation into Assets: Cross-Domain Bridging for Online Vision-Language Navigation
Zixuan Hu, Xuantuo Huang, Yancheng Li +3 more
2026
Signal Temporal Logic Motion Planning via Graphs of Convex Sets
Yu Chen, Ancheng Hou, Mingyang Feng +2 more
2026
Lipschitz Optimization for Formal Verification of Homographies
Jean-Guillaume Durand, Panagiotis Kouvaros, Maxime Gariel +1 more
2026
IntentionNav: A Benchmark for Intent-Driven Object Navigation from Implicit Human Instruction
Lin Qian, Shijie Li, Sihao Lin +4 more
2026
Autonomous Frontier-Based Exploration with VLM Guidance
Aarush Aitha, Avideh Zakhor
2026
RoboSurg-VQA: A Multimodal Benchmark for Surgical Segmentation-Aware Visual Question Answering
Chengyi Zhang, Zi Ye, Ziyang Wang
2026
Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations
Helena Merker, Nick Walker, Andreea Bobu
2026
GesVLA: Gesture-Aware Vision-Language-Action Model Embedded Representations
Wenxuan Guo, Ziyuan Li, Meng Zhang +7 more
2026
Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning
Ismail Geles, Leonard Bauersfeld, Markus Wulfmeier +1 more
2026
Scout-Assisted Planning for Heterogeneous Robot Teams under Partially Known Environments
Hoang-Dung Bui, Abhish Khanal, Raihan Islam Arnob +1 more
2026
Agentic-VLA: Efficient Online Adaptation for Vision-Language-Action Models
Ruofan Jin, Zaixi Zhang
2026
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
Jiaxu Wang, Junhao He, Jingkai Sun +5 more
2026
Understanding Multimodal Failure in Action-Chunking Behavioral Cloning
Lorenzo Mazza, Massimiliano Datres, Ariel Rodriguez +3 more
2026
Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors
Jiahe Chen, ZiRui Wang, Feiyu Jia +7 more
2026
Action with Visual Primitives
Weilong Guo, Yuchen Wang, Renping Zhou +6 more
2026
EvoScene-VLA: Evolving Scene Beliefs Inside the Action Decoder for Chunked Robot Control
Chushan Zhang, Ruihan Lu, Jinguang Tong +3 more
2026
SceneGraphGrounder: Zero-Shot 3D Visual Grounding via Structured Scene Graph Matching
Xuefei Sun, Xujia Zhang, Brendan Crowe +2 more
2026
GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation
Kaichen Zhou, Yuzhen Chen, Fangneng Zhan +8 more
2026
Learning Altruistic Collaboration in Heterogeneous Multi-Team Systems
Riwa Karam, Ruoyu Lin, Brooks A. Butler +1 more
2026
PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects
Ziang Cao, Yinghao Liu, Haitian Li +5 more
2026
PointACT: Vision-Language-Action Models with Multi-Scale Point-Action Interaction
Shizhe Chen, Paul Pacaud, Cordelia Schmid
2026
Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation
Yicheng Jiang, Jiaxu Wang, Junhao He +8 more
2026
AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions
Minghao Chen, Xinyi Hu, Zhou Yu +1 more
2026
LiteViLNet: Lightweight Vision-LiDAR Fusion Network for Efficient Road Segmentation
Daojie Peng, Bingtao Wang, Fulong Ma +2 more
2026
ArchSIBench: Benchmarking the Architectural Spatial Intelligence of Vision-Language Models
Qirui Shen, Wenda Wang, Jiachen Lu +5 more
2026
TERDNet: Transformer Encoder-Recurrent Decoder Network for Scene Change Detection
Jiae Yoon, Ue-Hwan Kim
2026
VSCD: Video-based Scene Change Detection in Unaligned Scenes
Jiae Yoon, Ue-Hwan Kim
2026
NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic
Paapa Kwesi Quansah, Ernest Bonnah
2026
Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition
Benedict Quartey, Sebastian Castro, Eric Rosen +3 more
2026
The Yes-Man Syndrome: Benchmarking Abstention in Embodied Robotic Agents
Doguhan Yeke, Elif Su Temirel, Ananth Shreekumar +3 more
2026
SUGAR: A Scalable Human-Video-Driven Generalizable Humanoid Loco-Manipulation Learning Framework
Tianshu Wu, Xiangqi Kong, Yue Chen +5 more
2026
Spatially Prompted Visual Trajectory Prediction for Egocentric Manipulation
Yifan Li, Xinyu Zhou, Yunhao Ge +1 more
2026