Perception category papers (3,985)
Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving
Ziang Guo, Feng Yang, Xuefeng Zhang et al. (9 authors)
2026
SMc2f: Robust Scenario Mining for Robotic Autonomy from Coarse to Fine
Yifei Chen, Ross Greer
2026
AI for Green Spaces: Leveraging Autonomous Navigation and Computer Vision for Park Litter Removal
Christopher Kao, Akhil Pathapati, James Davis
2026
SurfSLAM: Sim-to-Real Underwater Stereo Reconstruction For Real-Time SLAM
Onur Bagoren, Seth Isaacson, Sacchin Sundar et al. (9 authors)
2026
BikeActions: An Open Platform and Benchmark for Cyclist-Centric VRU Action Recognition
Max A. Buettner, Kanak Mazumder, Luca Koecher et al. (6 authors)
2026
UEOF: A Benchmark Dataset for Underwater Event-Based Optical Flow
Nick Truong, Pritam P. Karmokar, William J. Beksi
2026
Predicting When to Trust Vision-Language Models for Spatial Reasoning
Muhammad Imran, Yugyung Lee
2026
LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving
Carlo Sgaravatti, Riccardo Pieroni, Matteo Corno et al. (6 authors)
2026
AquaFeat+: an Underwater Vision Learning-based Enhancement Method for Object Detection, Classification, and Tracking
Emanuel da Costa Silva, Tatiana Taís Schein, José David García Ramos et al. (7 authors)
2026
Multimodal Signal Processing For Thermo-Visible-Lidar Fusion In Real-time 3D Semantic Mapping
Jiajun Sun, Yangyi Ou, Haoyuan Zheng et al. (5 authors)
2026
Older Adults' Preferences for Feedback Cadence from an Exercise Coach Robot
Roshni Kaushik, Reid Simmons
2026
VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory
Shaoan Wang, Yuanfei Luo, Xingyu Chen et al. (9 authors)
2026
Do Open-Vocabulary Detectors Transfer to Aerial Imagery? A Comparative Evaluation
Christos Tsourveloudis
2026
Heterogeneous computing platform for real-time robotics
Jakub Fil, Yulia Sandamirskaya, Hector Gonzalez et al. (20 authors)
2026
Real2Sim via Active Perception with Behavior Trees Automatically Generated by VLMs
Alessandro Adami, Sebastian Zudaire, Ruggero Carli et al. (4 authors)
2026
SPARK: Scalable Real-Time Point Cloud Aggregation with Multi-View Self-Calibration
Chentian Sun
2026
Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2
Yizhan Feng, Hichem Snoussi, Jing Teng et al. (7 authors)
2026
Efficient Incremental SLAM via Information-Guided and Selective Optimization
Reza Arablouei
2026
Hiking in the Wild: A Scalable Perceptive Parkour Framework for Humanoids
Shaoting Zhu, Ziwen Zhuang, Mengjie Zhao et al. (5 authors)
2026
FlyCo: Foundation Model-Empowered Drones for Autonomous 3D Structure Scanning in Open-World Environments
Chen Feng, Guiyong Zheng, Tengkai Zhuang et al. (9 authors)
2026
WaveMan: mmWave-Based Room-Scale Human Interaction Perception for Humanoid Robots
Yuxuan Hu, Kuangji Zuo, Boyu Ma et al. (7 authors)
2026
Modeling Descriptive Norms in Multi-Agent Systems: An Auto-Aggregation PDE Framework with Adaptive Perception Kernels
Chao Li, Ilia Derevitskii, Sergey Kovalchuk
2026
PointSLAM++: Robust Dense Neural Gaussian Point Cloud-based SLAM
Xu Wang, Boyao Han, Xiaojun Chen et al. (5 authors)
2026
InsSo3D: Inertial Navigation System and 3D Sonar SLAM for turbid environment inspection
Simon Archieri, Ahmet Cinar, Shu Pan et al. (7 authors)
2026