Perception category papers (3,985)
Adapting Reinforcement Learning for Path Planning in Constrained Parking Scenarios
Feng Tao, Luca Paparusso, Chenyi Gu, et al. (9 authors)
2026
SHED Light on Segmentation for Dense Prediction
Seung Hyun Lee, Sangwoo Mo, Stella X. Yu
2026
VideoAesBench: Benchmarking the Video Aesthetics Perception Capabilities of Large Multimodal Models
Yunhao Li, Sijing Wu, Zhilin Gao, et al. (8 authors)
2026
SDCM: Simulated Densifying and Compensatory Modeling Fusion for Radar-Vision 3-D Object Detection in Internet of Vehicles
Shucong Li, Xiaoluo Zhou, Yuqian He, et al. (4 authors)
2026
IROS: A Dual-Process Architecture for Real-Time VLM-Based Indoor Navigation
Joonhee Lee, Hyunseung Shin, Jeonggil Ko
2026
Don't double it: Efficient Agent Prediction in Occlusions
Anna Rothenhäusler, Markus Mazzola, Andreas Look, et al. (5 authors)
2026
4D-CAAL: 4D Radar-Camera Calibration and Auto-Labeling for Autonomous Driving
Shanliang Yao, Zhuoxiao Li, Runwei Guan, et al. (11 authors)
2026
DSCD-Nav: Dual-Stance Cooperative Debate for Object Navigation
Weitao An, Qi Liu, Chenghao Xu, et al. (7 authors)
2026
Thinker: A vision-language foundation model for embodied intelligence
Baiyu Pan, Daqin Luo, Junpeng Yang, et al. (7 authors)
2026
InspecSafe-V1: A Multimodal Benchmark for Safety Assessment in Industrial Inspection Scenarios
Zeyi Liu, Shuang Liu, Jihai Min, et al. (10 authors)
2026
Li-ViP3D++: Query-Gated Deformable Camera-LiDAR Fusion for End-to-End Perception and Trajectory Prediction
Matej Halinkovic, Nina Masarykova, Alexey Vinel, et al. (4 authors)
2026
When Simultaneous Localization and Mapping Meets Wireless Communications: A Survey
Konstantinos Gounis, Sotiris A. Tegos, Dimitrios Tyrovolas, et al. (5 authors)
2026
HMVLA: Hyperbolic Multimodal Fusion for Vision-Language-Action Models
Kun Wang, Xiao Feng, Mingcheng Qu, et al. (4 authors)
2026
LangGS-SLAM: Real-Time Language-Feature Gaussian Splatting SLAM
Seongbo Ha, Sibaek Lee, Kyungsu Kang, et al. (6 authors)
2026
VGGT-SLAM 2.0: Real-time Dense Feed-forward Scene Reconstruction
Dominic Maggio, Luca Carlone
2026
How Does Delegation in Social Interaction Evolve Over Time? Navigation with a Robot for Blind People
Rayna Hata, Masaki Kuribayashi, Allan Wang, et al. (5 authors)
2026
The S3LI Vulcano Dataset: A Dataset for Multi-Modal SLAM in Unstructured Planetary Environments
Riccardo Giubilato, Marcus Gerhard Müller, Marco Sewtz, et al. (6 authors)
2026
Towards Gold-Standard Depth Estimation for Tree Branches in UAV Forestry: Benchmarking Deep Stereo Matching Methods
Yida Lin, Bing Xue, Mengjie Zhang, et al. (5 authors)
2026
Learned split-spectrum metalens for obstruction-free broadband imaging in the visible
Seungwoo Yoon, Dohyun Kang, Eunsue Choi, et al. (12 authors)
2026
Perception-to-Pursuit: Track-Centric Temporal Reasoning for Open-World Drone Detection and Autonomous Chasing
Venkatakrishna Reddy Oruganti
2026
Neuromorphic BrailleNet: Accurate and Generalizable Braille Reading Beyond Single Characters through Event-Based Optical Tactile Sensing
Naqash Afzal, Niklas Funk, Erik Helmut, et al. (5 authors)
2026
HomoFM: Deep Homography Estimation with Flow Matching
Mengfan He, Liangzheng Sun, Chunyu Li, et al. (4 authors)
2026
Strip-Fusion: Spatiotemporal Fusion for Multispectral Pedestrian Detection
Asiegbu Miracle Kanu-Asiegbu, Nitin Jotwani, Xiaoxiao Du
2026
SPACE-CLIP: Spatial Perception via Adaptive CLIP Embeddings for Monocular Depth Estimation
Taewan Cho, Taeryang Kim, Andrew Jaeyong Choi
2026