Perception category papers (3,986)
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou, Wenqi Xian, Guandao Yang et al. (8 authors)
2025
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
Qianqian Bai, Zhongpu Chen, Ling Luo et al. (6 authors)
2025
Holistic Fusion: Task- and Setup-Agnostic Robot Localization and State Estimation with Factor Graphs
Julian Nubert, Turcan Tuna, Jonas Frey et al. (7 authors)
2025
How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM
Jirong Zha, Yuxuan Fan, Xiao Yang et al. (5 authors)
2025
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus, Carl Doersch, Yi Yang et al. (10 authors)
2025
Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification
Yasuhiro Yao, Ryoichi Ishikawa, Takeshi Oishi
2025
MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond
Shenghao Ren, Yi Lu, Jiayi Huang et al. (8 authors)
2025
Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding
Zahir Alsulaimawi
2025
A Self-Supervised Learning Approach with Differentiable Optimization for UAV Trajectory Planning
Yufei Jiang, Yuanzhu Zhan, Harsh Vardhan Gupta et al. (5 authors)
2025
Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping
Mouaad Boughellaba, Soulaimane Berkane, Abdelhamid Tayebi
2025
An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1
Farbod Younesi, Milad Rabiei, Soroush Keivanfard et al. (8 authors)
2025
X-Capture: An Open-Source Portable Device for Multi-Sensory Learning
Samuel Clarke, Suzannah Wistreich, Yanjie Ze et al. (4 authors)
2025
MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition
Trung Thanh Nguyen, Yasutomo Kawanishi, Vijay John et al. (5 authors)
2025
Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation
Junjie Chen, Yuecong Xu, Haosheng Li et al. (4 authors)
2025
UniCalib: Targetless LiDAR-Camera Calibration via Probabilistic Flow on Unified Depth Representations
Shu Han, Xubo Zhu, Ji Wu et al. (7 authors)
2025
A Retina-Inspired Pathway to Real-Time Motion Prediction inside Image Sensors for Extreme-Edge Intelligence
Subhradip Chakraborty, Shay Snyder, Md Abdullah-Al Kaiser et al. (6 authors)
2025
Multimodal Reference Visual Grounding
Yangxiao Lu, Ruosen Li, Liqiang Jing et al. (8 authors)
2025
Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems
Rachmad Vidya Wicaksana Putra, Pasindu Wickramasinghe, Muhammad Shafique
2025
Zero-Shot 4D Lidar Panoptic Segmentation
Yushan Zhang, Aljoša Ošep, Laura Leal-Taixé et al. (4 authors)
2025
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
Ning Lan, Baoshan Ou, Xuemei Xie et al. (4 authors)
2025
Trajectory Planning for Automated Driving using Target Funnels
Benjamin Bogenberger, Johannes Bürger, Vladislav Nenchev
2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres, Oliver Hahn, Charles Corbière et al. (6 authors)
2025
ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning
Zhenyang Liu, Yikai Wang, Sixiao Zheng et al. (7 authors)
2025
The Marine Debris Forward-Looking Sonar Datasets
Matias Valdenegro-Toro, Deepan Chakravarthi Padmanabhan, Deepak Singh et al. (5 authors)
2025