感知 分类论文(3,985)
清除筛选 ✕Foundation models on the bridge: Semantic hazard detection and safety maneuvers for maritime autonomy with vision-language models
Kim Alexander Christensen, Andreas Gudahl Tufte, Alexey Gusev 等 8 位作者
2025
Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
Song Wang, Lingdong Kong, Xiaolu Liu 等 7 位作者
2025
UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots
Nan Jiang, Zimo He, Wanhe Yu 等 11 位作者
2025
MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation
Fuqiang Gu, Yuanke Li, Xianlei Long 等 7 位作者
2025
PointRAFT: 3D deep learning for high-throughput prediction of potato tuber weight from partial point clouds
Pieter M. Blok, Haozhou Wang, Hyun Kwon Suh 等 6 位作者
2025
Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks
Yongtao Chen, Yanbo Wang, Wentao Zhao 等 6 位作者
2025
Unsupervised Learning for Detection of Rare Driving Scenarios
Dat Le, Thomas Manhardt, Moritz Venator 等 4 位作者
2025
PCR-ORB: Enhanced ORB-SLAM3 with Point Cloud Refinement Using Deep Learning-Based Dynamic Object Filtering
Sheng-Kai Chen, Jie-Yu Chao, Jr-Yu Chang 等 5 位作者
2025
The Dawn of Agentic EDA: A Survey of Autonomous Digital Chip Design
Zelin Zang, Yuhang Song, Aili Wang 等 9 位作者
2025
Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection
Runwei Guan, Jianan Liu, Shaofeng Liang 等 10 位作者
2025
MUSON: A Reasoning-oriented Multimodal Dataset for Socially Compliant Navigation in Urban Environments
Zhuonan Liu, Xinyu Zhang, Zishuo Wang 等 6 位作者
2025
Depth Anything in $360^\circ$: Towards Scale Invariance in the Wild
Hualie Jiang, Ziyang Song, Zhiqiang Lou 等 5 位作者
2025
RGS-SLAM: Robust Gaussian Splatting SLAM with One-Shot Dense Initialization
Wei-Tse Cheng, Yen-Jen Chiou, Yuan-Fu Yang
2025
SCAFusion: A Multimodal 3D Detection Framework for Small Object Detection in Lunar Surface Exploration
Xin Chen, Kang Luo, Yangyi Xiao 等 4 位作者
2025
A Unified AI, Embedded, Simulation, and Mechanical Design Approach to an Autonomous Delivery Robot
Amro Gamar, Ahmed Abduljalil, Alargam Mohammed 等 5 位作者
2025
World-Coordinate Human Motion Retargeting via SAM 3D Body
Zhangzheng Tu, Kailun Su, Shaolong Zhu 等 4 位作者
2025
PanoGrounder: Bridging 2D and 3D with Panoramic Scene Representations for VLM-based 3D Visual Grounding
Seongmin Jung, Seongho Choi, Gunwoo Jeon 等 5 位作者
2025
OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective
Markus Gross, Sai B. Matha, Aya Fahmy 等 6 位作者
2025
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving
Long Nguyen, Micha Fauth, Bernhard Jaeger 等 7 位作者
2025
Drift-Corrected Monocular VIO and Perception-Aware Planning for Autonomous Drone Racing
Maulana Bisyir Azhari, Donghun Han, Je In You 等 5 位作者
2025
FAR-AVIO: Fast and Robust Schur-Complement Based Acoustic-Visual-Inertial Fusion Odometry with Sensor Calibration
Hao Wei, Peiji Wang, Qianhao Wang 等 6 位作者
2025
KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System
Zhongyu Xia, Wenhao Chen, Yongtao Wang 等 4 位作者
2025
From Visual Perception to Deep Empathy: An Automated Assessment Framework for House-Tree-Person Drawings Using Multimodal LLMs and Multi-Agent Collaboration
Shuide Wen, Yu Sun, Beier Ku 等 7 位作者
2025
LoGoPlanner: Localization Grounded Navigation Policy with Metric-aware Visual Geometry
Jiaqi Peng, Wenzhe Cai, Yuqiang Yang 等 6 位作者
2025