TempTrans-MIL: A General Approach to Enhancing Multimodal Tactile-Driven Robotic Manipulation Classification Tasks
Jingnan Wang, Wenjia Ouyang, Senlin Fang, Yupo Zhang, Xinyu Wu, Zhengkun Yi
- 发表年份
- 2025
- 引用次数
- 3
摘要
In tactile-driven robotic manipulation, handling high-dimensional, multimodal tactile time series data from different tactile sensors presents challenges in feature extraction, processing, and interpretation. This study tackles these challenges by proposing a robust method that effectively processes tactile time series data, without visual input. We propose the temporal transformer-based multiple instance learning (TempTrans-MIL) model, a deep learning approach for high-dimensional tactile time series data in robotic manipulation classification tasks. The model is an MIL-based framework treats each short-term multimodal tactile time series as a bag of instances. It uses an inception module-based encoder for instance-level temporal feature extraction, and an MIL module to integrate bag-level features using tokenized transformers with learnable wavelet positional encoding. Extensive experiments in robotic manipulation tasks, using both publicly available and our own collected dataset, demonstrate that our proposed TempTrans-MIL model significantly outperforms baselines. Our proposed model achieves a good balance between classification accuracy and computational efficiency, with accuracies of 88.50% for surface material recognition, 91.75% for grasp stability detection, and 99.31% for robotic palpation task, highlighting its superior performance across various tasks.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002