Are Open-Vocabulary Models Ready for Detection of MEP Elements on Construction Sites
Abdalwhab Bakheet Mohamed Abdalwhab, Ali Shariq Imran, Sina Heydarian, Ivanka Iordanova, David St-Onge
- Publication year
- 2025
- Citation count
- 1
- Access
- Open access
Abstract
The construction industry has long explored robotics and computer vision, yet their deployment on construction sites remains very limited. These technologies have the potential to revolutionize traditional workflows by enhancing accuracy, efficiency, and safety in construction management. Ground robots equipped with advanced vision systems could automate tasks such as monitoring mechanical, electrical, and plumbing (MEP) systems. The present research evaluates the applicability of open-vocabulary vision-language models compared to fine-tuned, lightweight, closed-set object detectors for detecting MEP components using a mobile ground robotic platform. A dataset collected with cameras mounted on a ground robot was manually annotated and analyzed to compare model performance. The results demonstrate that, despite the versatility of vision-language models, fine-tuned lightweight models still largely outperform them in specialized environments and for domain-specific tasks.