Journal article
A Vision Language-Based Framework for Detecting Industrial Mechanical, Electrical, and Plumbing Assets Using Unlabelled Data
Masoud Kamali, Behnam Atazadeh, Abbas Rajabifard, Yiqun Chen, Ensiyeh Javaherian Pour
Sensors | MDPI AG | Published : 2026
DOI: 10.3390/s26082379
Open access
Abstract
There have been significant advancements in object detection using extensive labelled datasets. However, existing learning-based approaches remain constrained in industrial environments, primarily due to the limited diversity in training datasets; the lack of generalisation of close-set detectors to unseen asset categories; and the inherent spatial and geometric complexity of mechanical, electrical, and plumbing (MEP) assets. To address this challenge, we propose a new approach that leverages pre-trained vision language models and close-set object detectors to detect unseen MEP assets using unlabelled data. Experimental results reveal the superior performance of Grounding DINO using Swin B t..
View full abstractRelated Projects (3)
Grants
Awarded by Australian Research Council