Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection

Abstract

While recent advancements in deep neural networks (DNNs) have substantiallyenhanced visual AI's capabilities, the challenge of inadequate data diversityand volume remains, particularly in construction domain. This study presents anovel image synthesis methodology tailored for construction worker detection,leveraging the generative-AI platform Midjourney. The approach entailsgenerating a collection of 12,000 synthetic images by formulating 3000different prompts, with an emphasis on image realism and diversity. Theseimages, after manual labeling, serve as a dataset for DNN training. Evaluationon a real construction image dataset yielded promising results, with the modelattaining average precisions (APs) of 0.937 and 0.642 atintersection-over-union (IoU) thresholds of 0.5 and 0.5 to 0.95, respectively.Notably, the model demonstrated near-perfect performance on the syntheticdataset, achieving APs of 0.994 and 0.919 at the two mentioned thresholds.These findings reveal both the potential and weakness of generative AI inaddressing DNN training data scarcity.

Quick Read (beta)

loading the full paper ...