Enhancing Construction Design Efficiency: An Approach to Data Extraction with Natural Language Processing for Technical Drawing

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Within the construction and design engineering sphere, integrating advanced technologies has become indispensable for streamlining processes and enhancing productivity. This paper explores the development and implementation of a design assist tool, combining Natural Language Processing (NLP) methodologies to extract semantic information from digital sources. Moreover, different tools’ ability to extract data from email bodies and attachments has been studied. The extraction process, comprising steps such as file detachment with different formats, data labelling, pre-processing techniques such as tokenisation, and feature engineering, requires selecting appropriate techniques, which this paper examines. In this study, data labelling with six features has been done for 278 emails, each of which contained attachments in various formats such as JPEG, PDF, PNG, DWG, etc. Using various tools and patterns such as Regular Expression (Regex), Tokenisation, Stemming, etc., for pre-processing step, made the data ready for training the model. The paper delves into the utility of libraries and models like SpaCy, NLTK, and BERT for efficient data extraction and natural language processing tasks, offering insights into their comparative strengths and suitability for diverse textual analysis needs. The findings suggest that judicious selection of techniques in an NLP project can significantly streamline processes, resulting in time and resource efficiencies. Moreover, the results indicate automated data extraction substantially reduces internal design review time, translating to expedited design turnover and significant cost savings.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Smart and Sustainable Built Environment, SASBE 2024
EditorsAli GhaffarianHoseini, Amirhosein Ghaffarianhoseini, Farzad Rahimian, Mahesh Babu Purushothaman
PublisherSpringer
Pages724-733
Number of pages10
ISBN (Print)9789819640508
DOIs
Publication statusPublished - 20 Apr 2025
EventInternational Conference of Sustainable Development and Smart Built Environments, SDSBE 2024 - Auckland, New Zealand
Duration: 7 Nov 20249 Nov 2024

Publication series

NameLecture Notes in Civil Engineering
Volume591 LNCE

Conference

ConferenceInternational Conference of Sustainable Development and Smart Built Environments, SDSBE 2024
Country/TerritoryNew Zealand
CityAuckland
Period7/11/249/11/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Fingerprint

Dive into the research topics of 'Enhancing Construction Design Efficiency: An Approach to Data Extraction with Natural Language Processing for Technical Drawing'. Together they form a unique fingerprint.

Cite this