Content-based text mining technique for retrieval of CAD documents

作者:Yu, Wen-der; Hsu, Jia-yang 刊名:Automation in Construction 上传者:刘永强

【摘要】The computer aided design (CAD) document provides an effective communication medium, a legal contract document, and a reusable design case for a construction project. Due to technological advancements in CAD industry, the volume of CAD documents has been increased dramatically in the database of construction organizations. Traditional retrieval methods relied on textual naming and indexing schemes that require the designers (engineers and architects) to memorize in details the meta-information used to characterize the drawings. Such approaches easily overwhelmed the users' memory capability and thus caused low reusability of CAD documents. In this paper, a content-based text mining technique is adopted to extract the textual content of a CAD document into a characteristic document (CD), which can be retrieved with similarity matching using a Vector Space Model (VSM), so that the automated and expedited retrievals of CAD documents from vast CAD databases become possible. A prototype system, namely Content-based CAD document Retrieval System (CCRS), is developed to implement the proposed method. After preliminary testing with a CAD database with 2094 Chinese annotated CAD drawings collected from two real-world construction projects and a public engineering drawing database, the proposed CCRS is proven to retrieve all relevant CAD documents with relatively high precision when appropriate query is specified. Finally, three search strategies are recommended for the users to narrow down search scope while a target CAD document is desired. It is concluded that the proposed content-based text mining approach provides a promising solution to improve the current difficulty encountered in retrieval and reusability of vast CAD documents for the construction industry. [All rights reserved Elsevier].

全文阅读

Automation in Construction 31 (2013) 65–74 Contents lists available at SciVerse ScienceDirect Automation in Construction j ourna l homepage: www.e lsev ie r .com/ locate /autconContent-based text mining technique for retrieval of CAD documents Wen-der Yu ⁎, Jia-yang Hsu Department of Construction Management, Chung Hua University, Hsinchu 300, Taiwan, ROC⁎ Corresponding author. Tel.: +886 3 5186748; fax: + E-mail address: wenderyu@chu.edu.tw (W. Yu). 0926-5805/$ – see front matter © 2012 Elsevier B.V. All http://dx.doi.org/10.1016/j.autcon.2012.11.037a b s t r a c ta r t i c l e i n f oArticle history: Accepted 25 November 2012 Available online 25 December 2012 Keywords: CAD Text mining Information retrieval Characteristic document Construction engineeringThe computer aided design (CAD) document provides an effective communication medium, a legal contract document, and a reusable design case for a construction project. Due to technological advancements in CAD industry, the volume of CAD documents has been increased dramatically in the database of construction or- ganizations. Traditional retrieval methods relied on textual naming and indexing schemes that require the designers (engineers and architects) to memorize in details the meta-information used to characterize the drawings. Such approaches easily overwhelmed the users' memory capability and thus caused low reusability of CAD documents. In this paper, a content-based text mining technique is adopted to extract the textua

参考文献

引证文献

问答

我要提问