COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Engineering Department Structures Research Seminars > Leveraging Text Content for Management of Construction Project Documents
Leveraging Text Content for Management of Construction Project DocumentsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Lorna Everett. The construction industry is a knowledge intensive industry. Thousands of documents are generated by construction projects. Documents, as information carriers, must be managed effectively to ensure successful project management. The fact that a single project can produce thousands of documents and that a lot of the documents are generated in a textual/unstructured format greatly complicates the task of information management. Conventionally, project documents are organized based on classifying documents according to fixed/predefined classes and document metadata, e.g. according to document type, originator, project attribute, specification division, date, etc. While such classification method is easy to implement, it is only advantageous for document search and retrieval if the document seeker has prior knowledge of the content of the document corpus. In many cases and for various project management activities this is not the case, resulting in frustration of the search task with delayed or incomplete search results. An alternative framework for organizing project documents based on document content is proposed. The framework takes into account important characteristics of construction project documents and leverages such characteristics to facilitate document search and retrieval. The premise for the framework is the fact that documents are not produced haphazardly, but are generated as a result of certain events or circumstances occurring in the project. As such documents can be linked to each other on the semantic level; a point that is overlooked by document management systems which generally manage documents in vacuo by disregarding or failing to utilize such semantic connections between the documents. Organizing project documents based on the semantic relations that exist between them (revealed from the document content and not just the document attributes) facilitates information retrieval and retains the knowledge of the actual project participants, thereby supporting knowledge reuse. Another aspect of this research investigates the use of document content analysis to enable automated document management. If textual similarities between documents correlate with what human users recognize through their semantic abilities, then content analysis of documents can be used to automatically organize documents according to the proposed framework. Text classifiers based on machine learning techniques were evaluated to determine their performance in identifying which group of semantically-similar documents a test document belongs. Also, an unsupervised learning method was adapted and evaluated for the task of clustering documents based on textual similarity into sets of documents that are semantically related. The purpose of such evaluations is to equip electronic document management systems with content analysis capabilities that facilitate document search and retrieval. This talk is part of the Engineering Department Structures Research Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsSusan Gathercole Department of Geography Slavonic Film and Media Studies Cambridge Public Policy Workshops C.U. Cryptographic Currency Society Type the title of a new list hereOther talksThe Global Warming Sceptic How to rediscover a medical secret in eighteenth-century France: the lost recipe of the Chevalier de Guiller's powder febrifuge Art speak Current-Induced Stresses in Ceramic Lithium-Ion Conductors The Age of the Applied Economist: The Transformation of Economics Since the 1970s Adaptation in log-concave density estimation An SU(3) variant of instanton homology for webs Single Cell Seminars (August) Networks, resilience and complexity Singularities of Hermitian-Yang-Mills connections and the Harder-Narasimhan-Seshadri filtration BP KEYNOTE LECTURE: Importance of C-O Bond Activation for CO2/COUtilization - An Approach to Energy Conversion and Storage |