University of Cambridge > Talks.cam > Engineering Department Structures Research Seminars > Leveraging Text Content for Management of Construction Project Documents

Log in

University Account

External (via Google)

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Leveraging Text Content for Management of Construction Project Documents

Download to your calendar using vCal

Amr A. Kandil, Assistant Professor, School of Civil Engineering, Purdue University
Friday 09 May 2014, 15:00-16:00
Cambridge University Engineering Department, LR3B.

If you have a question about this talk, please contact Lorna Everett .

The construction industry is a knowledge intensive industry. Thousands of documents are generated by construction projects. Documents, as information carriers, must be managed effectively to ensure successful project management. The fact that a single project can produce thousands of documents and that a lot of the documents are generated in a textual/unstructured format greatly complicates the task of information management. Conventionally, project documents are organized based on classifying documents according to fixed/predefined classes and document metadata, e.g. according to document type, originator, project attribute, specification division, date, etc. While such classification method is easy to implement, it is only advantageous for document search and retrieval if the document seeker has prior knowledge of the content of the document corpus. In many cases and for various project management activities this is not the case, resulting in frustration of the search task with delayed or incomplete search results.

An alternative framework for organizing project documents based on document content is proposed. The framework takes into account important characteristics of construction project documents and leverages such characteristics to facilitate document search and retrieval. The premise for the framework is the fact that documents are not produced haphazardly, but are generated as a result of certain events or circumstances occurring in the project. As such documents can be linked to each other on the semantic level; a point that is overlooked by document management systems which generally manage documents in vacuo by disregarding or failing to utilize such semantic connections between the documents. Organizing project documents based on the semantic relations that exist between them (revealed from the document content and not just the document attributes) facilitates information retrieval and retains the knowledge of the actual project participants, thereby supporting knowledge reuse.

Another aspect of this research investigates the use of document content analysis to enable automated document management. If textual similarities between documents correlate with what human users recognize through their semantic abilities, then content analysis of documents can be used to automatically organize documents according to the proposed framework. Text classifiers based on machine learning techniques were evaluated to determine their performance in identifying which group of semantically-similar documents a test document belongs. Also, an unsupervised learning method was adapted and evaluated for the task of clustering documents based on textual similarity into sets of documents that are semantically related. The purpose of such evaluations is to equip electronic document management systems with content analysis capabilities that facilitate document search and retrieval.

This talk is part of the Engineering Department Structures Research Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Leveraging Text Content for Management of Construction Project Documents

📅 Download to calendar (vCal)

👤 Speaker: Amr A. Kandil, Assistant Professor, School of Civil Engineering, Purdue University 🔗 Website
📅 Date & Time: Friday 09 May 2014, 15:00 - 16:00
📍 Venue: Cambridge University Engineering Department, LR3B

Questions? Contact Lorna Everett

Abstract

Series This talk is part of the Engineering Department Structures Research Seminars series.

Included in Lists

Note: Ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Leveraging Text Content for Management of Construction Project Documents

This talk is included in these lists:

Leveraging Text Content for Management of Construction Project Documents

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Leveraging Text Content for Management of Construction Project Documents

This talk is included in these lists:

Other lists

Other talks

Leveraging Text Content for Management of Construction Project Documents

Abstract

Included in Lists