Mining scientific diagrams for semantic information
This talk is part of the Computational and Systems Biology series. If you have a question about this talk, please contact Emily Boyd.

Scientific data is often reported only as diagrams in publications, where it is effectively destroyed and lost. This data is often critically valuable to other scientists and to data abstracting services, and it frequently has to be recreated manually from the diagram at great expense, with waste and error. Examples include plots, charts, and more complex objects such as chemical structure diagrams and phylogenetic (evolutionary) trees.

I shall show how, in favourable circumstances, it is possible to recreate semantic information from diagrams using well-established Computer Vision techniques. These include thresholding, binarization, dilation and thinning, OCR, and a variety of domain-specific heuristics. Our Open Source library is based on BoofCV, an open-source Java image-processing library, and is enhanced with tools useful for scientific documents.

Some PDF documents contain vector images and are particularly tractable, while others contain only pixel images and suffer from overlap, problems of scale, and loss of detail. I shall show the application to chemistry and phylogenetics, and show where errors and loss occur.

http://www.slideshare.net/petermurrayrust/mining-scientific-images
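To make two of the techniques named in the abstract concrete, here is a minimal sketch of binarization by thresholding followed by dilation, written in plain Python on a tiny hand-made pixel grid. It is only an illustration: it does not use BoofCV or the speaker's library, and the function names, the threshold of 128, and the 3x3 neighbourhood are all assumptions chosen for the example.

```python
def binarize(gray, threshold=128):
    """Threshold a grayscale grid: dark pixels (ink) become 1, light pixels (paper) become 0.
    The threshold value is an illustrative assumption."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def dilate(binary):
    """Grow every foreground pixel into its 3x3 neighbourhood (assumed structuring element)."""
    height, width = len(binary), len(binary[0])
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if binary[y][x]:
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < height and 0 <= nx < width:
                            out[ny][nx] = 1
    return out


# A 5x5 "scan": a dark horizontal line with a one-pixel gap, on light paper.
gray = [
    [255, 255, 255, 255, 255],
    [255, 255, 255, 255, 255],
    [10, 30, 250, 20, 15],
    [255, 255, 255, 255, 255],
    [255, 255, 255, 255, 255],
]

binary = binarize(gray)   # the gap survives thresholding
closed = dilate(binary)   # dilation bridges the gap, at the cost of a thicker line
```

Dilation joins the broken line but also thickens it, which is why a pipeline of this kind would typically follow it with thinning to recover a one-pixel-wide stroke before interpreting the shape.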