COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > Big data integration: challenges and new approaches
Big data integration: challenges and new approachesAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact INI IT. DLAW02 - Data linkage: techniques, challenges and applications Data integration is a key challenge for Big Data applications to semantically enrich and combine large sets of heterogeneous data for enhanced data analysis. In many cases, there is also a need to deal with a very high number of data sources, e.g., product offers from many e-commerce websites. We will discuss approaches to deal with the key data integration tasks of (large-scale) entity resolution and schema matching. In particular, we discuss parallel blocking and entity resolution on Hadoop platforms together with load balancing techniques to deal with data skew. We also discuss challenges and recent approaches for holistic data integration of many data sources, e.g., to create knowledge graphs or to make use of huge collections of web tables. This talk is part of the Isaac Newton Institute Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsFaraday Institute Theory of Condensed Matter Culture of Scientific Research CUED Computer Vision Research Seminars BlueSci - Workshops on Science Communication Perspectives from Cambridge AssessmentOther talksPropaganda porcelain: The mirror of the Russian revolution and its consequences Louisiana Creole - a creole at the periphery Player 2 has entered the game - ways of working towards open science How could education systems research prompt a change to how DFIS works on education |