COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Cambridge ML Systems Seminar Series > Balancing Quality and Efficiency in Future AI Systems
Balancing Quality and Efficiency in Future AI SystemsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Sally Matthews. Quality and efficiency are both essential in AI systems as data sources become more diverse and model sizes grow. In this talk, I will present techniques to address challenges in data and model quality as well as their efficiency, which are essential for building high-performing and sustainable AI systems. I will first introduce a toolkit for enhancing the quality of datasets, which can be used in a broad range of learning tasks including the training or fine tuning large language models (LLMs), laying the groundwork for model training with good data. Then, considering the specific challenge where data is distributed unevenly across sources with varying sizes, quality, and availability, such as in the case of federated learning, I will introduce the FedAU algorithm. This algorithm dynamically adjusts aggregation weights in the model training process based on the availability of data sources, to prevent model bias and improve training convergence. Afterwards, I will introduce techniques to make both training and inference more efficient, focusing on a framework that optimizes model selection from a zoo of LLMs to minimize energy usage while maintaining model performance guarantees. Together, these approaches form a blueprint for future AI systems that are capable of learning effectively from a vast amount of data at diverse sources and delivering high quality models while enhancing resource efficiency in real-world applications. Bio: Shiqiang Wang is a Staff Research Scientist at IBM T . J. Watson Research Center, NY, USA . He received his Ph.D. from Imperial College London, United Kingdom, in 2015. His research focuses on the intersection of distributed computing, machine learning, networking, and optimization, currently emphasizing on quality and efficiency aspects related to distributed data and models, which has a broad range of applications including distributed data analytics, efficient model training and inference, edge-based artificial intelligence (Edge AI), and large language models (LLMs). He received the IEEE Communications Society (ComSoc) Leonard G. Abraham Prize in 2021, IEEE ComSoc Best Young Professional Award in Industry in 2021, IBM Outstanding Technical Achievement Awards (OTAA) in 2019, 2021, 2022, and 2023, multiple Invention Achievement Awards from IBM since 2016. For more details, please visit his homepage at https://shiqiang.wang/ This talk is part of the Cambridge ML Systems Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listspolitical thought Plant Sciences 'ABC' Seminars Charity ISOCOther talksGlacial Fractures: An Environmental Art History of Sioqqap Sermia, Greenland The voyage of the sheep from Tibet: animal breeding in the 18th-century French Empire RSC Prizewinner's Talk: Microfluidics and engineered biomembranes as enabling technologies in synthetic cell design JCTS Presentations The post-Variscan evolution of Ireland – a case study in constraining shallow crustal exhumation and deformation On the (Local) Lifting Property |