Multimodal Foundation and Large Language Models: Applications, Challenges, and Future Directions

Speaker:  Irwin King – Hong Kong, Hong Kong
Topic(s):  Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing

Abstract

In recent years, the field of artificial intelligence has witnessed significant advancements in multimodal foundation and large language models. This seminar presentation will provide an exploration of these models, focusing on their applications across various domains such as science, robotics, recommender systems, and watermarking. We will discuss the current trends in multimodal models, highlighting their growing importance in understanding and processing complex information. Additionally, we will delve into the challenges faced by these models, such as scalability, trustworthiness, and explore potential future directions for the field. By examining innovative approaches to improve these challenges and considering the impact of emerging technologies, we aim to inspire further research and innovation in this rapidly evolving field.

About this Lecture

Number of Slides:  50
Duration:  45 minutes
Languages Available:  English
Last Updated: 

Request this Lecture

To request this particular lecture, please complete this online form.

Request a Tour

To request a tour with this speaker, please complete this online form.

All requests will be sent to ACM headquarters for review.