基于扩散模型的大规模生成式视频生成大模型回顾与分析
Speaker
Xinyuan Chen
Shanghai Artificial Intelligence Lab
Time
Venue
Online
Meeting Info
腾讯会议
ID: 353135191
Password:111457
Abstract
As OpenAI introduces Sora, a generative text-to-video diffusion model, it opens the door to generation of high-definition videos at the minute level from text descriptions. This groundbreaking model not only showcases the vast potential of video generation but also captures the attention of researchers and enthusiasts alike. In this talk, we will delve into the evolutionary path of large-scale video generation models and explore key research milestones. We will then analyze the technical advancements and breakthrough effects achieved by the Sora model. However, while the success of Sora, there still exist limitations and bottlenecks in video generation. Concluding the talk, we will discuss the challenges faced in video generation and explore potential avenues for future breakthroughs.
Bio
Dr. Xinyuan Chen is currently a researcher at the Shanghai Artificial Intelligence Lab, collaborating closely with Prof. Yu Qiao. In 2020, she completed her dual PhD from Shanghai Jiao Tong University and the University of Technology Sydney, under the supervision of Prof. Xiaokang Yang and Prof. Dacheng Tao. Her research interests lie in generative models, diffusion models, and generative adversarial networks. Previously, she has focused her work on image and video generation, large-scale video generation models, as well as controllable generation incorporating multi-modality and semantic conditions.
Sponsor
Institute of Natural Sciences, Shanghai Jiao Tong University
Shanghai National Center for Applied Mathematics (SJTU Center)