基于扩散模型的大规模生成式视频生成大模型回顾与分析
Speaker

Xinyuan Chen
Shanghai Artificial Intelligence Lab

Time
2024-02-29 15:00 ~ 16:00
Venue
Online
Meeting Info
腾讯会议
  • ID: 353135191
  • Password:111457
  • Abstract
    As OpenAI introduces Sora, a generative text-to-video diffusion model, it opens the door to generation of high-definition videos at the minute level from text descriptions. This groundbreaking model not only showcases the vast potential of video generation but also captures the attention of researchers and enthusiasts alike. In this talk, we will delve into the evolutionary path of large-scale video generation models and explore key research milestones. We will then analyze the technical advancements and breakthrough effects achieved by the Sora model. However, while the success of Sora, there still exist limitations and bottlenecks in video generation. Concluding the talk, we will discuss the challenges faced in video generation and explore potential avenues for future breakthroughs.
    Bio
    Dr. Xinyuan Chen is currently a researcher at the Shanghai Artificial Intelligence Lab, collaborating closely with Prof. Yu Qiao. In 2020, she completed her dual PhD from Shanghai Jiao Tong University and the University of Technology Sydney, under the supervision of Prof. Xiaokang Yang and Prof. Dacheng Tao. Her research interests lie in generative models, diffusion models, and generative adversarial networks. Previously, she has focused her work on image and video generation, large-scale video generation models, as well as controllable generation incorporating multi-modality and semantic conditions.
    Sponsor
  • Institute of Natural Sciences, Shanghai Jiao Tong University
  • Shanghai National Center for Applied Mathematics (SJTU Center)