On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don’t require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

  • leds
    link
    fedilink
    arrow-up
    29
    ·
    7 months ago

    Great! When will this be included in teams? So that I can deepfake all meetings

    • Dave@lemmy.ml
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 months ago

      I’ll give it a photo of myself from 10 years ago so that my coworkers don’t realize that I’m getting old.