Google researchers unveil ‘VLOGGER’, an AI that can animate a single still photos to allow them to talk
Described in a research paper titled “VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis,” the AI model can take a photo of a person and an audio clip as input, and then output a video that matches the audio, showing the person speaking the words and making corresponding facial expressions, head movements and hand gestures. The Read more about Google researchers unveil ‘VLOGGER’, an AI that can animate a single still photos to allow them to talk[…]