Loading...
Loading...
We've all seen a lot of AI-generated videos floating across social media. Most are dead internet theory at worst, cute at best. Of course, there are some that are scary good, but until recently, they were resource-intensive to develop, in terms of time, tokens, or even the hardware required.
Google has decided to ch
let a = [1, 2, 3];
let b = a; // [1, 2, 3]
// now let's change the first index of b
b[0] = 9; // [9, 2, 3]
console.log(a) // => [9, 2, 3] but how ???
/*
* we have changed the array b and we notice
* that array a has changed too
*/
let x = []
let y = []
let z = y;
console.log(x == y) // false
console.log(y == z) // true
// read the article and you will understand the reason
// behind that weird results
ange that. It's been launching AI offerings for every major category (think Firebase Studio for vibe coding), and Veo is its answer to AI-generated videos. And it's impressive.
Meet your new AI teammatesTry Zapier Agents
Interestingly—in a side-eye kind of way—some of its training data is from YouTube. Google hasn't necessarily elaborated on what that means precisely, but do with that what you will.
In any case, Veo 3 seems to be a major breakthrough: you can now include audio generation alongside video generation, in a way that doesn't look like your videos are defying the laws of physics.
After spending time testing it myself, I can say it's a significant leap forward—though it still has plenty of quirks as this technology finds its footing.
Table of contents:
Google Veo is a family of AI video generation models that can create videos from text prompts or from static images. The latest model, Veo 3, includes native audio generation alongside video (the previous model, Veo 2, produced silent clips).
That native audio, real-world physics simulation, and advanced prompt understanding are what make Veo 3 stand out from other AI video generators. In comparison, other AI video generators, like Sora and Runway, don't have native audio functionality (yet).
Veo 3 is really impressive—folks are already using it to overhaul their marketing strategies. Before we dive into how it works and what it can do, here's a quick glance at what it does well and where it still needs some love.
Google Veo pros:
Native audio and video generation with natural-sounding speech and background noise or music
Realistic physics simulation for elements like water, fabric, and light
Excellent cinematic camera controls and scene composition
Advanced prompt understanding, especially for interaction cues
Multiple input options (text, image, frames)
Integrated in Flow and Gemini, with an intuitive interface (especially in Flow)
Constantly improving and already ahead of competitors like Runway or Sora
Google Veo cons: