ElevenLabs now does images and video too, it has become a full media generation platform
Most people know ElevenLabs as the voice cloning and text-to-speech tool. That is still there and still excellent. But they have added image and video generation to the platform and I do not think enough people know about this yet.
Image generation is built in using models including Flux 1 Context Pro. You can select aspect ratios and use style references. Video generation uses models like Veo 3.1, Sora 2 and Kling 2.5 directly from the same account. You can generate from a text prompt or take an upscaled image and use it as a starting frame for the video, which gives you much more control over the result.
The ElevenLabs Studio is where it all comes together. You can combine video clips, add synced voiceovers using professional voice clones, layer in background music and use lip-sync features. Everything is in one place rather than bouncing between five different tools.