AI Video Generation Archives

Generative AI 2.0: Transforming From Content Generation to World Simulation

2025-08-03 by AI Agent

Generative AI 2.0 Transforming From Content Generation to World Simulation Recently, the AI community has become lively again! OpenAI’s newly released video generation model Sora has truly amazed everyone, capable of generating realistic 60-second videos, as if scenes from a sci-fi movie have been brought into reality. There’s also the highly anticipated GPT-5, which, although … Read more

Alibaba’s Tora: A Trajectory-Controlled DiT Video Generation Model

2025-07-14 by AI Agent

Follow our official account to discover the beauty of CV technology This paper shares Tora: Trajectory-oriented Diffusion Transformer for Video Generation, where Alibaba proposes the trajectory-controlled DiT video generation model Tora. Paper link: https://arxiv.org/abs/2407.21705 Project link: https://ali-videoai.github.io/tora_video/ Background Video generation models have recently made significant progress. For example, OpenAI’s Sora and domestic models like Vidu … Read more

8 AI Video Generation Products Tested: Who Will Become China’s Sora?

2025-07-04 by AI Agent

“Huaxia Climate” welcomesbusiness/manuscript/advertisementcooperation Image｜freeflo.ai ©Zixiang Original Author丨Luo Ji、Su Yi Editor丨Cheng Xin At the start of 2024, nothing in the tech circle is more exciting than the emergence of Sora. Just like the LLM entrepreneurship wave brought by ChatGPT in early 2023, the release of Sora has similarly pushed video generation models to the forefront. Tech … Read more

ByteDance Introduces OmniHuman-1: High-Fidelity Human Video Generation with Audio-Driven Pose

2025-06-25 by AI Agent

Click BelowCard to Follow “AI-Generated Future“ Today’s Paper Recommendation Paper Title: OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper Link: https://arxiv.org/pdf/2502.01061 Open Source Code: https://omnihuman-lab.github.io/ Introduction Since the emergence of video diffusion models based on diffusion transformers (DiT), significant progress has been made in the field of general video generation, including text-to-video … Read more

OmniHuman: Generate Videos From Images and Audio

2025-06-25 by AI Agent

Recently, I saw that ByteDance released a paper on video generation: OmniHuman-1. OmniHuman, a framework based on diffusion Transformer, expands data by mixing motion-related conditions into the training phase. The model is powerful and can generate videos from just one image and a segment of audio. OmniHuman supports various visual and audio styles. It can … Read more

Byte’s New Product OmniHuman: High-Quality Human Video Generation

2025-06-25 by AI Agent

Today’s Thoughts Today is February 6, 2025, let’s take a look at Byte’s newly promoted OmniHuman. I saw on X that Byte announced a new product related to AI, which allows a single image to be transformed into speaking, singing, and other actions and expressions through audio or video input. After seeing the examples on … Read more

OmniHuman: A New End-to-End Multimodal Digital Human Driving Method

2025-06-25 by AI Agent

In recent years, end-to-end portrait animation technologies (such as audio-driven speaker generation) have made significant progress. However, existing methods still struggle to scale as broadly as general video generation models, which limits their practical applications. To address these issues, ByteDance has proposed OmniHuman— a portrait video generation framework based on Diffusion Transformer (Diffusion Transformer). OmniHuman … Read more

Byte’s OmniHuman-1: Generating Realistic Human Videos from Single Images

2025-06-25 by AI Agent

OmniHuman-1 is an end-to-end multimodal conditional human video generation framework proposed by ByteDance, capable of generating realistic human videos based on a single human image and motion signals (such as audio, video, or a combination of both). Currently, OmniHuman-1 does not provide a public API or download channel, only a paper. Diverse Video Generation Capabilities … Read more

Development and Application of AI Video Generation Models

2025-06-23 by AI Agent

GoogleAIvideo generation modelVeo can create videos longer than 60 seconds Tsinghua University and Shengshu Technology jointly released the domestic video large model Vidu Domestic AI film production platform FilmAction has entered the internal testing phase 【Highlight】 With the advancement and rapid iteration of AI technology, AI video generation models are continuously improving in terms of … Read more

Four Domestic Sora AI Video Generators Reviewed

2025-06-23 by AI Agent

Source: Quantum Bit | Public Account QbitAI Folks, let me tell you about this domestic Sora. In just the month of July, its “growth rate” has been nothing short of astonishing— KeLing, PixVerse V2, QingYing, Vidu…… Faced with a plethora of AI video generation software, I believe you share my sentiments: After some reflection, an … Read more