Alibaba’s Tora: A Trajectory-Controlled DiT Video Generation Model

Follow our official account to discover the beauty of CV technology This paper shares Tora: Trajectory-oriented Diffusion Transformer for Video Generation, where Alibaba proposes the trajectory-controlled DiT video generation model Tora. Paper link: https://arxiv.org/abs/2407.21705 Project link: https://ali-videoai.github.io/tora_video/ Background Video generation models have recently made significant progress. For example, OpenAI’s Sora and domestic models like Vidu … Read more

8 AI Video Generation Products Tested: Who Will Become China’s Sora?

8 AI Video Generation Products Tested: Who Will Become China's Sora?

“Huaxia Climate” welcomesbusiness/manuscript/advertisementcooperation Image|freeflo.ai ©Zixiang Original Author丨Luo Ji、Su Yi Editor丨Cheng Xin At the start of 2024, nothing in the tech circle is more exciting than the emergence of Sora. Just like the LLM entrepreneurship wave brought by ChatGPT in early 2023, the release of Sora has similarly pushed video generation models to the forefront. Tech … Read more

ByteDance Introduces OmniHuman-1: High-Fidelity Human Video Generation with Audio-Driven Pose

ByteDance Introduces OmniHuman-1: High-Fidelity Human Video Generation with Audio-Driven Pose

Click BelowCard to Follow “AI-Generated Future“ Today’s Paper Recommendation Paper Title: OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper Link: https://arxiv.org/pdf/2502.01061 Open Source Code: https://omnihuman-lab.github.io/ Introduction Since the emergence of video diffusion models based on diffusion transformers (DiT), significant progress has been made in the field of general video generation, including text-to-video … Read more

OmniHuman: Generate Videos From Images and Audio

OmniHuman: Generate Videos From Images and Audio

Recently, I saw that ByteDance released a paper on video generation: OmniHuman-1. OmniHuman, a framework based on diffusion Transformer, expands data by mixing motion-related conditions into the training phase. The model is powerful and can generate videos from just one image and a segment of audio. OmniHuman supports various visual and audio styles. It can … Read more

Byte’s New Product OmniHuman: High-Quality Human Video Generation

Byte's New Product OmniHuman: High-Quality Human Video Generation

Today’s Thoughts Today is February 6, 2025, let’s take a look at Byte’s newly promoted OmniHuman. I saw on X that Byte announced a new product related to AI, which allows a single image to be transformed into speaking, singing, and other actions and expressions through audio or video input. After seeing the examples on … Read more

OmniHuman: A New End-to-End Multimodal Digital Human Driving Method

OmniHuman: A New End-to-End Multimodal Digital Human Driving Method

In recent years, end-to-end portrait animation technologies (such as audio-driven speaker generation) have made significant progress. However, existing methods still struggle to scale as broadly as general video generation models, which limits their practical applications. To address these issues, ByteDance has proposed OmniHuman— a portrait video generation framework based on Diffusion Transformer (Diffusion Transformer). OmniHuman … Read more

Byte’s OmniHuman-1: Generating Realistic Human Videos from Single Images

Byte's OmniHuman-1: Generating Realistic Human Videos from Single Images

OmniHuman-1 is an end-to-end multimodal conditional human video generation framework proposed by ByteDance, capable of generating realistic human videos based on a single human image and motion signals (such as audio, video, or a combination of both). Currently, OmniHuman-1 does not provide a public API or download channel, only a paper. Diverse Video Generation Capabilities … Read more

Development and Application of AI Video Generation Models

Development and Application of AI Video Generation Models

GoogleAIvideo generation modelVeo can create videos longer than 60 seconds Tsinghua University and Shengshu Technology jointly released the domestic video large model Vidu Domestic AI film production platform FilmAction has entered the internal testing phase 【Highlight】 With the advancement and rapid iteration of AI technology, AI video generation models are continuously improving in terms of … Read more

Four Domestic Sora AI Video Generators Reviewed

Four Domestic Sora AI Video Generators Reviewed

Source: Quantum Bit | Public Account QbitAI Folks, let me tell you about this domestic Sora. In just the month of July, its “growth rate” has been nothing short of astonishing— KeLing, PixVerse V2, QingYing, Vidu…… Faced with a plethora of AI video generation software, I believe you share my sentiments: After some reflection, an … Read more

Enhanced Control in AI-Generated Video Services

Enhanced Control in AI-Generated Video Services

Enhanced Control in AIGC Applications The domestic AI video generation model Vidu enhances subject consistency based on reference functionality. The AI video generation model Dream Machine 1.6 adds camera motion control features. Research Progress on AIGC Technology Control Beihang University and Aishi Technology jointly released a flexible and efficient controllable video generation method called TrackGo. … Read more