TIMES OF TECH

Chinese AI Startup Launches Image-to-Video Tool to Rival OpenAI's Sora

Beijing-based AI startup Shengshu Technology has announced a new capability in its video generation tool, Vidu, designed to take on OpenAI’s forthcoming model, Sora. By allowing users to create videos from multiple images, Vidu aims to capture attention from advertisers, animators, and digital marketers. Shengshu’s image-to-video tool enables users to create complex, visually consistent scenes by combining distinct images into an 8-second video clip. This innovative functionality is currently available globally, and the company reports that it is already seeing substantial business demand and monetization for Vidu.

Shengshu’s Vidu tool stands out for its ability to produce visually cohesive videos. According to Shengshu’s Chief Technology Officer Fan Bao, maintaining “visual consistency” in AI-generated videos has been a primary focus from the start. The technology takes individual images, such as a person, a shirt, and a moped, and integrates them into a lifelike animation of the person wearing the shirt and riding the moped.

Vidu’s Functionality and Commercial Success

The newly enhanced Vidu tool lets users combine images into scenes with realistic transitions and animations. Initially released in April, Vidu went viral on platforms like TikTok with its ability to transform two profile photos into lifelike videos of people interacting. Now, Vidu is enhancing its offerings to support even more sophisticated animations based on both text and images.

Vidu is reported to be gaining traction among advertisers and animators, allowing Shengshu to generate significant revenue. CEO Jiayu Tang shared that Vidu’s monthly usage per customer ranges from 100,000 to 1 million yuan (approximately $13,871 to $138,711). This level of spending points to growing market demand for AI-driven content creation tools that save time and resources for businesses in creative industries.

A Response to OpenAI’s Sora

While OpenAI’s Sora model, announced earlier this year, is expected to generate one-minute videos solely from text prompts, it has yet to be publicly released. Shengshu, by contrast, has already launched Vidu globally, giving it an early lead in the AI-generated video market. Shengshu’s approach of blending images with text inputs to create short video clips distinguishes it from other platforms, which may lack the same level of visual consistency.

Shengshu’s rapid product development also benefits from major financial backing, including investments from Baidu Ventures, Alibaba affiliate Ant Group, and Qiming Venture Partners, as well as support from the city of Beijing. This funding, combined with Vidu’s technical capabilities, places Shengshu in a strong position to challenge both local rivals and international players like OpenAI.

Data Privacy and Copyright Compliance

In addressing copyright concerns, Shengshu has developed a system that enables companies to enter contracts allowing the AI to replicate an artist’s style legally, specifically for commercial use cases such as advertising. Additionally, Vidu adheres to stringent data privacy protocols. Jiayu Tang explained that Vidu does not permit the public to create videos using images of celebrities or other “sensitive” figures, and it bans the use of nudes or violent imagery. Personal photos uploaded to the platform are deleted in accordance with GDPR standards, ensuring compliance with global privacy benchmarks.

Shengshu’s Future in AI Video Generation

With its Vidu tool, Shengshu Technology demonstrates the potential of AI to transform how brands and creators produce dynamic video content. The tool’s ability to integrate multiple images into cohesive animated videos offers a new level of flexibility for advertisers and creative professionals looking to create immersive visual experiences. The monthly customer spending that Shengshu reports for Vidu suggests that demand for such technology is strong and that the product is commercially viable, setting a precedent for future AI-driven visual content tools.

Moreover, Shengshu’s international reach, facilitated by rented cloud servers both in China and abroad, positions the company to expand its user base across continents. As Shengshu continues to enhance Vidu’s capabilities, it is likely to solidify its position as a leader in AI-powered video content generation. The race between AI startups like Shengshu and major players such as OpenAI underscores the rapid evolution of AI technology and its growing role in content creation.

Shengshu’s Vidu tool exemplifies the transformative potential of AI in digital media. As AI models become increasingly sophisticated, they are shaping the future of video creation, enabling artists, advertisers, and developers to produce more captivating and customized content than ever before.

Source: CNBC
