Alibaba Cloud Open Sources Advanced AI Video Models and Plans Major Investment


Alibaba Cloud has open-sourced its Tongyi Wanxiang 2.1 family of video foundation models, giving businesses and researchers direct access to its AI video generation technology. The family includes a 14-billion-parameter version and a 1.3-billion-parameter version, both designed to produce highly realistic videos from text and image inputs. The models are available through ModelScope, Alibaba Cloud’s AI model community, and through the popular platform Hugging Face.

Introducing Tongyi Wanxiang 2.1’s Capabilities

One of the most striking features of the Wanxiang 2.1 family is its bilingual support: the models can render text effects in both Chinese and English, which broadens their usefulness for global applications. Their realism comes from handling complex movements, improving pixel quality, adhering to physical principles, and following instructions more precisely. These strengths have taken Wanxiang 2.1 to the top of the VBench leaderboard for video generative models, where it is the only open-source model among the top five on Hugging Face’s leaderboard.

The 14B and 1.3B models target different needs and computational budgets. The 14B model produces the highest-quality visuals, while the 1.3B model balances generation quality against computational efficiency. For example, generating a five-second 480p video on a standard laptop takes only about four minutes with the 1.3B model. By open-sourcing these models, Alibaba Cloud aims to lower the barriers for businesses that want to use AI, making high-quality visual content creation more attainable and cost-effective.
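To put the two model sizes in perspective, here is a back-of-the-envelope sketch in Python. It rests on an assumption that does not come from the article: that weights are stored at 16-bit precision (2 bytes per parameter). It counts only the weights, so real memory use would be higher once activations and supporting components are included. The function names are illustrative only.

```python
# Rough weight-memory estimate for the two Wanxiang 2.1 model sizes.
# Assumption (not from the article): 16-bit weights, i.e. 2 bytes per parameter.
# Activations and supporting components need extra memory not counted here.

BYTES_PER_PARAM_16BIT = 2  # assumed precision

MODEL_PARAMS = {
    "14B": 14_000_000_000,
    "1.3B": 1_300_000_000,
}


def weight_memory_gb(num_params: int, bytes_per_param: int = BYTES_PER_PARAM_16BIT) -> float:
    """Return the memory needed to hold the weights, in gigabytes (10**9 bytes)."""
    return num_params * bytes_per_param / 1e9


def compute_seconds_per_video_second(generation_minutes: float, clip_seconds: float) -> float:
    """Return how many seconds of compute are spent per second of generated video."""
    return generation_minutes * 60 / clip_seconds


if __name__ == "__main__":
    for name, params in MODEL_PARAMS.items():
        print(f"{name}: ~{weight_memory_gb(params):.1f} GB of weights")
    # Figures from the article: a five-second 480p clip in about four minutes.
    ratio = compute_seconds_per_video_second(4, 5)
    print(f"1.3B example: ~{ratio:.0f} s of compute per second of video")
```

Under that assumption the 14B weights alone come to roughly 28 GB, against about 2.6 GB for the 1.3B model, which is consistent with the smaller model being the one pitched at laptop-class hardware.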

Expansion Beyond Wanxiang with Qwen Models

In addition to the Wanxiang 2.1 family, Alibaba Cloud has also made its Qwen foundation models available as open source. These models have garnered high rankings on the Hugging Face Open LLM leaderboards, showcasing performance that is comparable to other leading models globally. The Qwen models have seen widespread adoption, with more than 100,000 derivative models built on Qwen hosted on Hugging Face, underscoring their significant impact and utility.

Alibaba Cloud is not merely providing these advanced models but also supporting enterprises through its AI Model Studio. This platform allows large enterprises to access these foundation models with tools designed for model training and deployment within controlled environments. The AI Model Studio also assists in responsibly monitoring and managing content, creating training datasets, and customizing model training. These capabilities ensure robust risk management and model integrity, enabling businesses to confidently integrate advanced AI models into their operations.

Substantial Investment in AI and Cloud Computing

By releasing the Tongyi Wanxiang 2.1 models freely through ModelScope and Hugging Face, Alibaba Cloud is giving businesses, researchers, and developers direct access to tools for generating realistic video from text and image inputs. The move is expected to drive new development and creativity in the AI video production landscape.
