Ant Group Leverages Chinese Chips to Train Advanced AI Models

Article Highlights
Off On

Ant Group, a subsidiary of the Alibaba Group, has taken a bold step in utilizing domestically produced semiconductors to train their cutting-edge artificial intelligence (AI) models.This strategic pivot addresses both the escalating operational costs and the limitations posed by dependencies on restricted U.S. technology. The initiative highlights the growing importance of self-reliance in AI model development and the broader implications for the tech sector in China.

Leveraging Local Technology

Shift from Nvidia to Domestic Suppliers

Ant Group’s initiative primarily focuses on substituting high-performance GPUs from Nvidia with chips from Chinese companies, including its parent company Alibaba and Huawei Technologies.This shift is aligned with a broader trend among Chinese firms attempting to find cost-effective and innovative solutions amidst stringent U.S. technology export restrictions. By reducing dependence on foreign technology, Ant Group aims to cut operational costs significantly and enhance its technological self-sufficiency.

The reliance on domestically produced chips is more than a cost-saving exercise; it represents a strategic maneuver to navigate through the complexities imposed by international trade restrictions in technology. Amid these constraints, Ant Group has showcased resilience by enhancing its reliance on local semiconductor technologies to sustain its growth trajectory in AI capabilities. This endeavor is particularly notable given the global demand for high-performance GPUs traditionally dominated by American tech giants like Nvidia.

Mixture of Experts (MoE) Method

Central to Ant Group’s AI training strategy is the innovative Mixture of Experts (MoE) method, a technique that divides tasks into smaller data sets for processing by specialized components.This method promises to optimize the efficiency and performance of AI models by leveraging collective expertise segmented into manageable tasks. The approach has not only shown promising results within Ant Group’s development framework but also captured the interest of major tech giants such as Google.The implementation of the MoE method by Ant Group illustrates a forward-thinking approach in the AI landscape. It reflects a burgeoning trend within the industry to maximize computational efficiency and model effectiveness. This technique, when paired with domestically produced semiconductors, has demonstrated that high performance in AI model training can be achieved without reliance on high-cost, imported technology. This innovation is a testament to the potential for Chinese technology to compete on a global stage, notwithstanding the external limitations imposed by export restrictions.

Performance and Cost Efficiency

Model Performance

Ant Group’s MoE-based models have demonstrated performance levels that are comparable to those trained using the more expensive Nvidia H800 chips. This outcome showcases the potential of Chinese semiconductor technology to compete effectively in the realm of AI model training. By achieving such performance benchmarks, Ant Group exemplifies how domestic technology can maintain, if not enhance, the quality of AI models while adhering to cost-effective measures.

This parity in performance underscores the viability of domestic alternatives, not just as a substitute but as a competent competitor to Western technology in AI training. The performance metrics of these models are a significant breakthrough, indicating that advanced AI capabilities can be realized without exclusive reliance on imported high-end GPUs.As Ant Group continues to refine its AI models, the industry may witness a shift in the preference towards local technology over international counterparts.

Cost Reduction

A major outcome of Ant Group’s experimentation with domestic chips is the significant reduction in AI training costs. By utilizing local semiconductors and optimizing training methods, the company has managed to substantially cut expenses associated with the development and training of AI models. This is particularly noteworthy given the high costs historically linked to high-performance hardware necessary for large-scale AI operations.

According to their research, the cost of training one trillion tokens—the foundational data units for AI learning—with conventional high-performance hardware was about 6.35 million yuan (approximately $880,000).With their optimized training approach using lower-specification, yet domestically produced chips, they reduced this cost to approximately 5.1 million yuan. This substantial cost reduction not only validates the economic feasibility of using local technology but also sets a precedent for other tech firms aiming to balance cost and performance in AI development.

Real-World Applications

Healthcare and Finance

Ant Group plans to deploy its optimized AI models in practical applications across various sectors, particularly healthcare and finance. The company’s recent acquisition of Haodf.com, a Chinese online medical platform, is a strategic move to enhance its capability in deploying AI solutions for healthcare. This acquisition, coupled with existing AI services such as the virtual assistant app Zhixiaobao and the financial advisory platform Maxiaocai, reflects Ant Group’s ongoing commitment to leveraging AI for real-world problem-solving.These applications showcase the versatility and practical utility of AI models developed using domestically produced semiconductors. By integrating AI into healthcare, Ant Group aims to streamline medical processes, improve diagnostic accuracy, and enhance overall patient care. In finance, the introduction of AI-driven platforms promises to redefine financial advising and customer interaction, offering personalized and efficient services.This pragmatic approach demonstrates Ant Group’s strategic vision to utilize AI in addressing tangible, sector-specific challenges.

Open-Source Contributions

In an effort to foster a collaborative environment and accelerate innovation in the AI field, Ant Group has made its AI models open source. This significant contribution includes models with parameters reaching billions, such as Ling-Lite operating with 16.8 billion parameters and Ling-Plus with a remarkable 290 billion.Such an initiative opens avenues for further research and development, allowing the broader tech community to benefit from Ant Group’s advancements.

The open-sourcing of these models provides valuable resources for researchers and developers globally, driving collective progress in AI technology. This move aligns with a growing trend within the tech industry to embrace open-source models, thereby democratizing access to cutting-edge AI tools. By making these resources available, Ant Group is enhancing the potential for collaborative breakthroughs and fostering an ecosystem of shared knowledge and innovation.

Challenges and Future Prospects

Stability Issues

Despite notable progress, training AI models with domestic chips continues to pose certain challenges.Ant Group’s research highlights that modifications to hardware or model structures during the training process can sometimes result in unstable outcomes. This includes issues such as increased error rates which underscore the complexity involved in refining AI training processes. These challenges necessitate ongoing adjustments and consistent efforts to stabilize model performance.

Ensuring the stability and reliability of AI models is crucial for their practical deployment across industries.Ant Group’s experience demonstrates that while the switch to domestic technology has clear advantages, it also invites nuanced technical challenges that must be addressed. The commitment to overcome these obstacles reflects a broader dedication within the tech community to refine AI methods continually, striving for perfection and operational excellence.

Strategic Independence

Ant Group’s strategic decision to incorporate domestically produced semiconductors in training their state-of-the-art AI models aims to reduce rising operational expenses and mitigate challenges linked with dependence on restricted U.S. technology. By turning to locally made chips, Ant Group is preparing to navigate the complexities of international tech restrictions and high costs.

This initiative is a testament to the increasing importance of self-sufficiency in the development of AI models. It underscores a broader shift within China’s tech sector, emphasizing the need to rely on homegrown technologies to overcome external limitations. As China continues to bolster its tech industry, the move by Ant Group reflects a broader trend toward innovation and independence in the face of global tech constraints. This development could potentially influence other tech companies in China to follow suit, further solidifying the country’s position in the global tech landscape and reducing its reliance on foreign technology.

Explore more

How is Digitalization Revolutionizing Small Traders in Vietnam?

In Vietnam, digitalization has emerged as a transformative force reshaping the landscape for small traders and household businesses. The introduction of Government Decree No. 70/2025/ND-CP stands at the forefront of this digital wave, mandating that businesses in specific sectors earning over 1 billion VND annually adopt e-invoices integrated with cash registers. This change aligns with national efforts to formalize and

How Can You Master Math for Data Science?

Mastering mathematics is a pivotal requirement for delving into the expansive field of data science. The challenge lies in understanding which mathematical concepts are essential and how to effectively apply them to real-world situations. As data science increasingly influences decision-making across diverse sectors, the ability to interpret data accurately through a mathematical lens becomes indispensable. This demand is reflected in

How to Master Data Science Programming as a Beginner?

In 2025, the allure of data science remains as strong as ever, drawing countless individuals eager to dive into this transformative field. The potential to unlock real-world solutions through data is immense, yet for beginners, the journey to mastering data science programming can be daunting. With a plethora of algorithms, mathematical principles, and programming languages to choose from, it’s easy

Free CRM Features vs. Paid CRM Upgrades: A Comparative Analysis

In the rapidly evolving landscape of digital tools, small businesses face the compelling challenge of optimizing customer relationship management without excessive costs. With free CRM software solutions becoming increasingly sophisticated, entrepreneurs must decide whether to invest in paid upgrades. This comparative analysis explores free CRM features versus paid CRM upgrades by highlighting essential facets, allowing businesses to make informed choices.

ABM Emerges as Top Strategy in B2B Marketing Transformation

In the ever-evolving landscape of B2B marketing, strategies like account-based marketing (ABM) are reshaping how companies engage with high-value clients. Today, we delve into this transformative approach with Aisha Amaira, a leading expert in MarTech known for her ability to blend technology and marketing prowess. Aisha shares her insights on why ABM is rapidly gaining traction and the role artificial