A Web3 First: China Leverages Cross-Border AI Data For Industrial Use

Source: Forbes

In a world-first, China’s Shenzhen Data Exchange (SDEx) facilitates a deal that brings decentralized, community-sourced AI data into real industrial applications through Web3 infrastructure. SDEx is the largest provider of national-level data trading platforms for data marketization and cross-border circulation in China’s digital economy. The platform provides a comprehensive suite of services, including compliance protection, circulation support, supply and demand connection, and ecological development, enabling businesses and consumers to trade data efficiently.

In my previous articles, I emphasized that data will inevitably become the next key battleground frontier in AI’s global race. This article delves into how SDEx has taken a significant leap forward in cross-border data collection through a commercially viable model powered by Decentralized AI (DeAI).

The Emerging AI Data Bottleneck

First, let’s reiterate the looming bottleneck facing the global AI industry: data scarcity. As industries and companies increasingly rely on AI models for innovation, the demand for high-quality training data will skyrocket. This challenge cuts across sectors:

  • Healthcare: AI-driven diagnostics require vast datasets of medical images and patient records to identify diseases accurately.
  • Autonomous Driving: Self-driving cars need millions of miles of diverse driving data to navigate complex real-world scenarios safely.
  • Financial Modeling: AI algorithms used in fraud detection or market prediction depend on large quantities of transactional data.
  • Smart Manufacturing: High-resolution images of equipment, materials, and defect patterns are vital for computer vision models in automation and quality control.

The fundamental question then becomes: where can we source this immense volume of data at scale? Traditional centralized data collection methods encounter significant limitations:

  • Geopolitical boundaries and regulatory constraints make cross-border data sharing difficult.
  • Data privacy laws such as GDPR and China’s PIPL impose strict data collection and use controls.
  • Lack of diversity in centralized datasets can result in biased AI models.
  • Access inequality, where only major tech companies control high-value data lakes, creates a walled-garden effect.

SDEx’s Pragmatic Breakthrough Using DeAI

While the chipset race dominates headlines, a quieter but equally crucial data war is underway. Recently, the SDEx facilitated a commercial deal between Shenzhen Intellifusion Technologies, a publicly listed Chinese AI company, and OORT, a decentralized AI solution provider.

Intellifusion has been developing industry-specific AI solutions to enhance its smart factory capabilities. Specifically, they needed industrial datasets, including images of professional respiratory masks and confined-space ventilation ducts, among other things. OORT enabled the collection of this data through its product solution, OORT DataHub. It achieved this by distributing data collection tasks to its global community, spanning over 130 countries. Participants could contribute their data and earn crypto incentives, a feat unattainable through traditional banking or Web2 platforms. This deal marks the realization of the first commercially viable model for truly decentralized, global data collection, a significant advancement in cross-border data services.

While established platforms like Amazon’s AWS Data Exchange (ADX) exist, they possess limitations that hinder the next phase of AI’s global advancement:

  • Limited access to open, community-contributed data: ADX primarily functions as a B2B marketplace dominated by commercial enterprises. This excludes valuable localized and citizen-generated data from developing regions, academic research, and open-source communities. For example, local, citizen-science-based water quality data collected in rural India or African agricultural data collected by farmers could be very valuable for AI training.
  • Cross-border data compliance challenges: Transborder data transfers are restricted, particularly in jurisdictions with stringent data localization laws, such as China, India, and the European Union.
  • Centralized access model: Access is contingent on an AWS account and Amazon’s infrastructure and policies. It lacks built-in data ownership validation or self-sovereign identity, forcing publishers to rely on AWS for subscription and billing management.
  • Fragmented global representation: Data providers are predominantly based in the United States and Europe, leading to the underrepresentation of SMEs and researchers in Africa, Latin America, and Southeast Asia, as well as Indigenous data holders and sources of real-time community or device-generated datasets, such as IoT data from rural areas.
  • Interoperability limitations: While ADX integrates well within the AWS ecosystem, it lacks open interoperability with other cloud providers and Web3 tools, hindering integration with Google Cloud, IPFS/Filecoin, decentralized compute layers and blockchain-native applications.

DeAI Taking Shape: From Hype to Practice

Reflecting this context, the DeAI space has recently made remarkable strides toward building a more open AI future amidst growing concerns over the centralized model’s dominance by a few major players. Notably, two DeAI alliances emerged on the same day.

First, HumanAIx, founded by 13 Web3 entities, including OORT, YGG, NEO, and io.net, introduced an open protocol designed to connect partners seamlessly. Each participant contributes essential components—validation, storage, computing, and data—to establish a permissionless, scalable, and verifiable decentralized AI infrastructure. Its three-layer architecture—interface, protocol (integrating compute, storage, and data), and security—leverages industry expertise to foster an open environment for the future of DeAI.

Simultaneously, another coalition of Web3 leaders, including NEAR, Aethir, and Coinbase, formed the Open Agents Alliance (OAA), which aims to ensure secure, open-source, economical, and fair AI access.

Despite crypto’s recent bearish turn and the AI sector’s vulnerability to hype and inflated narratives, it’s promising to see serious industry players working on potentially profound and sustainable solutions. Only projects with viable business models will endure. SDEx has taken a significant step by embracing decentralized data collection, a move that signals a broader shift with global implications. This development suggests an important shift is underway, prompting industry participants to reconsider how they gather, verify, and manage data for AI development.

Leave a comment

Your email address will not be published. Required fields are marked *