Trends
Interview with Du Junping, founder and CEO of Datastrato: Driving innovation in data and AI

Context
Du Junping, founder & CEO of Datastrato, director of LF AI & DATA, and ASF member, has been deeply involved in the AI and Data open-source fields for over a decade. He has served as the general manager of Open Source Business for a Fortune 500 company and as its head of Data Business and chief architect, and is an expert in big data technology and the open-source field. He has been the chair of the TOC (Technical Oversight Committee) at the OpenAtom Open Source Foundation, a member of the Apache Software Foundation, and a committer and PMC member for projects such as Apache Hadoop and Submarine. He has also served as a mentor for projects like Apache YuniKorn and TubeMQ. He has held positions such as chairman of Tencent’s Open Source alliance and director of Big Data platform R&D at Hortonworks, where he led the Hadoop YARN team.
“How to manage the unstructured data for better usage for larger models is definitely a top challenge today in the AI domain.”
Analysis
In a recent interview, Du Junping, founder and CEO of Datastrato, highlighted the pivotal role of open-source technologies in advancing AI and data applications. He emphasised, “I definitely trust the open-source community to the scaling law for engineering resources and technology values.” This trust is rooted in the belief that open-source frameworks can significantly accelerate innovation and collaboration across the tech industry. Du Junping also described managing unstructured data as an area where open-source technologies are crucial: “How to manage the unstructured data for better usage for larger models is definitely a top challenge today in the AI domain.” This perspective underscores the need for robust open-source tools to handle the growing complexity of data in AI applications. He further pointed to the transformative impact of generative AI, noting, “We see the more magic between the data and AI, they combine more tightly.” This tighter coupling of data and AI is driving advances in model capabilities, making open-source contributions even more valuable.
Key Points
- Datastrato, led by Du Junping, is based in the U.S. and specialises in data infrastructure for AI.
- The company focuses on improving data management to support advanced AI technologies.
- Datastrato is building a data centre designed to handle both structured and unstructured data for AI applications.





