Close Menu
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulations
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profile
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulations
    • Tech Trends
      • AI
      • AR / VR
      • IoT
    • Video / Podcast
  • Country News
    • Africa
    • Asia Pacific
    • North America
    • Lat Am/Caribbean
    • Europe/Middle East
Facebook LinkedIn YouTube Instagram X (Twitter)
Blue Tech Wave Media
Facebook LinkedIn YouTube Instagram X (Twitter)
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulation
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profiles
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulation
    • Tech Trends
      • AI
      • AR/VR
      • IoT
    • Video / Podcast
  • Africa
  • Asia-Pacific
  • North America
  • Lat Am/Caribbean
  • Europe/Middle East
Blue Tech Wave Media
Home » Tech giants accused of using unauthorised YouTube transcripts to train AI models
July-17-AI-news
July-17-AI-news
AI

Tech giants accused of using unauthorised YouTube transcripts to train AI models

By Yasmine LuoJuly 17, 2024Updated:July 18, 2024No Comments3 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email
  • Some of the Tech giants allegedly used YouTube transcripts without permission to train AI models.
  • The legality of using unauthorised databases to train AI is undetermined, potentially hindering future AI development.

OUR TAKE
The development of AI technology is certainly promising, but its creation and advancement are built on databases. The lack of transparency in these databases is bound to cause controversy. The affected parties and the infringing companies often hold conflicting views, with no definitive resolution in sight. This situation is like a Damocles sword hanging over the industry; if not addressed, it will inevitably hinder the continuous development of AI.

— Yasmine luo, BTW reporter

What happened?

Some major tech companies are accused of using YouTube transcripts without authorization to train their AI models.

According to Proof News, EleutherAI, a nonprofit organisation, created a dataset containing transcripts from over 48,000 YouTube channels, including content from prominent creators like Marques Brownlee and MrBeast, as well as major publishers like The New York Times, the BBC, and ABC News. According to a new investigation by Proof News, Apple, NVIDIA, Anthropic, and other large tech companies used this dataset to train their AI models.

Neal Mohan, CEO of YouTube, has previously stated, “Companies using YouTube’s data to train AI models would violate the platform’s terms of service.”

Marques Brownlee, a famous YouTuber, posted on social media, “Apple has sourced data for their AI from several companies. One of them scraped tons of data/transcripts from YouTube videos, including mine. Apple technically avoids ‘fault’ here because they’re not the ones scraping. But this is going to be an evolving problem for a long time.”

Currently, Apple, NVIDIA, Anthropic, and EleutherAI have not commented on the matter.

Also read: Warburg-backed PDG eyes AI-driven data centre expansion in Asia

Also read: OpenAI’s ‘Strawberry’ project advances AI reasoning

Why it’s important

The rapid growth of AI models, while promising to shape the future, has also raised numerous unresolved legal questions. The recent accusations against tech giants add to these concerns. Since its inception, AI technology has grappled with the issue of non-transparent training databases. If AI training data is not appropriately sourced, there is a risk of copyright or database right infringement.

However, it remains undetermined whether the companies involved will face legal charges. The Verge conducted an investigation among lawyers, analysts, and employees at AI startups, revealing divided opinions on this issue.

“I see people on both sides of this extremely confident in their positions, but the reality is nobody knows,” says Baio, an AI observer.

Although the affected companies or individuals claim that it’s illegal, their demands are unlikely to be addressed, as evidenced by the lack of response from the accused companies.

If this issue remains unresolved, it may one day hinder the continuous development of AI technology.

AI AI training Tech giants
Yasmine Luo

Related Posts

UK government backs satellite innovation and AI start‑ups

November 24, 2025

Nokia restructures business to drive AI and network innovation

November 21, 2025

Google opens energy‑efficient AI data centre in Winschoten

November 20, 2025
Add A Comment
Leave A Reply Cancel Reply

CATEGORIES
Archives
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023

Blue Tech Wave (BTW.Media) is a future-facing tech media brand delivering sharp insights, trendspotting, and bold storytelling across digital, social, and video. We translate complexity into clarity—so you’re always ahead of the curve.

BTW
  • About BTW
  • Contact Us
  • Join Our Team
  • About AFRINIC
  • History of the Internet
TERMS
  • Privacy Policy
  • Cookie Policy
  • Terms of Use
Facebook X (Twitter) Instagram YouTube LinkedIn
BTW.MEDIA is proudly owned by LARUS Ltd.

Type above and press Enter to search. Press Esc to cancel.