Close Menu
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulations
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profile
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulations
    • Tech Trends
      • AI
      • AR / VR
      • IoT
    • Video / Podcast
  • Country News
    • Africa
    • Asia Pacific
    • North America
    • Lat Am/Caribbean
    • Europe/Middle East
Facebook LinkedIn YouTube Instagram X (Twitter)
Blue Tech Wave Media
Facebook LinkedIn YouTube Instagram X (Twitter)
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulation
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profiles
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulation
    • Tech Trends
      • AI
      • AR/VR
      • IoT
    • Video / Podcast
  • Africa
  • Asia-Pacific
  • North America
  • Lat Am/Caribbean
  • Europe/Middle East
Blue Tech Wave Media
Home » Google’s Gemini 1.5 Pro can now hear
Gemini
Gemini
AI

Google’s Gemini 1.5 Pro can now hear

By Jennifer YuApril 10, 2024No Comments2 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email
  • Google’s update to Gemini 1.5 Pro gives the model an ear.
  • The model can now listen to uploaded audio files and generate information from content such as earnings calls or video audio without having to refer to a written transcript.
  • Google is also making Gemini 1.5 Pro available as a public preview to those with access to Vertex AI

Capable of processing text, code, video, and now uploaded audio streams, including audio from video, Gemini 1.5 Pro can listen, analyse and extract information without a corresponding written record.

Gemini 1.5 Pro

Gemini is Google’s rebranded bot, previously called Bard, and Gemini 1.5 Pro is the latest iteration of the model made available to a limited number of developers in February of this year.

Google also announced it’ll make Gemini 1.5 Pro available to the public for the first time through its platform to build AI applications, Vertex AI. Gemini 1.5 Pro was first announced in February.

Google shared details of the update at its Cloud Next conference in Las Vegas. After calling Gemini Ultra LLM, which powers its Gemini advanced chatbot, the most powerful model in the Gemini family, Google is now calling Gemini 1.5 Pro its most powerful generative model.

The company adds that this version has better learning capabilities and requires no additional tweaking of the model.

Gemini 1.5 Pro is not available to users who do not have access to Vertex AI.

Also read: OpenAI voice-clone tool mimics your voice with 15-second sample

Text-to-image generation model Imagen 2

Gemini 1.5 Pro isn’t the only large AI model to get an update from Google.

Imagen 2 is a text-to-image generation model that will help enhance Gemini’s image generation capabilities, and will also add fixes and repairs that will allow users to add or remove elements from an image.

Many of the new features of Imagen, especially in painting and outpainting, have been part of other text-to-image models like Stability AI’s Stable Cascade and Getty’s Generative AI by iStock, not to mention wider consumer availability on newer Samsung Galaxy phones.

Gemini 1.5 Pro Google
Jennifer Yu

Jennifer Yu is a reporter at BTW Media covering artificial intelligence and products. She graduated from The University of Hong Kong. Send tips to j.yu@btw.media.

Related Posts

Deutsche Telekom and Schwarz Group in advanced talks to build joint ‘AI gigafactory’ data centre

December 2, 2025

Cisco and Asiacell join forces to bring AI-driven network assurance to Iraq

December 2, 2025

Google drops EU complaint as cloud probe expands

December 1, 2025
Add A Comment
Leave A Reply Cancel Reply

CATEGORIES
Archives
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023

Blue Tech Wave (BTW.Media) is a future-facing tech media brand delivering sharp insights, trendspotting, and bold storytelling across digital, social, and video. We translate complexity into clarity—so you’re always ahead of the curve.

BTW
  • About BTW
  • Contact Us
  • Join Our Team
  • About AFRINIC
  • History of the Internet
TERMS
  • Privacy Policy
  • Cookie Policy
  • Terms of Use
Facebook X (Twitter) Instagram YouTube LinkedIn
BTW.MEDIA is proudly owned by LARUS Ltd.

Type above and press Enter to search. Press Esc to cancel.