Blue Tech Wave Media

MM1: Apple’s first multimodal AI model

By Tilly Lu | March 31, 2024 | Updated: April 1, 2024
  • Rivalling Google’s Gemini: with up to 30 billion parameters, MM1 competes with the earliest versions of Google’s Gemini models.
  • In-context learning: MM1 can understand and respond to new queries using the context of the current conversation.

Apple has revealed MM1, a family of multimodal models that can interpret and work with both images and text, which could pave the way for a more intuitive and responsive Siri and iMessage experience.

MM1: pioneering multimodal AI

MM1 is a suite of multimodal AI models that process both images and text. The models have up to 30 billion parameters, making them competitive with the earliest versions of Google’s Gemini models.


The MM1 models can interpret and carry out instructions that combine visual and textual elements. For instance, the AI can work out the combined cost of two beverages by reading the prices shown on a menu.

One of MM1’s standout features is in-context learning: the model can answer a query using information already present in the current conversation, without being retrained or fine-tuned for each new query or task.

This capability could let the model generate detailed descriptions of images or answer questions about photo-based prompts, even if it has not previously seen similar content.
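
As a rough illustration of what in-context learning looks like in practice, the sketch below assembles a few-shot prompt: a couple of worked image-and-text examples are placed ahead of a new question, so a model could infer the task from context alone. This is not Apple’s API, which has not been published; the `Image` class, the `build_few_shot_prompt` helper, the file names and the prices are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass
class Image:
    """Hypothetical stand-in for an image input; it only stores a file path."""
    path: str


# A prompt is an interleaved sequence of text strings and images.
Segment = Union[str, Image]


def build_few_shot_prompt(
    examples: List[Tuple[Image, str, str]],
    query_image: Image,
    question: str,
) -> List[Segment]:
    """Place worked (image, question, answer) examples ahead of the new query."""
    prompt: List[Segment] = []
    for image, q, a in examples:
        prompt += [image, f"Q: {q}\nA: {a}\n"]
    # The final answer is left open for the model to complete.
    prompt += [query_image, f"Q: {question}\nA:"]
    return prompt


# Invented example data: file names and prices are made up.
examples = [
    (Image("menu_a.jpg"), "What do two coffees cost?", "2 x 3.00 = 6.00"),
    (Image("menu_b.jpg"), "What do a tea and a juice cost?", "2.50 + 4.00 = 6.50"),
]
prompt = build_few_shot_prompt(
    examples, Image("menu_c.jpg"), "What do two beers cost?"
)
```

No model weights change here: the examples sit in the prompt itself, which is the sense in which the model learns from context rather than from retraining.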


Enhancing user experience

Apple could use MM1’s multimodal comprehension to improve its voice assistant, Siri, allowing it to answer questions grounded in visual data such as images. MM1 could also help interpret the context of images and text messages shared via iMessage, giving users more relevant reply suggestions.

Tilly Lu

Tilly Lu is an intern reporter at BTW Media covering fintech and blockchain. She is studying Broadcasting and Hosting at Sanming University. Send tips to t.lu@btw.media.

BTW.MEDIA is proudly owned by LARUS Ltd.