    Blue Tech Wave Media

    MM1: Apple’s first multimodal AI model

    By Tilly Lu | March 31, 2024 | Updated: April 1, 2024
    • Rivalling Google’s Gemini: With up to 30 billion parameters, MM1 competes with the earliest versions of Google’s Gemini models.
    • In-context learning: MM1 can understand and respond to new queries based on the context of the current conversation.

    Apple has revealed MM1, a family of multimodal models that can interpret both images and text, which could set the stage for a more intuitive and responsive Siri and iMessage experience.

    MM1: pioneering multimodal AI

    The MM1 models process both images and text. They have up to 30 billion parameters, making them a match for the earliest versions of Google’s Gemini models.


    The MM1 models are equipped with the ability to interpret and execute instructions that involve both visual and textual elements. For instance, the AI can calculate the combined cost of two beverages by analysing the pricing information displayed on a menu.

    One of the standout features of MM1 is its capacity for in-context learning. The model can answer a query using the context already present in the conversation, without being retrained or fine-tuned for each new query or task.

    In-context learning could enable the model to generate detailed descriptions of images or answer questions about photo-based prompts, even if it has not previously seen similar content.
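
As a rough illustration of in-context learning, the sketch below assembles a prompt in which a few solved image-and-question examples precede a new, unanswered question. Everything in it is an assumption made for illustration: the `Example` class, the `build_few_shot_prompt` function, the `<image:...>` placeholder syntax and the sample menu data are hypothetical and do not reflect MM1's actual input format or any Apple API.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Example:
    """One solved demonstration (hypothetical structure)."""
    image_ref: str  # placeholder identifier standing in for an image
    question: str
    answer: str


def build_few_shot_prompt(examples: List[Example],
                          query_image: str,
                          query_question: str) -> str:
    """Interleave solved examples with a final unanswered query.

    The model's weights are never changed; the only "learning" is the
    context the prompt carries into the new query.
    """
    parts = [
        f"<image:{ex.image_ref}>\nQ: {ex.question}\nA: {ex.answer}"
        for ex in examples
    ]
    # The final block ends with an open "A:" for the model to complete.
    parts.append(f"<image:{query_image}>\nQ: {query_question}\nA:")
    return "\n\n".join(parts)


# Hypothetical demonstrations in the spirit of the menu-pricing example.
demos = [
    Example("menu_a", "What do two coffees cost?", "3 + 3 = 6"),
    Example("menu_b", "What do a tea and a juice cost?", "2 + 4 = 6"),
]
prompt = build_few_shot_prompt(demos, "menu_c", "What do two beers cost?")
```

The resulting string would be passed to a multimodal model along with the referenced images. The point of the sketch is only that the demonstrations live in the prompt rather than in the model's training.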


    Enhancing user experience

    Apple could use MM1’s multimodal comprehension to improve its voice assistant, Siri, allowing it to answer questions grounded in visual data such as images. MM1 could also help interpret the context of images and text messages shared via iMessage, giving users more relevant suggested replies.


    Tilly Lu is an intern reporter at BTW Media specialising in Fintech and Blockchain. She is studying Broadcasting and Hosting at Sanming University. Send tips to t.lu@btw.media.
