Close Menu
    Facebook LinkedIn YouTube Instagram X (Twitter)
    Blue Tech Wave Media
    Facebook LinkedIn YouTube Instagram X (Twitter)
    • Home
    • Leadership Alliance
    • Exclusives
    • Internet Governance
      • Regulation
      • Governance Bodies
      • Emerging Tech
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profiles
      • Startups
      • Tech Titans
      • Partner Content
    • Others
      • Fintech
        • Blockchain
        • Payments
        • Regulation
      • Tech Trends
        • AI
        • AR/VR
        • IoT
      • Video / Podcast
    Blue Tech Wave Media
    Home » 5 of Fatih Porikli’s most important thoughts on Gen AI
    Fatih-Porkili
    AI

    5 of Fatih Porikli’s most important thoughts on Gen AI

    By Audrey HuangJune 13, 2024Updated:June 18, 2024No Comments3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email
    • Fatih Porikli, an IEEE Fellow and the Global Lead of AI Systems at Qualcomm AI Research, recently spoke on The TWIML AI Podcast about his thoughts on generative AI and traditional computer vision topics.
    • Ongoing efforts in enhancing optical flow algorithms, with techniques like speculative decoding and self-cleaning inversion.
    • Rising use of stereo imaging in XR headsets and autonomous vehicles drives the need for efficient compression techniques. Innovations like parallel hypercoding reduce redundancy while ensuring minimal latency in stereo imaging applications.

    OUR TAKE
    With the requirements of AI skyrocketed, answering textual questions can no longer satisfy users’ needs. Therefore, the updated AI model is built to have a wider range of functions, including analysing mathematical plots.
    –Audrey Huang, BTW reporter

    Fatih Porikli, an IEEE Fellow and the Global Lead of AI Systems at Qualcomm AI Research, recently spoke on The TWIML AI Podcast about his thoughts on generative AI and traditional computer vision topics. There are 5 important ideas for his thoughts.

    1. Multimodal model advancements

    The discussions highlighted significant advancements in multimodal models, particularly those integrating language and image processing. These models aim to interpret complex data, such as mathematical plots, by leveraging information from multiple modalities. This represents a crucial step towards developing AI systems capable of understanding diverse types of inputs and performing complex reasoning tasks.

    Also read: OpenAI thwarts 5 covert influence operations using AI models

    Also read: AI lies: Should we worry about deceptive AI models?

    2. Optical flow optimisation

    Researchers are actively working on enhancing optical flow algorithms, which are essential for tasks like video compression and motion analysis. Techniques such as speculative decoding and self-cleaning inversion aim to improve the accuracy and efficiency of optical flow, enabling real-time processing on devices like mobile phones. These advancements address the increasing demand for high-quality video processing across various applications.

    3. Efficient compression techniques for stereo imaging

    With the rising use of stereo imaging in devices like XR headsets and autonomous vehicles, efficient compression of stereo streams is becoming crucial. Novel approaches like parallel hypercoding and bidirectional shift modules enable stereo-aware compression, reducing redundancy and achieving significant bitrate savings while minimising latency. These techniques pave the way for more effective data transmission and storage in stereo imaging applications.

    4. On-device AI demos

    Demonstrations showcased practical applications of AI on mobile devices, ranging from portrait relighting and avatar generation to AI assistants with AR face recognition. These demos highlight the potential for on-device AI to enhance user experiences across various domains, including photography, communication, and augmented reality. By running AI algorithms directly on mobile devices, users can access advanced functionalities without relying on cloud-based processing, leading to faster and more seamless interactions.

    5. Insights from workshops

    The workshops on Efficient Large Vision Models and Omnidirectional Computer Vision provided valuable insights into emerging trends and challenges in vision model development. They emphasised the importance of efficient deployment of large models on edge devices and addressed unique considerations for processing omnidirectional imagery. These workshops serve as platforms for collaboration and knowledge sharing among researchers and industry professionals, driving advancements in vision model research and application.

    Generative AI Multimodal models Optical flow
    Audrey Huang

    Audrey Huang is an intern news reporter at Blue Tech Wave. She is interested in AI and startup stories. Send tips to a.huang@btw.media.

    Related Posts

    Orange Business: Unveils defence division

    July 11, 2025

    AFRINET SA: Expands digital services in the DRC

    July 10, 2025

    Vodafone and Digital Realty launch subsea hub in Crete

    July 10, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    CATEGORIES
    Archives
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • August 2023
    • July 2023

    Blue Tech Wave (BTW.Media) is a future-facing tech media brand delivering sharp insights, trendspotting, and bold storytelling across digital, social, and video. We translate complexity into clarity—so you’re always ahead of the curve.

    BTW
    • About BTW
    • Contact Us
    • Join Our Team
    TERMS
    • Privacy Policy
    • Cookie Policy
    • Terms of Use
    Facebook X (Twitter) Instagram YouTube LinkedIn

    Type above and press Enter to search. Press Esc to cancel.