Close Menu
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulations
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profile
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulations
    • Tech Trends
      • AI
      • AR / VR
      • IoT
    • Video / Podcast
  • Country News
    • Africa
    • Asia Pacific
    • North America
    • Lat Am/Caribbean
    • Europe/Middle East
Facebook LinkedIn YouTube Instagram X (Twitter)
Blue Tech Wave Media
Facebook LinkedIn YouTube Instagram X (Twitter)
  • Home
  • Leadership Alliance
  • Exclusives
  • History of the Internet
  • AFRINIC News
  • Internet Governance
    • Regulation
    • Governance Bodies
    • Emerging Tech
  • Others
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profiles
      • Startups
      • Tech Titans
      • Partner Content
    • Fintech
      • Blockchain
      • Payments
      • Regulation
    • Tech Trends
      • AI
      • AR/VR
      • IoT
    • Video / Podcast
  • Africa
  • Asia-Pacific
  • North America
  • Lat Am/Caribbean
  • Europe/Middle East
Blue Tech Wave Media
Home » 5 of Fatih Porikli’s most important thoughts on Gen AI
Fatih-Porkili
AI

5 of Fatih Porikli’s most important thoughts on Gen AI

By Audrey HuangJune 13, 2024Updated:June 18, 2024No Comments3 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email
  • Fatih Porikli, an IEEE Fellow and the Global Lead of AI Systems at Qualcomm AI Research, recently spoke on The TWIML AI Podcast about his thoughts on generative AI and traditional computer vision topics.
  • Ongoing efforts in enhancing optical flow algorithms, with techniques like speculative decoding and self-cleaning inversion.
  • Rising use of stereo imaging in XR headsets and autonomous vehicles drives the need for efficient compression techniques. Innovations like parallel hypercoding reduce redundancy while ensuring minimal latency in stereo imaging applications.

OUR TAKE
With the requirements of AI skyrocketed, answering textual questions can no longer satisfy users’ needs. Therefore, the updated AI model is built to have a wider range of functions, including analysing mathematical plots.
–Audrey Huang, BTW reporter

Fatih Porikli, an IEEE Fellow and the Global Lead of AI Systems at Qualcomm AI Research, recently spoke on The TWIML AI Podcast about his thoughts on generative AI and traditional computer vision topics. There are 5 important ideas for his thoughts.

1. Multimodal model advancements

The discussions highlighted significant advancements in multimodal models, particularly those integrating language and image processing. These models aim to interpret complex data, such as mathematical plots, by leveraging information from multiple modalities. This represents a crucial step towards developing AI systems capable of understanding diverse types of inputs and performing complex reasoning tasks.

Also read: OpenAI thwarts 5 covert influence operations using AI models

Also read: AI lies: Should we worry about deceptive AI models?

2. Optical flow optimisation

Researchers are actively working on enhancing optical flow algorithms, which are essential for tasks like video compression and motion analysis. Techniques such as speculative decoding and self-cleaning inversion aim to improve the accuracy and efficiency of optical flow, enabling real-time processing on devices like mobile phones. These advancements address the increasing demand for high-quality video processing across various applications.

3. Efficient compression techniques for stereo imaging

With the rising use of stereo imaging in devices like XR headsets and autonomous vehicles, efficient compression of stereo streams is becoming crucial. Novel approaches like parallel hypercoding and bidirectional shift modules enable stereo-aware compression, reducing redundancy and achieving significant bitrate savings while minimising latency. These techniques pave the way for more effective data transmission and storage in stereo imaging applications.

4. On-device AI demos

Demonstrations showcased practical applications of AI on mobile devices, ranging from portrait relighting and avatar generation to AI assistants with AR face recognition. These demos highlight the potential for on-device AI to enhance user experiences across various domains, including photography, communication, and augmented reality. By running AI algorithms directly on mobile devices, users can access advanced functionalities without relying on cloud-based processing, leading to faster and more seamless interactions.

5. Insights from workshops

The workshops on Efficient Large Vision Models and Omnidirectional Computer Vision provided valuable insights into emerging trends and challenges in vision model development. They emphasised the importance of efficient deployment of large models on edge devices and addressed unique considerations for processing omnidirectional imagery. These workshops serve as platforms for collaboration and knowledge sharing among researchers and industry professionals, driving advancements in vision model research and application.

Generative AI Multimodal models Optical flow
Audrey Huang

Audrey Huang is an intern news reporter at Blue Tech Wave. She is interested in AI and startup stories. Send tips to a.huang@btw.media.

Related Posts

UK government backs satellite innovation and AI start‑ups

November 24, 2025

Transatel selects Oracle to power its 5G Standalone core for IoT

November 17, 2025

AT&T launches internal AI assistant for employees

November 12, 2025
Add A Comment
Leave A Reply Cancel Reply

CATEGORIES
Archives
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023

Blue Tech Wave (BTW.Media) is a future-facing tech media brand delivering sharp insights, trendspotting, and bold storytelling across digital, social, and video. We translate complexity into clarity—so you’re always ahead of the curve.

BTW
  • About BTW
  • Contact Us
  • Join Our Team
  • About AFRINIC
  • History of the Internet
TERMS
  • Privacy Policy
  • Cookie Policy
  • Terms of Use
Facebook X (Twitter) Instagram YouTube LinkedIn
BTW.MEDIA is proudly owned by LARUS Ltd.

Type above and press Enter to search. Press Esc to cancel.