Close Menu
    Facebook LinkedIn YouTube Instagram X (Twitter)
    Blue Tech Wave Media
    Facebook LinkedIn YouTube Instagram X (Twitter)
    • Home
    • Leadership Alliance
    • Exclusives
    • Internet Governance
      • Regulation
      • Governance Bodies
      • Emerging Tech
    • IT Infrastructure
      • Networking
      • Cloud
      • Data Centres
    • Company Stories
      • Profiles
      • Startups
      • Tech Titans
      • Partner Content
    • Others
      • Fintech
        • Blockchain
        • Payments
        • Regulation
      • Tech Trends
        • AI
        • AR/VR
        • IoT
      • Video / Podcast
    Blue Tech Wave Media
    Home » Microsoft safety system can catch hallucinations in its AI apps
    Azure-AI
    Azure-AI
    AI

    Microsoft safety system can catch hallucinations in its AI apps

    By Chloe ChenMarch 29, 2024No Comments2 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email
    • Microsoft’s Azure introduces new safety features for AI models, including Prompt Shields and Groundedness Detection, aimed at detecting vulnerabilities and blocking malicious prompts.
    • These features enhance control over filtering inappropriate content, addressing concerns about AI model safety.
    • Microsoft’s commitment to AI safety aligns with expanding Azure’s AI capabilities amidst growing demand.

    Microsoft’s Chief Product Officer of Responsible AI, Sarah Bird, reveals to The Verge in an interview the rollout of new safety features designed by her team for Azure customers. These features, powered by LLM technology, aim to detect vulnerabilities, monitor for plausible yet unsupported scenarios, and block malicious prompts in real-time for users leveraging Azure AI models.

    Also read: Windows is under new management due to Microsoft AI reorganisation

    Also read: Microsoft Teams is getting smarter Copilot AI features

    There are various functionalities improving safety

    The functionalities include Prompt Shields, Groundedness Detection, and safety evaluations, with additional features such as directing models towards safe outputs and tracking prompts for problematic users forthcoming. Notably, the system evaluates input prompts for banned words and hidden cues before processing, ensuring responses align with desired outcomes.

    It can filter hate speech or violence within AI models

    Bird emphasises the customisable control over filtering hate speech or violence within AI models, addressing concerns about inappropriate content. These safety measures extend to popular models like GPT-4 and Llama 2, although users of smaller, less-used open-source models may need to manually configure the features.

    Microsoft’s commitment to enhancing AI safety aligns with the growing demand for Azure’s AI capabilities, underscored by recent partnerships aimed at expanding its model offerings.

    AI apps new safety system
    Chloe Chen

    Chloe Chen is a junior writer at BTW Media. She graduated from the London School of Economics and Political Science (LSE) and had various working experiences in the finance and fintech industry. Send tips to c.chen@btw.media.

    Related Posts

    EU‑Japan AI accord fosters global standard setting

    July 25, 2025

    Aligned Data Centers launches massive AI campus in Ohio

    July 25, 2025

    Liquid Intelligent Technologies: Building Africa’s digital future

    July 25, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    CATEGORIES
    Archives
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • August 2023
    • July 2023

    Blue Tech Wave (BTW.Media) is a future-facing tech media brand delivering sharp insights, trendspotting, and bold storytelling across digital, social, and video. We translate complexity into clarity—so you’re always ahead of the curve.

    BTW
    • About BTW
    • Contact Us
    • Join Our Team
    TERMS
    • Privacy Policy
    • Cookie Policy
    • Terms of Use
    Facebook X (Twitter) Instagram YouTube LinkedIn

    Type above and press Enter to search. Press Esc to cancel.