STACK Meetup - Building AI Responsibly Through Guardrails and Interpretability | Singapore Government Developer Portal
Have feedback? Please
EVENT MENU

DUMMY TEXT

28 Aug 2025
/assets/img/communities/stack-meetups.svgicon
STACK Meetup
10 Pasir Panjang Rd, Level 10

For upcoming STACK webinars and a full list of our past events, please visit our Meetup page.

STACK Meetup banner for 28 Aug 2025

Overview

As AI systems grow more advanced, ensuring their safety and predictability becomes increasingly critical. This STACK Meetup explores how safety testing and guardrails, and mechanistic interpretability, can reduce misinformation and bias. These approaches work together to ensure that AI functions safely and as intended, especially in high-stakes settings.

Get tips from our GovTech’s AI Practice team on safeguarding LLM applications against safety risks. Our speaker will guide you through the Responsible AI journey through the steps of defining a customised safety risk taxonomy, evaluating safety risks, and implementing safeguards to mitigate them.

Also, hear from a researcher at the Singapore AI Safety Institute on mechanistic interpretability, an approach akin to a brain scan for AI systems. This field seeks to uncover the inner workings of AI systems to identify backdoors, misalignment and unintended behaviours. This understanding powers applications such as model editing, behaviour steering, and the design of more robust guardrails, helping ensure that AI operates predictably and can be audited effectively.

Who should attend: AI Researchers/Engineers, Research Engineers, Data Scientists, Software Engineers/Developers and Designers who use AI in their products or solutions

Recommended knowledge level: Conceptual understanding of LLMs is helpful and experience building with LLMs is a bonus

Programme rundown

7:00pm – Introduction by STACK Community

By Goh Jia Yi, AI Engineer (Responsible AI), AI Practice, GovTech

By Clement Neo, Research Engineer, Singapore AI Safety Institute and Lab Advisor, Apart Research

8:15pm - Q&A

8:30pm - End of STACK Meetup

Last updated 18 August 2025


Was this article useful?
Send this page via email
Share on Facebook
Share on Linkedin
Tweet this page