StackOps – Your Monitoring and Logging Solution | Singapore Government Developer Portal

Overview

StackOps was conceptualised to enable agencies to embrace Site Reliability Engineering (SRE), which emphasises automated monitoring and logging of services. SRE also shifts monitoring and logging processes into the development cycle itself to ensure that operational concerns and reliability concepts are built in.

StackOps is part of the Monitoring component of the Singapore Government Tech Stack (SGTS). The tools in this component help you track application performance and give you operational transparency for effective incident response.

What is StackOps?

StackOps is a monitoring toolchain that focuses on the most important and critical user actions of your service. It features automated reactions based on the golden monitoring and logging metrics (latency, load, limit, and errors).
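As an illustration of reacting to the four metrics named above, the following is a minimal Python sketch, not StackOps code. The `Request` record, the function names, the threshold values, and the reading of "limit" as the fraction of an assumed capacity in use are all hypothetical. In a real deployment the numbers would come from the monitoring applications rather than from an in-memory list.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Request:
    """Hypothetical record of one served request."""
    latency_ms: float  # time taken to serve the request
    ok: bool           # False if the request ended in an error


def golden_metrics(requests: List[Request], window_s: float,
                   capacity_rps: float) -> Dict[str, float]:
    """Summarise one time window into latency, load, limit, and errors."""
    count = len(requests)
    load_rps = count / window_s
    latencies = sorted(r.latency_ms for r in requests)
    # Nearest-rank 95th percentile, using integer arithmetic for the rank.
    rank = (95 * count + 99) // 100
    p95 = latencies[rank - 1] if count else 0.0
    errors = sum(1 for r in requests if not r.ok)
    return {
        "latency_p95_ms": p95,
        "load_rps": load_rps,
        # Assumption: "limit" is read as the share of capacity in use.
        "limit_used": load_rps / capacity_rps,
        "error_rate": errors / count if count else 0.0,
    }


def breached(metrics: Dict[str, float],
             thresholds: Dict[str, float]) -> List[str]:
    """Return the names of metrics that exceed their threshold."""
    return [name for name, limit in thresholds.items()
            if metrics[name] > limit]


# Example window: 18 fast, successful requests and 2 slow, failed ones.
window = [Request(100.0, True)] * 18 + [Request(900.0, False)] * 2
metrics = golden_metrics(window, window_s=10.0, capacity_rps=4.0)

# Illustrative thresholds only; real values depend on the service.
alerts = breached(metrics, {
    "latency_p95_ms": 500.0,
    "error_rate": 0.05,
    "limit_used": 0.8,
})
print(alerts)
```

An automated reaction would then be keyed off the names in `alerts`, for example notifying the service owner when the error rate is breached.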

StackOps consists of two main parts:

  1. StackOps Monitoring is a package of monitoring applications (Prometheus, Thanos, and Grafana) that are centrally managed, automated, open-sourced, and easy to use.

    Fig 1: StackOps Monitoring Applications (Prometheus, Thanos, and Grafana)
  2. StackOps Logging is a suite of centrally managed logging applications (Elastic, Logstash, and Kibana) that transforms and visualises your data from multiple sources.

Fig 2: StackOps Logging Applications (Elastic, Logstash, and Kibana)

Why Should StackOps be Adopted?

Adopting StackOps can benefit users in the following ways:

  • Users can start monitoring and logging their systems quickly and simply.
  • Creates meaningful presentations of metrics, aggregated based on what service owners, administrators, and customers need to know.
  • Fills the current lack of monitoring and logging solutions that are quick and easy to set up.
  • Resolves the problem of being unable to track Service Level Objectives (SLOs) of Whole-of-Government (WOG) products and services.
  • Offers a one-stop-shop for monitoring the health of government services.
  • StackOps services will be whitelisted by General Availability (GA) by the end of 2021.

How Do You Use StackOps?

Contact the StackOps product manager at StackOps_SRE@tech.gov.sg for onboarding details.

Onboarding can typically be completed within one business day.

Success Stories

StackOps’ customer base grew from two to fifteen within five months. It is now used by SGTS products and the Government on Commercial Cloud (GCC) platform.

What’s Next?

The StackOps team is developing the StackOps Dashboard, which provides a single-pane view of all services. It allows users to manage alerts, take remedial action against detected events, and easily publish their services’ availability to their target audiences.

Fig 3: StackOps Dashboard

Contact Information

For enquiries, email StackOps_SRE@tech.gov.sg.

Last updated 27 December 2021

