en

Services

We understand that no two organisations are the same. Find out more about how we've customised our talent solutions to help clients across South East Asia meet their needs.

Read more
Candidates

Together, we’ll map out career-defining, life-changing pathways to achieve your career ambitions. Browse our range of services, advice, and resources.

Learn more
Services

We understand that no two organisations are the same. Find out more about how we've customised our talent solutions to help clients across South East Asia meet their needs.

Read more
About Robert Walters Singapore

Since our establishment in 1998, our belief remains the same: Building strong relationships with people is vital in a successful partnership.

Learn more

Work for us

Our people are the difference. Hear stories from our people to learn more about a career at Robert Walters Singapore.

Learn more

Senior Site Reliability Engineer

Save job

Keywords: Site Reliability Engineer, Patform Engineering, Disaster Recovery strategy, Incident Management framework

Our client is embarking on a transition from a CTRM-centric technology architecture to a data-centric model to enhance scalability and stability. They are seeking an experienced senior site reliability engineer to support this multi-year programme of work. The primary responsibility will be to maintain and enhance the reliability of their platform in production as its scope, volume, and user base continue to expand throughout the programme.

  • Transitioning to a data-centric model for enhanced scalability and stability
  • Multi-year programme of work
  • Maintain and enhance the reliability of their platform

What you'll do:

As a Site Reliability Engineer, you will play a crucial role in maintaining and enhancing the reliability of our client's platform. You will establish and track SLOs and SLIs for their platform, lead remediation efforts when SLOs are breached, and develop comprehensive plans to maintain SLO compliance as the platform continues to scale. Your role will also involve working closely with the Platform Engineering team to continuously automate tasks related to production infrastructure, deployment pipelines, and system stability. This will improve operational efficiency while ensuring optimal observability across the platform.

  • Establish and track SLOs and SLIs for their platform
  • Work with the Platform Engineering team to continuously automate tasks related to production infrastructure
  • Maintain a strong relationship with end-users, ensuring the right level of feedback is collected
  • Identify and address potential system bottlenecks and failure points before they escalate into incidents
  • Develop, maintain, and upgrade tools to ensure optimal observability across the platform
  • Contribute to infrastructure capacity planning and to the implementation of our Disaster Recovery strategy

What you bring:

The ideal candidate for this Site Reliability Engineer position brings along prior experience in a similar role, preferably within a commodity trading or similar organization. You should have at least three years of experience maintaining decentralized or microservices systems in a production environment. An in-depth understanding of microservices-based systems is essential for this role, including designing, deploying, and managing distributed, scalable services. Experience with cloud PaaS and IaaS (Microsoft Azure preferred) is advantageous.

  • Prior experience in a site reliability/DevOps engineering role
  • At least three years of experience maintaining decentralized or microservices systems in a production environment
  • In-depth understanding of microservices-based systems
  • Experience with relational and document-based databases
  • Experience with cloud PaaS and IaaS is advantageous
  • Experience in developing and maintaining CI/CD Pipelines
  • Experience with containerisation technologies is advantageous

What sets this company apart:

Our client is a forward-thinking company that is embarking on a transition to a data-centric model to enhance scalability and stability. They are committed to providing their employees with a dynamic and fast-paced work environment where they can thrive and grow. Their focus on continuous improvement and innovation makes them an ideal choice for those who are looking to make a significant impact in their field.

What's next:

If you're ready to take the next step in your career as a Site Reliability Engineer, don't hesitate!

Apply today by clicking on the link!

Do note that we will only be in touch if your application is shortlisted.
Robert Walters (Singapore) Pte Ltd
ROC No.: 199706961E | EA Licence No.: 03C5451
EA Registration No.: R21100958 Harsh Paras Mehta

Contract Type: FULL_TIME

Specialism: Tech & Transformation

Focus: Infrastructure

Industry: IT

Salary: Negotiable

Workplace Type: Hybrid

Experience Level: Mid Management

Location: Singapore

Job Reference: REN43M-3B2C2115

Date posted: 31 December 2024

Consultant: Harsh Mehta (R21100958)

Phone number: +65 6228 5386

harsh.mehta@robertwalters.com.sg

Harsh Mehta (R21100958)

Save job

Share

I'm Robert Walters Are you?

Come join our global team of creative thinkers, problem solvers and game changers. We offer accelerated career progression, a dynamic culture and expert training.