Job ID : 1536741 Date Updated : April 30th, 2025

PR/158845 | Site Reliability Engineering Lead

Location	Malaysia, Kuala Lumpur
Job Type	Permanent Full-time
Salary	Negotiable, based on experience

Job Description

COMPANY OVERVIEW
One of our well-established clients in Kuala Lumpur is seeking a Site Reliability Engineering Lead.

JOB RESPONSIBILITIES

Team Leadership:

○ Lead and mentor a team of SREs, fostering a culture of ownership, collaboration, and continuous improvement.

○ Define clear goals, performance metrics, and development plans for the team.

System Reliability & Performance:

○ Design and implement strategies to improve system reliability, scalability, and performance.

○ Conduct root cause analysis of production incidents and develop preventive solutions.

Infrastructure Management:

○ Oversee the deployment, monitoring, and management of production environments.

○ Collaborate with development teams to design cloud-native infrastructure and architecture.

Automation & CI/CD:

○ Drive automation of operational processes, reducing manual intervention and response times.

○ Optimize CI/CD pipelines to ensure smooth and rapid deployments.

Incident Management:

○ Establish incident response protocols and lead efforts during major incidents.

○ Ensure robust monitoring and alerting systems are in place to proactively detect issues.

Collaboration & Communication:

○ Act as a liaison between engineering, operations, and other teams to align objectives.

○ Share insights and best practices with internal stakeholders to enhance overall system resilience.

JOB REQUIREMENTS

Technical Expertise:

○ Strong experience with cloud platforms (AWS, Azure, Google Cloud) and infrastructure-as-code tools (Terraform, Ansible, etc.).

○ Proficiency in programming/scripting languages (Python, Go, Shell, etc.).

○ Deep knowledge of Kubernetes, containerization, and distributed systems.

Leadership Skills:

○ Proven track record of leading SRE or DevOps teams and managing large-scale production environments.

○ Strong decision-making, prioritization, and problem-solving capabilities.

Monitoring & Metrics:

○ Expertise in implementing and using monitoring tools (Prometheus, Grafana, Datadog, etc.) and logging systems.

○ Familiarity with service-level objectives (SLOs), service-level agreements (SLAs), and error budgets.

Soft Skills:

○ Excellent communication and collaboration skills to work across cross-functional teams.

○ Ability to mentor and upskill team members, fostering a learning-oriented culture.

Experience:

○ At least 8 years of experience in SRE, DevOps, or related roles with a focus on reliability engineering.

General Requirements

Minimum Experience Level	Over 3 years
Career Level	Mid Career
Minimum English Level	Business Level
Minimum Japanese Level	Business Level
Minimum Education Level	Associate Degree/Diploma
Visa Status	No permission to work in Japan required

Job Location

Malaysia, Kuala Lumpur

Work Conditions

Industry	IT Consulting

Job Category

Other > Other
