Site Reliability Engineer

Industry:

E-Commerce

Location:

Bangsar South

Job Type:

Permanent

Date Posted:

March 31, 2026

Job Summary:

 

  • Design, deploy and manage large-scale Linux server environments, primary using Rocky Linux, CentOS, and Ubuntu, to support web platforms, API servers, backend systems, and MySQL database clusters across both production and staging settings
  • Improve infrastructure reliability and platform security by applying security hardening standard operating procedures
  • Configure firewalls, set up VPNs, deploy Bastion Hosts and segment zones
  • Streamline system configuration, patch management and deployments through Ansible
  • Enhance monitoring and observability by deploying and managing tools like Zabbix, Prometheus, Grafana and performing infrastructure stress tests for system reliability
  • Support and optimise LAMP stack, Nginx, and Docker container environments
  • Implement proactive alert systems and conducted regular security assessments with tools like nmap and the ELK Stack
  • Deliver round-the-clock support through incident response rotations

 

Requirements:

  • Min 5 years’ experience running production systems, with a focus on reliability and scalability
  • Hands-on experience with cloud platforms (AWS or Azure)
  • Knowledge with Kubernetes in production
  • Familiarity with modern delivery practices — CI/CD, GitOps, and Infrastructure as Code

 

If this sounds like the role for you, please submit your resume to sewyee.gan@techstaffing.my, stating the job title. We regret that only shortlisted candidates will be notified, but we look forward to connecting if another opportunity arises.

Apply for this position

Allowed Type(s): .pdf, .doc, .docx

Apply for this Position

0.00
0.00
Maximum file size: 5 MB
Attach a resume file. Accepted file types are DOC, DOCX, PDF, HTML, and TXT.

Scroll to Top