Job Summary:
- Design, deploy and manage large-scale Linux server environments, primary using Rocky Linux, CentOS, and Ubuntu, to support web platforms, API servers, backend systems, and MySQL database clusters across both production and staging settings
- Improve infrastructure reliability and platform security by applying security hardening standard operating procedures
- Configure firewalls, set up VPNs, deploy Bastion Hosts and segment zones
- Streamline system configuration, patch management and deployments through Ansible
- Enhance monitoring and observability by deploying and managing tools like Zabbix, Prometheus, Grafana and performing infrastructure stress tests for system reliability
- Support and optimise LAMP stack, Nginx, and Docker container environments
- Implement proactive alert systems and conducted regular security assessments with tools like nmap and the ELK Stack
- Deliver round-the-clock support through incident response rotations
Requirements:
- Min 5 years’ experience running production systems, with a focus on reliability and scalability
- Hands-on experience with cloud platforms (AWS or Azure)
- Knowledge with Kubernetes in production
- Familiarity with modern delivery practices — CI/CD, GitOps, and Infrastructure as Code
If this sounds like the role for you, please submit your resume to sewyee.gan@techstaffing.my, stating the job title. We regret that only shortlisted candidates will be notified, but we look forward to connecting if another opportunity arises.