We are seeking a mid-level SysAdmin & DevOps Engineer with a strong background in automation and cloud operations. This role is exclusively production-focused and does not include internal IT support responsibilities. The ideal candidate will take a pragmatic, balanced approach to maintaining production resilience and responding to issues quickly while using that experience to drive proactive work to design ever more reliable, automated systems. You should enjoy working in a team-oriented culture and collaborating closely with engineers to improve deployment practices.
We are looking for someone who can:
Manage Kubernetes deployments (OpenShift, EKS, AKS, etc.)
Manage and monitor Linux-based servers and services primarily in cloud environments (AWS, Azure, private cloud)
Implement monitoring, logging, and alerting for proactive issue detection
Apply updates, patches, and configurations to maintain system health
Troubleshoot and remediate production deployment issues, and escalate if necessary
Manage production system performance, availability, security and capacity planning
Apply security best practices and support compliance initiatives (SOC 2, ISO 27001, etc.)
Collaborate with others to design, build, and maintain CI/CD pipelines and enable fast, reliable releases
Automate infrastructure provisioning and configuration using Infrastructure as Code (Terraform, Ansible, Helm Charts, etc.)
Collaborate with developers to optimize build, test, and deployment processes
Implement and test disaster recovery and backup strategies
Stay calm under pressure and communicate clearly with both technical and non-technical colleagues or customers