Vacancy expired!
- Implement strategies to optimize network Life Cycle Management (LCM) and maximize service reliability
- Measure and improve Reliability Metrics (SLO/SLI), implement Observability tools (Monitoring, Logging, and Tracing solutions), manage Ops processes (Incident and Problem Management), and streamline release management
- Increase network performance through system design reviews and process improvements
- Implement methods to minimize and eventually eliminate manual intervention by deploying zero-touch software-defined infrastructure (IaC), removing errors and inconsistencies
- Deploy automation tools to operate, monitor, and maintain highly available, geo-redundant network services in hybrid cloud environments
- Write software to automate API-driven tasks at scale and contribute to the workflow engines
- Assist in building the required tools to configure and update infrastructure and applications using Continuous Integration/Continuous Deployment (CI/CD) practices and policies
- Implement tools for Configuration Management and automated monitoring of all infrastructure
- Proactively identify potential trouble spots and create mitigation strategies
- Master's Degree and 6+ years of Development and Operations related experience or equivalent combination of education and experience
- At least 3 years of experience working in a modern engineering services team where you've built and extended an automated CI/CD pipeline
- Relevant experience as an SRE is an added advantage
- Hands-on experience with Kubernetes and container deployment
- Knowledge of the tooling landscape for monitoring and operations, such as Ansible, Puppet, Chef, Terraform, or another configuration management / orchestration suite
- Experience writing code in Go or Python
- Knowledge of Linux OS internals and administration
- Experience with at least one cloud computing infrastructure (Google Cloud Platform, Azure, or AWS preferred)
- Knowledge of PaaS tools such as the ELK stack, Kafka, etc.
- Ability to work independently with minimal supervision