Job Details

ID #51853447
Estado Pennsylvania
Ciudad Newtownsquare
Full-time
Salario USD TBD TBD
Fuente Insight Global
Showed 2024-06-06
Fecha 2024-06-07
Fecha tope 2024-08-06
Categoría Etcétera
Crear un currículum vítae
Aplica ya

Site Observability Engineer

Pennsylvania, Newtownsquare, 19073 Newtownsquare USA
Aplica ya

Job DescriptionThis position is for our large software client. They are currently looking for an Engineer to join the team Observability team. The ideal candidate will be responsible for improving our monitoring and alerting posture for Cloud Infrastructure. The role requires a strong understanding of observability tools and practices, with a focus on Prometheus, Grafana, Gardener Kubernetes, and Splunk. This person will support and look at Prometheus for their HANA cloud implementation that is beginning this year into next.

Implement, manage, and improve monitoring solutions that use Prometheus, ensuring high availability and accurate alerting for our systems.

Contribute to the development of observability strategies to improve our Cloud monitoring posture.

Collaborate with development teams to integrate observability into the CI/CD pipeline and throughout the application lifecycle.

Respond to and investigate incidents, providing thorough post-mortem analyses and implementing preventive measures.

Stay current with the latest trends and best practices in site reliability and observability.

Work with cross-functional teams to ensure system reliability, scalability, and performance.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to [email protected] .   To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .Skills and Requirements

5+ years of experience as a Site Reliability or DevOps Engineer

3+ years of experience working with Kubernetes for container orchestration

Experience with Prometheus for monitoring solutions and ensuring high availability and accurate alerting for systems

Experience with Grafana and Splunk for observability

Ideally has experience with Dynatrace, they will work on making it a large footprint in the organization null

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion,sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by applicable laws, regulations, andordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to [email protected].

Aplica ya Suscribir Reportar trabajo