E

Site Reliability Specialist (Observability & Kubernetes)

Accepting applications

Everbridge · United States

Full-Time Senior AIaiateganrf
Posted
2 May
Category
Test
Experience
Senior
Country
United States
At Everbridge, we build resilient, scalable, and secure cloud platforms that power critical services used by 6,000+ organisations worldwide, especially when it matters most.

We’re looking for a Platform Site Reliability Specialist to take ownership of our enterprise observability platform and help shape how our teams understand, monitor, and improve system reliability at scale.

This is a high-impact role where you’ll drive both technical excellence and strategic direction, ensuring our engineers have deep, real-time visibility into system health, performance, and reliability across a complex, cloud-native environment.

Please note that this role requires eligibility to obtain secret secret clearance*

What You'll Do

Observability Platform Ownership

Head the design, operation, and evolution of Everbridge’s observability stack
Build and maintain a highly available, scalable observability platform
Standardize instrumentation, dashboards, alerts, and SLOs
Support incident response, root cause analysis, and capacity planning

Grafana Stack & Telemetry

Operate and scale Grafana and technology
Grafana Loki (logs)
Grafana Mimir (metrics)
Grafana Tempo (tracing)
Grafana Alerting

Kubernetes

Maintain reliability and security of EKS clusters running observability
Manage cluster lifecycle and upgrades

Infrastructure as Code & Automation

Terraform for infrastructure provisioning
HashiCorp Packer
Gitlab CI/CD at Scale

What You'll Bring

6+ years of experience in Site Reliability Engineering or Platform Engineering
Strong hands-on experience with the Grafana ecosystem
Deep expertise in Kubernetes, especially Amazon EKS
Solid proficiency with Terraform and infrastructure as code

Preferred Qualifications

Experience with OpenTelemetry
Background in large-scale observability systems
Experience with cloud cost optimization

The reasonably estimated salary for this role at Everbridge ranges from $118,700 - $145,000 and may also include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Everbridge offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, disability income benefits, life and AD&D insurance, a 401(k) plan and match, paid time off, and fitness reimbursements.

Fair Chance Statement US & Canada

We are committed to providing equal employment opportunities in compliance with all applicable Federal, Provincial/State and Local laws, including the California Fair Chance Act and any local County Fair Chance Ordinance (or local equivalent). Pursuant to these and other relevant regulations, we consider qualified applicants with criminal histories in a manner consistent with the law.

For roles subject to background checks, the following material job duties may be affected by an applicant’s criminal history:

Access to sensitive or confidential information, such as financial records, proprietary data, or client information.
Management of cash, company funds, or other valuable assets.
Work in environments requiring heightened security measures.
Compliance with contractual or regulatory requirements specific to the position.

We evaluate each applicant's criminal history individually, considering its nature, timing, and relevance to the specific job duties, while maintaining our commitment to fair hiring practices and promoting workplace equity.

About Everbridge

Everbridge empowers enterprises and government organizations to anticipate, mitigate, respond to, and recover stronger from critical events. In today’s unpredictable world, resilient organizations minimize impact to people and operations, absorb stress, and return to productivity faster when deploying critical event management (CEM) technology. Everbridge digitizes organizational resilience by combining intelligent automation with the industry’s most comprehensive risk data to Keep People Safe and Organizations Running™. For more information, visit www.everbridge.com, read the company blog, and follow on Twitter. Everbridge… Empowering Resilience

Everbridge is an Equal Opportunity/Affirmative Action Employer. All qualified Applicants will receive consideration for employment without regard to race, creed, color, religion, or sex including sexual orientation and gender identity, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Show more Show less