Z

Senior Site Reliability Engineer

Zello

🌍 North America 🏠 Remote ⏱ FullTime 💼 Senior Level 🗓 1 weeks ago

IMPORTANT: Please be aware, scammers may try to impersonate Zello by reaching out regarding job opportunities. We will never ask you for bank account information, checks, or other sensitive information as part of our hiring process. All correspondence will come from the zello.com email domain. If you’re unsure, please email recruiting@zello.com with questions.

ABOUT ZELLO

Zello is a voice-first communication platform, powered by our industry-leading push-to-talk technology, to improve collaboration and productivity for desk-less workers. With over 175+ million users, we’re the #1 rated push-to-talk app in the world, delivering 9 billion (yes, with a B) messages a month. 

At Zello, our company values are at the heart of what we do everyday. We’re proud to serve the frontline, we’re privileged to connect people in times of crisis across the globe, and we’re honored to support first responders.

And this is where you come in.

We’re looking for a Site Reliability Engineer to help us make our systems more observable, performant, and resilient. You’ll work closely with our platform and application teams to build the tooling, practices, and insights that keep Zello reliable as we scale.

AFTER A SUCCESSFUL FIRST YEAR, YOU WILL HAVE

- Implemented end-to-end observability tooling for application and infrastructure metrics, traces, and logs.

- Delivered profiling and tracing systems that surface performance bottlenecks before they impact users.

- Defined and tuned alerting to ensure only high-signal, actionable incidents reach engineers.

- Helped evolve Zello’s incident response and postmortem processes, ensuring consistent learning and improvement.

- Provided developers with clear visibility into application performance and release impact, driving data-informed engineering.

WHAT YOU'LL DO

- Build and maintain monitoring, tracing, and profiling systems that empower teams to measure and improve performance.

- Partner with cross-organization teams to define SLIs, SLOs, and SLAs that reflect real user experience.

- Lead efforts to optimize observability, from instrumentation standards to dashboard design.

- Participate in and help coordinate our on-call rotation, incident response, and post-incident reviews.

- Continuously evaluate and recommend tools or process improvements to strengthen reliability and reduce alert fatigue.

- Collaborate on platform improvements that enhance system resilience and developer velocity.

WHO YOU ARE

- BSc in Computer Science or equivalent experience.

- 6+ years of experience in site reliability, DevOps, or software engineering roles.

- Deep understanding of monitoring, alerting, and observability platforms (e.g., Prometheus, Grafana, Loki, OpenTelemetry).

- Experience implementing tracing, logging, and profiling for distributed systems.

- Strong background in incident management, postmortem practices, and reliability metrics.

- Familiarity with Linux, Kubernetes, Terraform, and GCP (preferred) or other major clouds.

- Proficiency in a scripting or backend language (e.g., Python, Go, Bash).

Excellent problem-solving, communication, and collaboration skills.
Passionate about eliminating toil and driving continuous improvement in system health.

We hire for potential, passion for our mission, and a knack for solving difficult problems over checking every qualification box. We have competitive pay, equity with significant upside, and intentionally design our benefits to encourage healthy and well-balanced employees, flexible schedules and time off. We even offer a sabbatical after every five years of service so you’re able to pursue and enjoy what matters most to you. And of course, we wouldn’t be a technology company in Austin without a ping-pong table and free snacks in our break room. Join us!

Zello provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

All Zello personnel are required to comply with defined security, privacy, and compliance requirements applicable to their role along with requirements that are applicable to all Zello personnel.

Share this job: