Z

Senior Site Reliability Engineer, Federal Cloud

Zscaler
Full-time
Remote
We ask that you have U.S. Citizenship given this role requires work on the federal cloud platform.

Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your vision and passion to our team of cloud architects, software engineers, security experts, and more who are helping organizations worldwide to maximize speed and agility with a cloud-first strategy.

The Cloud Operations team is looking for a Senior Site Reliability Engineer (SRE) to join our Systems Engineering team. Our team's primary focus is working on the federal cloud platform. To succeed in this role, you have experience working with and managing distributed systems, including job queues, message queues, and event-driven architectures that span from the database to the front-end web application. You will work on meaningful projects while being encouraged to explore ideas. This role is remote within the U.S. and reports to a Director of SRE.

Help to deploy and manage the systems engineering, automation platform for both on-prem and AWS clouds.
Manage, deploy, and troubleshoot embedded agents running on both Linux and Unix.
Automate blue-green deployments and reducing platform downtime.
Develop monitoring solutions and health-checks.
Be point of contact to help other engineers understand how to build automation to run on the platform.
Build automation on the platform to manage the platform.
Be a first responder for platform incidents.
What We're Looking for (Minimum Qualifications)
5+ years of working with Linux systems and scripting.
5+ years of experience with PostgreSQL replication, pubsub, and backups.
Understand Nginx and how to manage web services, including knowledge of RPC and HTTP APIs.
Experience with the TCP/IP stack and how to troubleshoot issues.
Knowledge of building and deploying Go microservices.
What Will Make You Stand Out (Preferred Qualifications)
Experience with the AWS cloud, including ALB, S3, IAM, and monitoring.
Working knowledge of Docker and Kubernetes
Knowledge of Grafana.