Sr. Site Reliability Engineer - Infrastructure Services
Hashicorp
About this Role
As a Senior Site Reliability Engineer on the Infrastructure Services team, you will play a pivotal role in designing, building, and maintaining the infrastructure that underpins all HashiCorp cloud products. Your work will ensure our systems are robust, scalable, and performant, facilitating seamless operations and enhancing service availability. This is crucial for maintaining the trust and satisfaction of our customers who rely on HashiCorp's products to be available, reliable, and secure.
In this role, you can expect to:
Design and implement resilient infrastructure solutions, using automation and best practices to enhance system reliability, scalability, security, and compliance.
Implement comprehensive monitoring and alerting systems to ensure the health and performance of our infrastructure.
Lead the response to infrastructure incidents, ensuring swift resolution and minimizing impact on service availability.
Partner with cloud product teams to understand their infrastructure needs and provide technical guidance.
Drive the adoption of automation tools and processes to streamline operations and reduce manual intervention.
Create and maintain detailed documentation of infrastructure configurations, procedures, and troubleshooting guides.