Manager - Site Reliability Engineering (SRE) Team

Full Time
Work Remotely From
This job allows remote work from any country!
7am - 7am
Remote Support
Home office budget
Co-working stipend
Flexible hours
Paid get togethers
100% remote

Hasura Cloud generates a fully-featured unified GraphQL API on top of your databases

Hasura Cloud is a unique GraphQL product that lessens the effort that goes into building backends for applications. Our customers can use Hasura Cloud to generate a fully-featured unified GraphQL API connected to several databases and other REST/GraphQL APIs.

We are looking for a Manager for our team of Site Reliability Engineers (SREs), who are responsible for keeping Hasura Cloud systems running smoothly and making sure updates can be rolled out reliably without any downtime.

Key Responsibilities:

  • Provide technical leadership and people management for a globally distributed team of 4 (and growing) SREs

  • Be responsible for building out and implementing our incident management process

  • Perform incident commander responsibilities while coaching the team to be able to do the same

  • Take the lead on incident writeups, and ensure our process for updating status.hasura.io runs smoothly

  • Help improve the deployment process to make it as reliable and boring as possible.

  • Use your experience to identify systemic issues as our platform grows, and ensure the team works on projects to proactively prevent incidents from happening.

  • Ensure the team has effective monitoring coverage of our infrastructure, and ensure alerting is meaningful and actionable.

  • Hold 1-on-1s with the team and provide coaching and mentorship

  • Act as hiring manager and help refine our interview process for SRE roles

  • Work closely with the leads of the Infrastructure, Backend and Frontend teams building Hasura Cloud to ensure alignment in vision and product delivery

You may be a fit for this role if you:

  • Have experience working on a fast-growing SaaS platform, either as an SRE or in a lead position

  • Enjoy running distributed systems at scale in production

  • Have at least 2 years of management experience

  • Enjoy people management responsibilities; feel joy and satisfaction in seeing your team grow and learn together.

  • Think about systems - edge cases, failure modes, behaviors, specific implementations

  • Are skilled at identifying performance bottlenecks and anomalous system behavior, resolving the root cause of service issues, and coaching your team to do the same

  • Have experience as an SRE or DevOps engineer and still enjoy technical tasks

  • Have strong programming skills (Go/Python).

  • Value asynchronous collaboration and communication with your globally distributed team.

  • Enjoy documenting all the things.

  • Have an urge to build automation and tooling so that you never have to do the same work twice.

  • Have an enthusiastic, go-for-it attitude. When you see something broken, you can't help but fix it.

Bonus points if you:

  • Have experience with Hasura and its GraphQL APIs.

  • Have strong fundamentals in SQL, particularly with PostgreSQL.

  • Have experience with database management and scaling.

  • Have experience with optimized and scalable software that operates on a large number of nodes

  • Have experience with Nginx, Openresty, Docker, Kubernetes, Terraform, or similar technologies.

  • Have experience with various cloud providers like AWS, GCP, Azure, DO, etc., and their systems, products and APIs.

  • Have experience with monitoring tools like Honeycomb/Datadog/Prometheus/Grafana


This role is fully remote. We hire in most countries. If you're applying from the US, we hire remotely in these 10 states: Illinois, Virginia, California, Washington State, Maryland, Florida, Colorado, Massachusetts, Oregon and New York. Alternatively, this role can be based out of our office in Bangalore, India.

Working at Hasura:

At Hasura, we help developers build modern apps and APIs faster. Through your work at Hasura, you will have the opportunity to make a lasting impact on both Hasura as well as the larger developer ecosystem.

As a team, we take a lot of pride in our work. We obsess over the developer experience, and our first priority as a company will always be to make things easier for our users.

We offer competitive salaries, have a generous vacation policy and provide health insurance for everyone employed with Hasura.

We are an equal opportunity employer and do not tolerate discrimination of any kind.


We’d love to hear from you. Even if you don’t fulfil 100% of the above requirements, or are unsure about whether this would be the right fit, please do reach out to us with your questions!

About Hasura:

Hasura is a venture-backed open-source technology company with offices in San Francisco and Bangalore. Hasura makes your data instantly accessible over a real-time GraphQL API, so you can build and ship modern apps and APIs faster. Hasura connects to your databases, REST servers, GraphQL servers and third-party APIs (e.g., Stripe, Salesforce) and provides a unified API across all your data sources.