Site Reliability Engineer (Remote) (India/EU) at SigNoz (W21)
0.10% - 0.50%
Open source alternative to DataDog
IN / PL / UA / Remote (IN; PL; UA)
Full-time
3+ years
About SigNoz

SigNoz is an open source alternative to DataDog, NewRelic. We help developers monitor their applications and troubleshoot any problem.

We have got lots of interest from the dev community with 18000+ stars , 5200+ slack community members within two year of launching the project. Here's our Github repo

We are backed by prominent angels & funds including Y Combinator.

About the role
Skills: Go, Kubernetes

Looking for a SRE engineer to join our team at SigNoz. You will be part of the first few hires in our team and will have the opportunity to own a significant part of the product.

This is an opportunity to work on core developer infra open source product - and would love to chat with folks who are excited by this.

When applying please mention the number of vCPU cores managed in Kubernetes

Why us?

  • Opportunity to work in a global dev infra product from India.
  • Handle Petabyte scale
  • Work on an open source product (18K+ github stars). Engage with the community. Evangelise the product. Build your GitHub profile
  • Work with high volumes of data and real-time applications. There are some real perf challenges in doing this well you would love to solve.
  • Fully Remote
  • Founding team from IITs who are/have been devs themselves

Skills we are looking for

  • 3-6yrs of experience in building and leading large infra with uptime guarantees
  • Good grasp of golang to automate deployments, centrally manage platform, etc
  • Working in a small team and owning uptime of SaaS
  • Have run Kubernetes in production for 1000+ vCPUs
  • Deep expertise in AWS or GCP. We are on GCP
  • Knowledge of helm, terraform, argoCD, clickhouse, kafka
  • Have experience maintaining 99.99+ SLAs. Strong understanding of VPA and HPA in action
  • Cloud infrastructure automation using Kubernetes and operators
  • Have been on-call for incident resolution
  • Troubleshooting networking, computing, storage and Kubernetes failures. Running statefulsets in kubernetes

Next steps

Seems like something right up your alley?

Just apply on this site or email your CV and an optional intro note to me at [ankit at signoz dot io]. Feel free to include links to your GitHub, LinkedIn, Twitter, or blog posts.

Our process involves a short initial exploratory chat, followed by three interviews/discussions. The aim is for both sides to learn more about each other.

Technology
  • ReactJS with Typescript for frontend
  • Go for writing services and processors
  • ClickHouse as the datastore
  • Opentelemetry as the instrumentation library
  • Supported deployment models - Docker, on K8s via Helm charts

Here's our architecture

Other jobs at SigNoz

fulltimeIN / Remote (IN)UI / UX0.10% - 0.50%3+ years

fulltimeIN / DE / EE / GB / PL / Remote (IN; DE; EE; GB; PL)0.10% - 0.50%3+ years

fulltimeIN / PL / UA / GB / DE / EE / Remote (IN; PL; UA; GB; DE; EE)Full stack3+ years

fulltimeIN / Remote (IN)0.10% - 0.30%1+ years

fulltimeIN / Remote (IN)Frontend0.10% - 0.50%3+ years

fulltimeIN / PL / DE / GB / Remote (IN; PL; DE; GB)Full stack0.10% - 0.50%1+ years

fulltimeIN / PL / UA / Remote (IN; PL; UA)Devops0.10% - 0.50%3+ years

Hundreds of YC startups are hiring on Work at a Startup.

Sign up to see more ›