SRE 2 - Database
slice
Job Description
About the role
We are looking for a Site Reliability Engineer (Database) to help us build and operate highly available, scalable, and resilient data infrastructure that powers slice's core banking products. This role will focus on managing distributed databases, self-hosted data platforms, and Kubernetes-native infrastructure to ensure reliability at scale. You will work closely with platform and engineering teams to automate operations, improve system performance, and build robust foundations for mission-critical workloads. Ultimately, you will play a key role in operating complex database systems securely, efficiently, and with high reliability.

What You'll Do
Build, operate, and maintain large-scale distributed database systems in production environments.
Manage self-hosted database infrastructure with a strong focus on scalability, reliability, and operational excellence.
Deploy and operate cloud-native database platforms on Kubernetes.
Manage and optimize distributed data systems such as CockroachDB and ScyllaDB.
Drive automation for database operations including backups, failover, scaling, upgrades, and recovery workflows.
Design and implement infrastructure as code using Terraform and GitOps.
Monitor database health, performance, replication, and cluster stability.
Participate in incident response and root cause analysis.
Collaborate with application and platform teams.
Improve operational processes, documentation, and runbooks.
Support adjacent distributed infrastructure components such as Kafka, Temporal, or EMQX where required.

What We're Looking For
4 to 9 years of experience in SRE, Database Engineering, or Infrastructure Engineering.
Experience managing distributed databases in production.
Experience working with at least one of the following database systems: CockroachDB or ScyllaDB.
Strong understanding of self-hosted infrastructure.
Hands-on Kubernetes experience.
Experience with AWS, GCP, or Azure.
Understanding of distributed systems concepts.
Terraform experience.
Strong Linux knowledge.
Ability to automate using Bash, Python, or Go.

Good-to-Have
Experience with distributed messaging and streaming systems such as Apache Kafka or Redpanda.
Exposure to Temporal.
Experience in SaaS or open-source environments.
DBA background transitioning into SRE/platform engineering.

Life at slice
Life so good, you'd think we're kidding:
Competitive salaries. Period.
Extensive medical insurance that looks out for our employees & their dependents. We'll love you and take care of you, our promise.
Flexible working hours. Just don't call us at 3AM, we like our sleep schedule.
Tailored vacation & leave policies so that you enjoy every important moment in your life.
A reward system that celebrates hard work and milestones throughout the year. Expect a gift coming your way anytime you kill it here.
Learning and upskilling opportunities. Seriously, not kidding.
Good food, games, and a cool office to make you feel like home. An environment so good, you'll forget the term "colleagues can't be your friends".