Bare Metal Senior DevOps
DeOS
Job Description
DeOS is a next-generation interactive edge platform enabling instant, high-performance gaming on any device. Powered by breakthrough streaming technology and a global edge network, DeOS removes the need for consoles, high-end PCs, or downloads—just click and play. This isn’t about keeping servers online.
It’s about engineering the platform that powers the future of interactive entertainmen Why This Role Is Differe nt You’ll architect infrastructure that operates at global scale with near-zero tolerance for latency or downti me.You’ll shape the engineering culture by defining DevOps standards and best practices across the compa ny.Every deployment, automation, and optimization directly impacts the experience of gamers around the wor ld.You’ll work across Kubernetes, Linux, CI/CD, infrastructure-as-code, and distributed systems with significant ownership and autono What Sets You Ap art You challenge assumptions instead of accepting the status quo. quo.You naturally think in terms of automation, resilience, and repeatabil ity.You enjoy mentoring engineers and raising the technical bar for the entire t eam.You thrive in ambiguous startup environments where speed and execution mat ter.You’re equally comfortable troubleshooting a kernel issue, designing deployment architecture, or improving developer workfl What You'l l Do Design, implement, and maintain cloud-native and hybrid infrastructure optimized for reliability, scalability, and perform ance.Lead and mentor engineering teams on DevOps practices, Infrastructure as Code, Kubernetes, CI/CD, security, and operational excel ence.Develop and maintain system architecture documentation, technical standards, and operational runb ooks.Analyze performance-critical components and continuously optimize platform effici ency.Own and evolve release engineering processes and CI/CD pipelines to enable rapid and reliable deploym ents.Collaborate closely with software engineers, product teams, and QA to build automated, observable, and reproducible delivery pipel ines.Evaluate and introduce modern infrastructure technologies and tooling to improve engineering productivity and platform resili ence.Support production environments, troubleshoot critical incidents, and participate in on-call rotations when requ ired.Drive improvements in monitoring, observability, security, and system reliability across the plat What You Bring 8+ years of experience in DevOps, Site Reliability Engineering, or software engineering with demonstrated technical leade rship.Deep expertise in Linux system administration, operating system internals, performance tuning, and scripting using Bash, Python, or similar lang uages.Proven experience operating Kubernetes clusters and containerized infrastructure at scale.Strong knowledge of CI/CD platforms such as GitHub Actions, GitLab CI, or Je nkins.Extensive experience with Infrastructure as Code tools including Terraform, Ansible, or equivalent technol ogies.Experience working with major cloud platforms including AWS, Google Cloud Platform, or Microsoft Azure.Strong understanding of monitoring and observability platforms such as Prometheus, Grafana, ELK, or similar solu tions.Experience implementing secure infrastructure and DevSecOps best prac tices.Excellent understanding of Git-based development workflows and collaborative engineering prac tices.Strong knowledge of scalable distributed systems, deployment strategies, high availability, and fault tole Points Experience working with GPU infrastructure, including NVIDIA or AMD drivers and performance optimi zation.Familiarity with AI infrastructure and tooling, including Large Language Models (LLMs), vector databases, or Model Context Protocol (MCP).Experience building or operating latency-sensitive platforms such as gaming, streaming, or real-time s ystems.Contributions to open-source infrastructure or DevOps pr #J-18808-Ljbffr