Director of DevOps and SRE in Toronto
United States Digital Space LLC
Job Description
Lead and innovate as the Director of DevOps and Site Reliability Engineering in Toronto. This hybrid position focuses on optimizing reliability, scalability, and cost-efficiency within the Group Functions Data Office platform. This senior leadership role requires overseeing end-to-end delivery, enhancing automation, and implementing best practices in DevOps and SRE.
You will collaborate with various stakeholders to ensure the platforms meet high standards in service continuity and security. A successful candidate will engage hands-on with teams to drive operational excellence and efficient incident management while scaling AI workloads. Key Responsibilities: • Lead global DevOps and SRE teams for optimal operation • Ensure production readiness and cost optimization of platforms • Drive automation and establish self-healing systems • Champion security, compliance, and governance • Deliver transparent operational reporting on performance Requirements: • Bachelor’s or Master’s degree in Computer Science • 8-10 years in technology leadership with 3 years in enterprise delivery • Strong expertise in cloud platforms and CI/CD practices • Proficiency in Java and Python required • Experience with GitHub Actions implementation Elevate the Group Functions Data Office as a leader, enhancing operational efficiency and compliance across platforms. #J-18808-Ljbffr