Site Reliability Engineer (Phoenix)
Mastech Digital
Phoenix, Japan Full Time Engineering Jobs Japan
Job Description
Site Reliability Engineer (SRE)
Phoenix, AZ (Hybrid)
We are seeking a hands-on Site Reliability Engineer (SRE) to help build, maintain, and improve highly available, scalable, and resilient production systems. This role will partner closely with engineering, infrastructure, and operations teams to ensure platform reliability, automation, and operational excellence across cloud environments.
Responsibilities
- Design, implement, and support scalable, fault-tolerant cloud infrastructure.
- Monitor system performance, availability, and reliability across production environments.
- Automate operational processes and improve deployment efficiency through CI/CD pipelines.
- Define and manage SLOs, SLIs, and error budgets.
- Lead incident response, root cause analysis, and post-incident improvements.
- Collaborate with development teams to improve application reliability and system performance.
- Build and maintain monitoring, logging, and alerting solutions.
- Continuously optimize infrastructure, tooling, and operational workflows.
Required Skills & Qualifications
- 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
- Strong experience with cloud platforms such as AWS, Azure, or GCP.
- Proficiency with scripting/programming languages like Python, Bash, or Go.
- Experience with Infrastructure as Code tools such as Terraform or CloudFormation.
- Hands-on experience with CI/CD tools and automation pipelines.
- Strong knowledge of Linux systems administration and networking fundamentals.
- Experience with monitoring and observability tools such as Prometheus, Grafana, Datadog, or New Relic.
- Excellent troubleshooting, analytical, and communication skills.
Preferred Qualifications
- Experience with Kubernetes and containerized environments.
- Familiarity with GitOps, service mesh, and modern deployment strategies.
- Understanding of security best practices and compliance frameworks.
Work Environment
- Onsite work model based in Phoenix, AZ.
- Collaborative and fast-paced engineering environment.
- Opportunity to work on large-scale, mission-critical systems.
Posted May 2, 2026