Senior Software Engineer - Cloud Infrastructure Reliability [t500-25052]
ANSR
Job Description
ANSR is hiring for one of its clients.
ABOUT UNDER ARMOUR:
Our story is a classic American entrepreneurial journey, born from a simple, unmet need identified by an athlete. In 1996, founder Kevin Plank, then captain of the University of Maryland football team, set out to solve the problem of sweat-soaked cotton shirts by creating a moisture-wicking synthetic alternative. That innovation – HeatGear – redefined the athletic base layer and laid the foundation for what is today a global leader in performance apparel, footwear, and accessories.
Headquartered in Baltimore, Maryland, Under Armour is driven by a clear mission – to make all athletes better. This relentless pursuit of better defines who we are, shaping our focus on performance, innovation, and continuous improvement.
OUR PURPOSE:
Building on this global foundation, Under Armour India is purpose-built to access India’s top-tier talent and embed the brand’s culture into scalable, tech-driven solutions. At its core, this is about empowering those who strive for more – a belief reflected in our values: Act Sustainably, Celebrate the Wins, Fight on Together, Love Athletes, and Stand for Equality.
These values serve as a shared framework that guides how we think, build, and collaborate. They connect teams across geographies, reinforce our purpose, and ensure that everything we do is aligned to a common goal – enabling better outcomes for athletes and the business.
At Under Armour India, this translates into an environment where individuals have the freedom to go further, regardless of role. Teams are empowered to develop and deliver state-of-the-art products and digital solutions that enhance performance and drive impact at scale.
VALUES & INNOVATION:
Across Under Armour globally, our values act as the thread that unites every teammate, shaping a culture grounded in purpose, accountability, and shared ambition. They are not just principles, but active drivers of how we operate, innovate, and grow together.
This culture is deeply anchored in innovation – a continuous pursuit of better that pushes boundaries and challenges convention. Whether through product innovation or digital transformation, teams are enabled to create solutions that elevate performance and redefine possibilities for athletes everywhere.
PURPOSE OF ROLE:
The Site Reliability Engineering (SRE) team at Under Armour drives continuous improvements in performance, resiliency, and operational excellence across our technology platforms. We take a consultative, engineering-first approach to reliability, partnering closely with cross-functional teams to deliver guidance, automation, and best practices that improve the scalability, stability, and reliability of the services that power our products and digital experiences.
We are seeking a Site Reliability Engineer to help strengthen the reliability and scalability of critical systems. In this role, you will build automation, enhance observability, improve operational workflows, and participate in incident response and problem management. The ideal candidate brings a strong foundation in distributed systems, cloud-native platforms, and performance optimization, along with a collaborative mindset and a passion for applying SRE principles across the organization.
Innovation is a core part of how we work at Under Armour. Success in this role requires adaptability, continuous learning, and the ability to pivot as technologies, priorities, and business needs evolve.
YOUR IMPACT (Job Responsibilities):
- Engineer and improve reliable, scalable, and high-performing systems supporting critical business services.
- Build automation across deployments, monitoring, alerting, and operational workflows to reduce toil and improve resiliency.
- Partner with engineering and platform teams to apply SRE principles, including SLIs, SLOs, error budgets, and automated remediation.
- Enhance CI/CD pipelines and software delivery processes to improve reliability and efficiency.
- Develop observability solutions across metrics, logs, and distributed tracing to improve system visibility.
- Participate in incident response, root cause analysis, and corrective actions to prevent recurrence.
- Support capacity planning, performance tuning, and scaling strategies for cloud-native and distributed systems.
- Maintain Infrastructure as Code, cloud configurations, and operational documentation, including runbooks and standards.
- Collaborate with teams to identify reliability risks and drive continuous improvement.
QUALIFICATIONS:
- Bachelor's degree in Computer Science, Engineering, or a related field with typically 3-5 years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or a related discipline; or a Master's degree with typically 3 years of relevant experience; or typically 9 years of relevant work experience without a degree.
- Proficiency in one or more programming or scripting languages such as Python, Go, JavaScript, or Bash.
- Solid working knowledge of Linux/Unix-based systems.
- Experience building or supporting CI/CD pipelines using tools such as GitHub Actions, GitLab CI, or Jenkins.
- Familiarity with Infrastructure as Code practices and tools (e.g., Terraform, CloudFormation).
- Experience with containerization and orchestration technologies, including Docker and Kubernetes.
- Understanding of networking fundamentals, distributed systems, and system design principles.
PREFERRED QUALIFICATIONS:
- Hands-on experience with modern observability stacks such as Prometheus, Grafana, ELK/EFK, or Datadog.
- Experience contributing to SLI/SLO frameworks and applying error budgets to guide reliability decisions.
- Exposure to GitOps workflows and tooling such as Argo CD or Flux.
- Working knowledge of service mesh architectures (e.g., Istio, Linkerd).
- Familiarity with performance and load testing tools and techniques.
- Experience with asynchronous and distributed systems, including message queues, event-driven architectures, or distributed data platforms.
- Cloud or DevOps certifications (e.g., AWS Associate or Specialty, GCP Professional, Kubernetes CKA/CKS) are a plus.
- Experience operating in large-scale enterprise environments and collaborating with globally distributed teams.
- Experience using AI-assisted development tools (such as Copilot, Cursor, or similar) to improve code quality, accelerate development, and enhance documentation.
- Understanding of foundational AI/ML concepts, with exposure to cloud-native AI services and/or the ability to leverage AI tools to automate cloud and operational tasks.
WORKPLACE LOCATION:
- Location: This individual must reside within commuting distance from our office.
- Work Schedule: This role follows a hybrid work schedule, requiring 4 days in-office per week.
OUR COMMITMENT TO EQUAL OPPORTUNITY:
At Under Armour, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, color, religion or belief, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, family or parental status, and any other characteristic protected by applicable law. Under Armour seeks to recruit, develop and retain the most talented people representing a wide variety of backgrounds and perspectives. Reasonable accommodations are available for applicants with disabilities upon request.