Staff Site Reliability Engineer

Visa • Remote • Ashburn • Full-time

What a Staff Site Reliability Engineer Does at Visa

As a Staff Site Reliability Engineer (SRE), you will be part of a cross-functional Operations & Infrastructure group responsible for the reliability, availability, performance, and optimization of Visa Spend Clarity for Enterprises (VSCE). You will support teams in running robust applications, lead incident resolution efforts, and drive operational excellence through automation, observability, and platform modernization.

This role is critical to Visa’s transformation as we scale our product to a broader range of issuers through cloud infrastructure and automation. You will work closely with engineering, operations, and product teams to ensure our systems are resilient, secure, and continuously improving.

Why This Role Matters
You will be part of a critical global function within the VSCE product at a time when we are modernizing our platform through cloud infrastructure and automation. This transformation enables us to scale our product to a broader range of issuers and is a key focus area, with ambitious growth goals, within Visa Commercial Solutions.

Our Culture
At Visa, your individuality fits right in. Working here gives you an opportunity to impact the world, invest in your career growth, and be part of an inclusive and diverse workplace. We are a global team of disruptors, trailblazers, innovators, and risk-takers who are helping drive economic growth in even the most remote parts of the world. We’re creatively moving the industry forward and doing meaningful work that brings financial literacy and digital commerce to millions of unbanked and underserved consumers.
You're an individual. We're the team for you. Together, let's transform the way the world pays.

Essential Functions

  • Operate and improve distributed systems and SaaS applications in production environments.
  • Lead and coordinate incident response efforts, ensuring timely resolution and root cause analysis.
  • Collaborate with engineering teams to enhance system reliability, uptime, and performance.
  • Automate operational tasks using scripting and orchestration tools (e.g., PowerShell).
  • Support and configure middleware, load balancers, and Web Application Firewalls.
  • Drive strategic initiatives such as cloud migration and platform modernization.
  • Apply AWS cloud expertise to solve infrastructure problems and scalability challenges.
  • Monitor and manage enterprise systems using observability and alerting tools.
  • Participate in a 24/7/365 on-call rotation, including shift and weekend support as needed.
  • Contribute to internal platform development with a product-led mindset.
  • Ensure secure and compliant software delivery in regulated environments.
  • Support geographically dispersed systems across multiple time zones.
  • Provide support and documentation for task handoffs and transitions.

This is a hybrid position. The expected number of days in the office will be confirmed by your hiring manager.