Software Engineer

Visa hybrid • Basingstokefull_time

Site Reliability Engineering (SRE) is essential to Visa’s Cloud platform strategy. In this role, you’ll ensure our development platform and tools let engineers focus on innovation instead of infrastructure. You’ll promote observability best practices and automate resolution of recurring issues, working closely with software engineering teams to support security, availability, and performance. Responsibilities include triaging issues, collaborating on infrastructure management, and setting up monitoring for full coverage. Hands-on expertise is required, especially with major DevTools like GitHub, Jenkins, Jira, and Artifactory. 

We seek a Software Engineer + SRE hybrid engineer. The ideal candidate deeply understands at least one major DevTool, quickly resolves tool-related issues in collaboration with developers, and applies systems thinking to maintain reliable applications and infrastructure while improving developer productivity. 

Key Responsibilities:

Tools Support: 

  • You will be the primary point of contact for developers using tools like GitHub, Jenkins, Jira, or Artifactory. 
  • Troubleshoot and resolve tool-related issues promptly to minimize developer downtime. 
  • Maintain and optimize CI/CD pipelines and integrations for reliability and scalability. 
  • Collaborate with development teams to improve workflows and automation. 

Site Reliability Engineering:

  • Design, implement, and maintain systems for high availability, scalability, and performance. 
  • Monitor and improve application reliability through proactive measures and incident response. 
  • Develop and maintain observability solutions (metrics, logging, tracing). 
  • Participate in on-call rotations and drive root cause analysis for incidents. 

Collaboration & Continuous Improvement:

  • Partner with engineering teams to identify reliability risks and implement best practices. 
  • Document processes, troubleshooting guides, and reliability of playbooks. 
  • Advocate for automation and self-service solutions to reduce operational overhead.