Site Reliability Engineer
Visa · Bangalore · 3+ yrs experience · Posted 2026-04-21
Tech stack: Ansible, Kubernetes, Python, SQL, Terraform
About the role
At Visa, you'll have the opportunity to create impact at scale — tackling meaningful challenges, growing your skills and seeing your contributions impact lives around the world.
Responsibilities:
- The Data Reliability Engineer is responsible for designing, building, and evolving cloud‑native, containerized infrastructure that powers our data products and services. This role plays a critical part in advancing our platform maturity by supporting cross-functional squads, leading complex technical initiatives, and ensuring the availability, security, scalability, and reliability of our data ecosystem.
- As a Staff Engineer, you will bring deep expertise in systems design, cloud infrastructure, networks, databases, and modern data technologies. You will also contribute hands-on experience with complex technology adoption, infrastructure automation, and high-scale distributed systems.
- This is a remote
- position
- . A remote position does not require job duties
- performed within proximity of a Visa office location. Remote
- positions may
- be required
- to be present at a Visa office with scheduled notice.
Qualifications:
- Bachelor's degree, OR 3+ years of relevant work experience
- Preferred
- Bachelor's degree, OR 3+ years of relevant work experience
- Bachelor’s degree in Computer Science, Engineering, or a related field (Desirable but not mandatory).
- Hands-on experience designing and operating cloud‑native infrastructure.
- Knowledge of Infrastructure as Code (Terraform), including contributing to reusable modules and platform components.
- Good understanding of Kubernetes and container orchestration concepts.
- Familiarity with CI/CD systems, pipeline configuration, automation, and secure deployment practices.
- Foundational competencies in reliability engineering concepts (SLOs, error budgets, incident response).
- Basic understanding of database technologies including SQL, NoSQL, and common data storage patterns.
- Experience using observability tools and stacks (Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, or similar).
- Basic automation experience using Bash, Python, or Ansible-like tools.
- Working knowledge of software engineering practices including version control, testing, code reviews, and common design patterns.
- Leadership & Execution:
- Ability to contribute to cross-functional technical initiatives from design through production under guidance.
- Experience supporting technology adoption and platform improvements across teams.
- Capability to follow and help implement infrastructure standards, best practices, and architectural guidelines.
- Comfortable working in partially ambiguous situations, escalating risks appropriately, and learning to make sound technical trade-offs.
- Strong problem-solving skills with demonstrated ability to reduce toil, address technical debt, and improve system stability.
- Experience participating in on-call rotations, incident response, and post-incident reviews.
- Communication & Collaboration:
- Clear written and verbal English communication skills.
- Ability to collaborate effectively with data engineers, platform engineers, SREs, security teams, and product teams.
- Capable of producing clear technical documentation and contributing to architectural discussions and decision records.
Qualifications
- Bachelor's degree, OR 3+ years of relevant work experience
- Bachelor’s degree in Computer Science, Engineering, or a related field (Desirable but not mandatory).
- Hands-on experience designing and operating cloud‑native infrastructure.
- Knowledge of Infrastructure as Code (Terraform), including contributing to reusable modules and platform components.
- Good understanding of Kubernetes and container orchestration concepts.
- Familiarity with CI/CD systems, pipeline configuration, automation, and secure deployment practices.
- Foundational competencies in reliability engineering concepts (SLOs, error budgets, incident response).
- Basic understanding of database technologies including SQL, NoSQL, and common data storage patterns.
- Experience using observability tools and stacks (Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, or similar).
- Basic automation experience using Bash, Python, or Ansible-like tools.
- Working knowledge of software engineering practices including version control, testing, code reviews, and common design patterns.
- Leadership & Execution:
- Ability to contribute to cross-functional technical initiatives from design through production under guidance.
- Experience supporting technology adoption and platform improvements across teams.
- Capability to follow and help implement infrastructure standards, best practices, and architectural guidelines.
- Comfortable working in partially ambiguous situations, escalating risks appropriately, and learning to make sound technical trade-offs.
- Strong problem-solving skills with demonstrated ability to reduce toil, address technical debt, and improve system stability.
- Experience participating in on-call rotations, incident response, and post-incident reviews.
- Communication & Collaboration:
- Clear written and verbal English communication skills.
- Ability to collaborate effectively with data engineers, platform engineers, SREs, security teams, and product teams.
- Capable of producing clear technical documentation and contributing to architectural discussions and decision records.
Responsibilities
- The Data Reliability Engineer is responsible for designing, building, and evolving cloud‑native, containerized infrastructure that powers our data products and services.
- This role plays a critical part in advancing our platform maturity by supporting cross-functional squads, leading complex technical initiatives, and ensuring the availability, security, scalability, and reliability of our data ecosystem.
- As a Staff Engineer, you will bring deep expertise in systems design, cloud infrastructure, networks, databases, and modern data technologies.
- You will also contribute hands-on experience with complex technology adoption, infrastructure automation, and high-scale distributed systems.
- This is a remote position.
- A remote position does not require job duties performed within proximity of a Visa office location.
- Remote positions may be required to be present at a Visa office with scheduled notice.