ST

Engineer, Reliability Engineering - CoE Job ID: 17825

Standard Chartered Bank
Chennai5-8 LPA Posted 25 Apr 2025
FULL TIME
Devops
Vm
Azure
Linux
Aws

Job Description

Key Responsibilities

  • Working with Platform, Production engineering and application SREs to manage and resolve complex production issues.
  • Improving Platform performance, availability, and reliability.
  • Implement observability solutions for proactive issue identification and optimization.
  • Managing processes for incidents, changes, releases, and deployments.
  • Developing automation tools (IaC, alerts as code, dashboard as code) to enhance efficiency.
  • Conducting POCs to implement tools to improve performance, scaling, reliability and availability.
  • Analysing trends in incidents, problems, and alerts to drive operational improvements.
  • Documenting SOPs, critical systems information, and best practices for current and future use.
  • Providing technical guidance to necessary stakeholders.
  • Staying updated on advancements in Software Engineering with extended focus on Reliability Engineering.

Skills and Experience

  • Programming Languages   
  • Linux, VM, Containers and Kubernetes   
  • AWS and Azure   
  • Database   
  • Observability   

Qualifications

Mandatory Skills:

  • Proficient in one or more of the following languages (Java, Python and Go) with full SDLC experience.
  • Expertise in Reliability Engineering principles: Anomaly detection, root cause analysis, and predictive maintenance.
  • Knowledge in defining SLIs, SLOs, and error budgets.
  • Hands-on experience with Kubernetes, Containers, Cloud, and Database.
  • Strong knowledge in Observability Tools and Open Telemetry.
  • Familiarity with DevOps methodologies, tools, and automating (e.g. Azure Pipelines, Terraform, Helm etc.,)
  • Experience with public/private cloud platforms including AWS and Azure.
  • Experience in leading an operations team in application Production Environments.

Preferred Skills:

  • Experience in Messaging Platforms (e.g. MQ/Solace/Kafka), API Gateways and Service Mesh.
  • Knowledge in Generative AI and Responsible AI

Required Skills

Join WhatsApp Channel