Site Reliability Engineer

Cognizant Technology Solutions India Pvt Ltd
Bangalore
6-9 LPA
Posted 21 Nov 2025
FULL TIME
Dynatrace
Grafana
Prometheus
Site Reliability Engineering
SRE

Job Description

Key Responsibilities

  • Apply SRE principles including error budgets, SLIs/SLOs, and incident management.
  • Use observability and APM tools such as Grafana, Prometheus, OpenTelemetry, Dynatrace, and AppDynamics.
  • Perform performance profiling, load testing, and capacity planning for distributed systems.
  • Conduct resiliency testing using frameworks like Chaos Mesh, Gremlin, and LitmusChaos.
  • Manage cloud infrastructure and services on AWS, Azure, or GCP.
  • Orchestrate containerized environments using Kubernetes.
  • Automate and build tools using Go, Python, and Bash scripting.
  • Analyze and troubleshoot reliability and performance issues using a data-driven approach.
