COCognizant Technology Solutions India Pvt Ltd
Site Reliability Engineer
Bangalore ₹6-9 LPA Posted 21 Nov 2025
FULL TIME
Dynatrace
Grafana
Prometheus
Site Reliability Engineering
SRE
Job Description
Key Responsibilities
- Apply SRE principles including error budgets, SLIs/SLOs, and incident management.
- Use observability and APM tools such as Grafana, Prometheus, OpenTelemetry, Dynatrace, and AppDynamics.
- Perform performance profiling, load testing, and capacity planning for distributed systems.
- Conduct resiliency testing using frameworks like Chaos Mesh, Gremlin, and LitmusChaos.
- Manage cloud infrastructure and services on AWS, Azure, or GCP.
- Orchestrate containerized environments using Kubernetes.
- Automate and build tools using Go, Python, and Bash scripting.
- Analyze and troubleshoot reliability and performance issues using a data-driven approach.
