PHPhonePe
Site Reliability Engineer 2 - Hadoop
Bangalore ₹5-9 LPA Posted 23 May 2025
FULL TIME
Microsoft Azure
Linux
Mysql
Python
Hadoop
Job Description
Key deliverables:
- Manage and administer Hadoop ecosystem components and performance tuning
- Automate operational workflows using Airflow, Ansible, or scripting languages
- Monitor and troubleshoot system-level and network-level issues across hybrid cloud environments
- Collaborate with security and platform teams to apply patches, upgrades, and enhance data infrastructure
Role responsibilities:
- Ensure high availability and reliability of Hadoop clusters and supporting tools
- Maintain infrastructure across AWS/Azure/GCP and optimize load balancing with Nginx
- Develop tools and monitoring solutions using Prometheus, Grafana, ELK
- Support on-call rotations, resolve production incidents, and drive root cause analysis
