OROracle
Site Reliability Developer 4
Pune ₹3-12 LPA Posted 27 Oct 2025
FULL TIME
Netconf
Prometheus
Network Monitoring
Mpls
Bgp
+1 more
Job Description
RESPONSIBILITIES:
- The NRE (Network Reliability Engineering) team is accountable for ensuring the robustness of the Oracle Cloud Network Infrastructure.
- An NRE role is primarily focused on applying an engineering approach to measure and automate a network's reliability to align with Organization's service-level objectives, agreements, and goals.
- The duties entail promptly responding to network disruptions, pinpointing the underlying cause, and collaborating with internal and external stakeholders to fully restore functionality.
- NRE team members play a critical role in automation of recurring tasks in daily operations to streamline processes, enhance workflow efficiency, and increase overall productivity.
- This support covers a cloud-based network with a global footprint, including hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, CLoS Network, and the Internet.
- Responsibilities include designing, writing, and deploying network monitoring and automation software, to improve the availability, scalability, and efficiency of Oracle products and services.
Key Responsibilities and Duties
- Supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI).
- Primarily focused on the development and support of network fabric and systems through a combination of a deep level understanding of networking at the protocol level coupled with programming skills.
- Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.
- Develop solutions to enable front line support teams to act on network failure conditions.
- Frequently develops scripts to automate routine tasks for team and business units.
- Coordinate with network monitoring to gather telemetry and create alerts rules using them.
- Build dashboards to represent data at various network layers and device roles that help identify network issues, anomalies.
- Mentor junior engineers and participate in network solution and architecture design process.
- Participate in an on-call rotation (primary or secondary).
- Provide break-fix support for events and serve as the escalation point for event remediation. Lead post-event root cause analysis.
- Serves as SME on software development projects for network automation and network monitoring.
- Collaborate with network vendor technical account team and internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems.
Requirements
- Bachelor's degree in CS or related engineering field with 5+ years of Network Engineering experience or Master's with 5+ years of Network Engineering experience.
- Experience working in a large ISP or cloud provider environment.
- Experience working in a network operations role.
- Deeper understanding of Data Center build and design - CLoS architecture etc.
