Senior Software Engineer
Job Description
Strong experience operating distributed in-memory cache systems (Redis, Memcached, Aerospike, or similar) in production at scale.
Solid understanding of distributed systems fundamentals, including consistency, replication, partitioning, and fault tolerance.
Proven experience with SRE and DevOps practices, including monitoring, alerting, incident management, and reliability engineering.
Hands-on expertise with cloud platforms, container orchestration (Docker/Kubernetes), and Infrastructure as Code tools.
Experience building and operating CI/CD pipelines and automating infrastructure and platform operations.
Strong knowledge of observability stacks (metrics, logs, dashboards) and performance troubleshooting techniques.
Ability to perform capacity planning, performance tuning, and cost optimisation for memory-intensive systems.
Good understanding of security, compliance, and enterprise governance requirements in large organisations.
Strong collaboration skills to work effectively with software engineers, platform teams, and site reliability teams.
Ability to take operational ownership, make sound technical decisions under pressure, and continuously improve platform reliability.
