At Odido, we're on a mission to become the most customer-driven telco in the Netherlands, and our IT landscape plays a crucial role in making that happen. As Senior Software Engineer SRE, you are at the heart of service reliability, building and maintaining the systems that keep millions of customers connected, no matter how intense the traffic or how high the stakes.
You'll be the technical lead ensuring end-to-end resiliency across our applications and infrastructure. From proactive monitoring to self-healing automation, you prevent outages before they impact our customers. Imagine this: it's 3 AM and an anomaly is detected. Your smart monitoring and automated scripts spring into action, rerouting traffic and neutralizing the risk before anyone even notices. The next day, your team dives deep into post-incident reviews, continuously improving how we work, learn, and scale.
You'll work closely with SRE leads, platform engineers, and software teams to drive automation, reliability engineering, and observability at every level. Your expertise in fault-tolerant design, distributed systems, and DevOps practices will directly impact our ability to deliver seamless customer experiences, whether over mobile, broadband, or fiber.
And if you have telecom experience: perfect. It's not just a bonus here, it's essential. We're not just building any digital platform; we're building for the unique demands of a high-volume, always-on telecom environment. Without prior experience in telecom ecosystems, we unfortunately cannot proceed with your profile for this role.
You'll bring deep technical knowledge, a collaborative mindset, and a passion for solving complex problems in fast-moving environments. Together, we'll raise the bar on performance, security, and customer satisfaction, every day.
You'll be part of a high-impact engineering team, collaborating with developers, platform engineers, and operations specialists to continuously improve Odido's service reliability, scalability, and efficiency. Your work will drive the automation, instrumentation, and observability that power Odido's digital services.
Key Responsibilities:
- Continuous Improvement: Oversee and enhance incident-response processes, ensuring lessons learned translate into structural improvements.
- Automation & Application as Code: Develop reusable patterns for automation, configuration management, and deployment across teams and products.
- Service Ownership: Take full responsibility for several critical services, ensuring high availability and reliability.
- Incident Management: Lead or participate in outage response calls, quickly resolving incidents and minimizing downtime.
- Monitoring & Observability: Design and implement proactive monitoring strategies using tools like Prometheus, Grafana, and Kibana to improve system performance.
- Troubleshooting & Debugging: Analyze and fix system issues in a complex distributed environment and application stack.
- Engineering Best Practices: Advocate for DevOps and SRE principles, mentoring junior engineers on automation and operational excellence.
Must-Have Skills and Qualifications
- Experience with Node.js, Python, and REST services.
- Experience with public cloud platforms (AWS, Azure) and related technologies (Docker, Kubernetes, CloudFormation).
- Strong understanding of storage, database systems, caching, queueing, and networking.
- Experience in leading technical recoveries and troubleshooting distributed systems.
- Ability to debug, optimize code, and automate routine operational tasks.
- Solid foundation in Linux or Windows administration and troubleshooting.
- Strong knowledge of monitoring/observability tools (Prometheus, Grafana, Kibana, Elasticsearch).
- Understanding of Service Level Agreements (SLAs) and Service Level Objectives (SLOs).
- Proficiency in at least one programming language for automation and scripting.
- Excellent command of English, both written and spoken.
Nice-to-Have Skills
- Knowledge of AI-driven operational solutions for predictive monitoring.
- Background in security practices and compliance for cloud environments.
We are Odido, the new provider of mobile, fiber optic and TV. And with almost 2,000 colleagues, we show that telecom can be improved. Because technology is for everyone. Wherever you come from, wherever you go. With Odido everyone participates in the digital world. That is our ambition. Everyone at Odido helps to build a brand that is human, optimistic and progressive.
Is that really something for you? Then we might fit well together.
This is what we stand for
Our name - you can also read it from back to front - consists of different shapes. Which together are one. Because that's how we look at the world around us. As a place where people, no matter how different, move forward together. We're there for each other. We always look at opportunities. We celebrate diversity and are committed to an inclusive work environment with equal opportunities for all. That sounds good of course. But we don't stop at fine words: at Odido we are a recognized Top Employer. A confirmation that we are proud of.
What we offer
- Good salary and variable bonus scheme;
- Hybrid working;
- A progressive pension scheme;
- 30 vacation days (if you work for us full-time) and an extra day off after Ascension Day;
- Redeemable holidays;
- An Odido subscription;
- Real growth opportunities;
- Personal annual learning budget and over 200 digital training courses;
- Workshops, learning weeks, annual ski trip, fun outings and parties.
Some qualities can't be taught; they're in your nature. You're driven by curiosity, calm in chaos, and always two steps ahead when it comes to service reliability. You love solving problems before they happen, and your mindset is one of ownership, accountability, and constant improvement. You understand how tech impacts people, and that's what keeps you moving.
You combine strategic thinking with hands-on engineering skills, and you know how to bridge conversations between operations, development, and business. Whether it's designing for scale or jumping in at 3 AM when the pressure's on, you know what it takes to keep mission-critical services running, because you've done it before.
You bring:
- An HBO or Bachelor's degree in IT, or equivalent practical experience.
- 10+ years of experience in Site Reliability Engineering or IT operations, with significant exposure to telecom environments (this is a must).
- A Bachelor's or Master's degree in IT, Telecom, Computer Science, or a related technical field.
- A deep understanding of telecom services, IT ecosystems, and customer-impacting KPIs.
- Expertise in service monitoring tools like Netcool, Zabbix, Dynatrace, Splunk, and SolarWinds.
- A proven track record managing both technical SLAs and business KPIs under pressure.
- Strong working knowledge of ITIL v3/v4 (certification preferred).
- Hands-on experience with automation, AIOps, and digital operations platforms.
- Excellent communication and reporting skills, especially in executive-level interactions and stakeholder engagement.
- A pragmatic mindset, strong debugging skills, and the confidence to act fast when the stakes are high.
Bonus points if you are certified in one or more of the following:
- ITIL Foundation/Intermediate/Expert
- TM Forum Frameworx (eTOM, SID, TAM)
- PMP / PRINCE2 (Project Management)
- Cloud certifications (AWS, Azure, GCP)
- Data/Analytics certifications (a plus, but not required)
At Odido we learn every day. All of us. You are responsible for your own development. That is why you decide how, what and when you learn. We have more than 200 digital training courses with which you can work on professional and personal goals. We don't do old-fashioned performance reviews and assessments. You keep your manager and colleagues informed of your goals and progress. You are in control.
Press the button
Are you as excited about Odido as we are? Then we are probably a good match. We are looking forward to meeting you! You can apply via the application button. Done in a minute!