Senior Site Reliability Engineer
Cloud Orbit Technologies
About the role
Greetings!
We are looking for a Senior Site Reliability Engineer (SRE).
Location: Netherlands / Belgium
Experience: 5–8+ years
Job Type: Remote/Onsite
Work Type: Freelance
Key Responsibilities
Define, implement and maintain SLIs and SLOs for customer‑specific DSPC platform environments.
Design and operate observability solutions including metrics, logs, traces and dashboards.
Own incident response processes, including P1/P2 escalation management and coordination.
Perform incident analysis, root cause investigations and post‑incident reviews.
Design, document and maintain operational runbooks and escalation procedures.
Automate operational tasks, remediation actions and self‑healing mechanisms using IaC and scripting.
Drive reliability improvements based on SLO performance, error budgets and incident trends.
Collaborate with platform and security engineers to maintain healthy, secure and resilient clusters.
Guide and enable the 24/7 operations team to operate environments according to defined runbooks and SLOs.
Support customer‑facing incident management, reporting and reliability reviews.
Advise customers on performance, availability and reliability enhancements.
Required Skills & Experience
Strong hands‑on experience with SRE practices in production environments.
Proven experience implementing monitoring, alerting and observability for Kubernetes or Red Hat OpenShift.
Strong troubleshooting, incident management and root cause analysis capabilities.
Experience automating operational workflows and remediation actions.
Strong understanding of SLIs, SLOs, error budgets and reliability‑driven operations.
Calm, structured decision‑maker under pressure.
Self‑driven team player with strong ownership and operational discipline.
Fluent in English & Dutch.
Profile Requirements
Open to candidates based in the Netherlands or Belgium.
Strong IT background with focus on operations, reliability engineering and automation.
At least 5 years (ideally 5–8+) of relevant experience in SRE, operations or platform reliability roles.
Comfortable working in customer‑facing environments with strict SLAs and compliance requirements.
Willingness to participate in on‑call and escalation rotations.
Willingness to undergo required background screening (e.g., BO+).
Relevant certifications (Cloud, Kubernetes, SRE, ITIL) are a plus.
If you are interested, kindly share your CV via the email address listed on click.appcast.io.