Skip to content
mimi

(Senior) Site Reliability Engineer (m/w/d)

Kiwigrid GmbH

Dresden · Hybrid Senior 3d ago

About the role

About Kiwigrid GmbH

What sets Kiwigrid apart from its competitors is that it doesn't just sell a concept for the Internet of Things. Kiwigrid offers cross-industry value creation and sees itself as an open platform for the future of energy and e-mobility. Kiwigrid's customers use the platform for a wide variety of reasons, depending on where they are today, in which industry they operate, and what they want to offer. The platform can help them open up new markets, diversify their offerings, and develop cross-industry business models. To achieve this, Kiwigrid provides them with pioneering technologies and services that increase energy efficiency, optimize the use of renewable energies, pave the way for electric vehicles, and stabilize power grids.

Responsibilities

As a Site Reliability Engineer in our dynamic SRE team, you will play a crucial role in ensuring the reliability, scalability, and performance of our KiwiOS IoT platform and the products built on it. You will work closely with our software development teams to ensure that the decentralized energy world is available around the clock for everyone.

Your tasks:

  • Monitoring and Incident Management: You will equip our services with the important metrics to continuously monitor our system for malfunctions. In the event of an incident, you will support debugging in Google Cloud in collaboration with the development teams to resolve the error as quickly as possible. Leading incident response activities and taking on on-call duty will also be part of your responsibilities.
  • Automation and Scalability: You will take our cloud infrastructure to the next level by identifying and automating recurring tasks. You will also recommend managed services from Google Cloud to the development teams and drive their implementation within the company. The continuous further development of our deployment process will also be your responsibility, helping the teams to achieve simpler and more secure deployments in Kiwigrid's production environment.
  • Security and Compliance: You will help Kiwigrid achieve a secure cloud environment by integrating tooling for monitoring vulnerabilities and detecting external, potentially dangerous activities. You will be involved in creating compliance policies and will support audits.
  • Documentation and Knowledge Sharing: You will help the development teams prepare their documentation so that the stack and request flows are understandable. Additionally, you will bring best practices from the SRE team into the development teams to achieve a high degree of standardization.

Requirements

  • Professional experience with cloud infrastructures and implementation of IT projects
  • Enthusiasm for new technologies and willingness for continuous learning
  • Good knowledge of current cloud technologies, especially extensive experience with Google Cloud (GCP)
  • Independent, highly structured, and conscientious work style
  • Analytical and conceptual skills combined with cross-system and process-oriented thinking
  • High quality awareness and assertiveness
  • Practical experience with agile working methods
  • Proactive and clear communication
  • Hybrid working with at least 1-2 days of office presence per week
  • Very good English skills, both written and spoken
  • Desirable: Good German skills, both written and spoken

What We Offer

  • A permanent employment contract with performance-related pay
  • Flexible working hours and various working time models; mobile working
  • Agile working; light-flooded lofts; work islands & standing desks; English courses
  • Company health management; fresh fruit, free drinks, and coffee specialties as much as you want
  • Joint sports events like company runs or volleyball; company bike leasing; grace days for annoying colds, subsidy for Urban Sports Club for your fitness
  • Job ticket; bonus for employee acquisition; company pension scheme
  • Personal responsibility; regular feedback meetings; fixed further training budgets and on-the-job training
  • Sponsored leisure activities (billiards, foosball, table tennis, etc.)
  • Individual design of your own workplace

Contact

Contact (via Apply-Contact Form)

Important keywords for this advertisement: Dresden, Cloud, IoT, Kubernetes, GCP, Google Cloud Platform, Cloud Architecture, Incident Management, Energy Management, SRE, Site Reliability Engineering, Cloud Infrastructure, Cloud Engineering, IaC, Infrastructure as Code, K8s and IT.

Requirements

  • Professional experience with cloud infrastructures and implementation of IT projects
  • Enthusiasm for new technologies and willingness for continuous further training
  • Good knowledge of current cloud technologies, especially extensive experience with Google Cloud (GCP)
  • Independent, very structured and conscientious work
  • Analytical and conceptual skills combined with system-wide and process-oriented thinking
  • High quality awareness and assertiveness
  • Practical experience with agile working methods
  • Proactive and clear communication
  • Very good English skills in speaking and writing

Responsibilities

  • You will equip our services with the important metrics to continuously monitor our system for malfunctions.
  • In the event of an incident, you will support debugging in Google Cloud in collaboration with the development teams to resolve the error as quickly as possible.
  • Leading incident response activities and taking on on-call duty will also be part of your job.
  • You will bring our cloud infrastructure to the next level by automating recurring tasks.
  • You will also recommend managed services from Google Cloud to the development teams and promote their use in the company.
  • The continuous further development of our deployment process is also your responsibility and thus helps the teams to have simpler and safer deployments in Kiwigrid's production environment.
  • You will help Kiwigrid achieve a secure cloud environment by integrating tooling for monitoring vulnerabilities and detecting external, potentially dangerous activities.
  • You will be involved in creating compliance policies and support audits.
  • You will help the development teams prepare their documentation so that the stack and request flows are understandable.
  • Additionally, you will introduce best practices from the SRE team into the development teams to achieve a high degree of standardization there.

Benefits

flexible working hoursvarious working time modelsmobile workingagile workingEnglish coursescompany health managementfresh fruitfree drinkscoffee specialtiescompany sports eventscompany bike leasingpaid time off for illnessUrban Sports Club subsidyjob ticketemployee referral bonuscompany pension schemeregular feedback sessionstraining budgetstraining on the jobsponsored leisure activities

Skills

Cloud ArchitectureCloud EngineeringCloud InfrastructureGCPGoogle Cloud PlatformIaCInfrastructure as CodeIncident ManagementIoTK8sKubernetesSRESite Reliability Engineering

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free