Technical Operations & Cloud Engineer Job in MDA - Medical Decision Alliance

Technical Operations & Cloud Engineer

Leipzig, SN, DE, Germany

Job Description

About us

We are a young, technology-driven start-up at the intersection of artificial intelligence, software development, and healthcare. Our mission: to sustainably improve patient care through smart digital solutions. Our interdisciplinary team combines medical expertise with technical excellence – working dynamically and with real impact.

Role Overview

The Technical Operations & Cloud Engineer will be responsible for ensuring MDA’s digital infrastructure runs reliably, securely, and in full compliance with internal policies and external service obligations.

This role combines IT operations, infrastructure automation, and compliance management. It ensures that MDA’s systems meet defined service levels, that deployments are consistent and auditable, and that documentation, monitoring, and disaster recovery procedures are always up to date.

You will own and execute all day-to-day operations tasks while leading automation initiatives that use cloud services to standardize and simplify operations.

Key Responsibilities

1. Infrastructure Operations & Cloud Engineering

Build and manage all environments (development, staging, production), ensuring stability, uptime, high availability, and scalability. Perform daily operations tasks including deployment, monitoring, incident handling, and access management. Maintain and optimize CI/CD pipelines for automated build, test, and release workflows. Oversee backup, restore, and disaster-recovery processes, ensuring readiness for business continuity. Maintain clear change logs, rollback plans, and deployment documentation for all releases.

2. Automation & Configuration Management

Design, develop, and maintain Ansible playbooks and roles for configuration, provisioning, and maintenance tasks. Automate repetitive system administration processes (updates, patching, compliance checks). Integrate Ansible automation into CI/CD pipelines to improve efficiency and reduce manual interventions. Ensure automation scripts adhere to security, traceability, and compliance standards.

3. Service Reliability & Monitoring

Implement and maintain system health dashboards and alerting mechanisms (e.g., Prometheus, Grafana, or equivalent). Analyze logs and metrics to proactively prevent downtime and performance degradation. Conduct root-cause analyses and implement preventive measures after incidents. Own on-call readiness and escalation processes for critical system issues.

4. Compliance & Documentation

Translate contractual and internal operational requirements (uptime, recovery, security, audit trails) into documented procedures. Maintain documentation for infrastructure topology, deployment processes, DR/BCP playbooks, and compliance evidence. Ensure all operational activities follow GDPR, ISO 27001, and internal data-protection guidelines. Provide periodic operational and compliance status reports to management.

5. Continuous Improvement

Identify opportunities for process improvement, automation, and standardization. Collaborate with product and QA teams to ensure new features are operationally ready before deployment. Establish performance baselines and continuously track key service metrics (availability, response time, incident rate).

Requirements

Education & Experience

Bachelor’s or Master’s degree in Computer Science, Information Systems, or a related field. Proven experience in Cloud Ops, Technical Operations, or Infrastructure Management. Experience with cloud infrastructure (AWS, GCP, or similar). Familiarity with CI/CD tools, monitoring systems, and backup solutions. Knowledge of DR/BCP frameworks, system security, and data-protection practices (GDPR, ISO 27001).

Technical Skills

Linux infrastructure automation: Terraform, Ansible, and scripting.

CI/CD: Git, pipelines (e.g., GitLab), testing infrastructure.

Monitoring & logging: Grafana, Prometheus, ELK, or similar.

Containerization & orchestration: Docker, Kubernetes.

Job type: Full-time

Work location: Partly remote (home office), 04109 Leipzig

Job Detail

Job Id: JD3728738
Industry: Not mentioned
Total Positions: 1
Job Type: Full-time
Salary: Not mentioned
Employment Status: Permanent
Job Location: Leipzig, SN, DE, Germany
Education: Not mentioned
