Senior AI Platform Engineer (m/f/d)

Festanstellung, Vollzeit · Portugal, remote, Germany, Greifswald

Your mission

We are looking for a Go Platform Engineer who thrives at the intersection of infrastructure, AI systems, and DevOps. In this role, you will architect and scale the backbone of our AI Platform: Ensuring high availability, low latency, and seamless integration of machine learning capabilities into production. You will own the microservices that power AI inference, build robust multi-tenant infrastructure, and support our Data & AI team with production-grade DevOps practices.

Your responsibilities:

Design, build, and maintain Go microservices that handle AI model inference, data processing pipelines, and real-time streaming workflows.
Architect scalable APIs (gRPC/REST) that serve as the bridge between AI models and production applications.
Own the Kubernetes infrastructure (EKS), including deployments, autoscaling policies, service mesh, and cluster health monitoring.
Implement service-to-service communication using gRPC and message queues (RabbitMQ/SQS) for asynchronous processing.
Integrate with cloud AI services (AWS Bedrock, OpenAI, Anthropic) and manage model serving infrastructure.
Build multi-tenant capabilities including authentication (JWT/JWKS), rate limiting, usage tracking, and tenant isolation.
Partner with the Data & AI team to productionize machine learning models—wrapping them in production-ready services with proper health checks, circuit breakers, and graceful degradation.
Build comprehensive observability: structured logging, metrics (Prometheus), distributed tracing (Jaeger/Tempo), and alerting.
Implement CI/CD pipelines and infrastructure-as-code (Terraform) for automated deployments and disaster recovery.
Ensure high availability through proper monitoring, incident response, and post-mortem analysis.
Optimize resource utilization for GPU workloads and cost-efficient scaling strategies.

Your profile

Go Expertise: 3+ years of professional Go development experience with strong understanding of concurrency patterns, interfaces, channels, and error handling.
Kubernetes Production Experience: 3+ years managing production Kubernetes clusters, including deployments, services, ingress controllers, resource management, and troubleshooting.
Distributed Systems Knowledge: Deep understanding of CAP theorem, eventual consistency, idempotency, circuit breakers, and fault-tolerant design.
gRPC & Async Messaging: Hands-on experience with gRPC/Protocol Buffers and message queues (RabbitMQ, SQS, Kafka) in production systems.
Cloud Platform Experience: Strong experience with AWS services (EKS, S3, DynamoDB, Lambda) or equivalent cloud providers.
DevOps Mindset: Experience with Docker, CI/CD pipelines, infrastructure-as-code, and GitOps workflows.
Spoken language: You communicate confidently in English (C1 level); German skills are a plus.

Why us?

A responsible task with meaning: We build software to digitize the social care sector and thus enable our customers to focus on a better life for their clients by giving them more time for care & support
A remote working time model to keep your everyday life flexible
Exciting, challenging tasks in a dynamic, future-oriented environment
A culture of appreciation and a harmonious working atmosphere in a growing, international company with opportunities to get involved
A creative working environment, flat hierarchies and short decision-making processes
Attractive remuneration models, a permanent employment contract

contact information

If this sounds like you, we look forward to receiving your application including your earliest possible start date, through our online application form!

Auf diese Stelle bewerben

About us

Welcome to myneva - together, we shape digital care.

myneva is one of the leading European software providers for the social sector. Our solutions focus on shaping the world around our clients and their needs. By digitising processes, we help care givers gain more time to support their clients, enabling them to enjoy a better quality of life.    

As an ambitious team, we are pursuing increasing internationalisation and a clear mission to become #1 in Europe.

Deine Mission

Wir suchen einen Go Platform Engineer, der sich im Spannungsfeld von Infrastruktur, KI-Systemen und DevOps zuhause fühlt. In dieser Rolle entwirfst und skalierst du das technische Fundament unserer AI-Plattform und stellst hohe Verfügbarkeit, niedrige Latenzzeiten sowie die nahtlose Integration von Machine-Learning-Funktionalitäten in Produktionsumgebungen sicher. Du verantwortest die Microservices, die KI-Inferenz ermöglichen, baust eine robuste Multi-Tenant-Infrastruktur auf und unterstützt unser Data & AI Team mit produktionsreifen DevOps-Praktiken.

Deine Aufgaben:

Du konzipierst, entwickelst und betreibst Go-Microservices für KI-Modell-Inferenz, Datenverarbeitungspipelines und Echtzeit-Streaming-Workflows
Du architekturierst skalierbare APIs (gRPC/REST), die als Brücke zwischen KI-Modellen und produktiven Anwendungen dienen
Du verantwortest die Kubernetes-Infrastruktur (EKS), einschließlich Deployments, Autoscaling-Strategien, Service Mesh und Cluster-Monitoring
Du implementierst Service-zu-Service-Kommunikation mittels gRPC und Message Queues (RabbitMQ/SQS) für asynchrone Verarbeitung
Du integrierst Cloud-KI-Services (AWS Bedrock, OpenAI, Anthropic) und verwaltest die Model-Serving-Infrastruktur
Du entwickelst Multi-Tenant-Funktionalitäten wie Authentifizierung (JWT/JWKS), Rate Limiting, Usage Tracking und Mandantentrennung
Du arbeitest eng mit dem Data & AI Team zusammen, um Machine-Learning-Modelle produktionsreif zu machen – einschließlich Health Checks, Circuit Breakern und Graceful Degradation
Du etablierst umfassende Observability-Konzepte: strukturiertes Logging, Metriken (Prometheus), Distributed Tracing (Jaeger/Tempo) und Alerting
Du implementierst CI/CD-Pipelines und Infrastructure-as-Code (Terraform) für automatisierte Deployments und Disaster-Recovery-Szenarien
Du stellst hohe Verfügbarkeit durch Monitoring, Incident Response und strukturierte Post-Mortem-Analysen sicher
Du optimierst die Ressourcennutzung für GPU-Workloads und entwickelst kosteneffiziente Skalierungsstrategien

Dein Profil

Go-Expertise: Mindestens 3 Jahre professionelle Erfahrung in der Go-Entwicklung mit fundiertem Verständnis von Concurrency-Patterns, Interfaces, Channels und Fehlerbehandlung
Kubernetes-Erfahrung in Produktionsumgebungen: Mindestens 3 Jahre Erfahrung im Betrieb produktiver Kubernetes-Cluster, einschließlich Deployments, Services, Ingress-Controllern, Ressourcenmanagement und Troubleshooting
Kenntnisse verteilter Systeme: Tiefes Verständnis des CAP-Theorems, von Eventual Consistency, Idempotenz, Circuit Breakern und fehlertoleranten Architekturen
gRPC & Asynchrone Kommunikation: Praktische Erfahrung mit gRPC/Protocol Buffers und Message Queues (RabbitMQ, SQS, Kafka) in Produktionssystemen
Cloud-Plattform-Erfahrung: Fundierte Erfahrung mit AWS-Services (EKS, S3, DynamoDB, Lambda) oder vergleichbaren Cloud-Plattformen
DevOps-Mindset: Erfahrung mit Docker, CI/CD-Pipelines, Infrastructure-as-Code und GitOps-Workflows
Du kommunizierst sicher auf Englisch (C1); Deutschkenntnisse sind ein Plus

Warum myneva?

Eine verantwortungsvolle Aufgabe mit Sinn: Unsere Software-Produkte ermöglichen unseren Kunden den Fokus auf ein besseres Leben der Klient:innen durch mehr Zeit für Pflege & Betreuung
Spannende, herausfordernde Aufgaben in einem dynamischen, zukunftsorientierten Umfeld
Flexible Arbeitszeiten sowie ein hybrides Arbeitsmodell, um Deinen Alltag weiterhin flexibel zu gestalten
Eine Kultur der Wertschätzung und ein harmonisches Arbeitsklima in einem wachsenden, internationalen Unternehmen mit Möglichkeiten, sich einzubringen
Ein kreatives Arbeitsumfeld, flache Hierarchien und kurze Entscheidungswege
Attraktive Vergütungsmodelle, ein unbefristeter Arbeitsvertrag sowie Arbeitgeberzuschuss für deine betriebliche Altersvorsorge
Zugang zu Corporate Benefits, JobRad-Leasing und weitere Mobilitätsangebote wie das komplettfinanzierte Deutschlandticket

Kontaktinformationen

Wenn das nach dir klingt, freuen wir uns über deinen CV inkl. Angabe deiner Gehaltsvorstellung und dem nächstmöglichen Eintrittsdatum über unser Online-Bewerbungsformular!

Auf diese Stelle bewerben

Über uns

Willkommen bei myneva - gemeinsam gestalten wir die digitale Pflege.

myneva gehört zu den führenden europäischen Softwareanbietern für den sozialen Sektor. Unsere Lösungen konzentrieren sich darauf, die Welt um unsere Klient:innen und deren Bedürfnisse zu gestalten. Durch die Digitalisierung von Prozessen helfen wir Sozialeinrichtungen, mehr Zeit für die Unterstützung ihrer Klient:innen zu gewinnen, um ihnen ein bessere Lebensqualität zu ermöglichen.    

Als ambitioniertes Team verfolgen wir zunehmende Internationalisierung und die klare Mission #1 in Europa zu werden.

Suas tarefas

Estamos à procura de um Go Platform Engineer que se destaque na interseção entre infraestrutura, sistemas de IA e DevOps. Nesta função, será responsável por arquitetar e escalar a espinha dorsal da nossa AI Platform: garantindo alta disponibilidade, baixa latência e integração perfeita das capacidades de machine learning em produção. Será responsável pelos microsserviços que alimentam a inferência de IA, construirá uma infraestrutura robusta multi-tenant e apoiará a equipe de Data & AI com práticas de DevOps de nível de produção.

Responsabilidades:

Projetar, desenvolver e manter microsserviços em Go que gerenciem inferência de modelos de IA, pipelines de processamento de dados e fluxos de streaming em tempo real.
Arquitetar APIs escaláveis (gRPC/REST) que funcionem como ponte entre os modelos de IA e aplicações em produção.
Gerir a infraestrutura Kubernetes (EKS), incluindo deployments, políticas de autoscaling, service mesh e monitoramento da saúde do cluster.
Implementar comunicação entre serviços usando gRPC e filas de mensagens (RabbitMQ/SQS) para processamento assíncrono.
Integrar-se com serviços de IA em nuvem (AWS Bedrock, OpenAI, Anthropic) e gerenciar a infraestrutura de serving de modelos.
Construir funcionalidades multi-tenant, incluindo autenticação (JWT/JWKS), limitação de taxa, rastreamento de uso e isolamento de tenants.
Colaborar com a equipe de Data & AI para colocar modelos de machine learning em produção, encapsulando-os em serviços prontos para produção com health checks, circuit breakers e degradação graciosa.
Criar observabilidade completa: logging estruturado, métricas (Prometheus), tracing distribuído (Jaeger/Tempo) e alertas.
Implementar pipelines CI/CD e infraestrutura como código (Terraform) para deployments automatizados e recuperação de desastres.
Garantir alta disponibilidade por meio de monitoramento adequado, resposta a incidentes e análises post-mortem.
Otimizar o uso de recursos para workloads em GPU e estratégias de escalonamento custo-eficientes.

Seu perfil

Especialização em Go: Mais de 3 anos de experiência profissional em desenvolvimento com Go, com sólido conhecimento de padrões de concorrência, interfaces, canais e tratamento de erros.
Experiência em Kubernetes em Produção: Mais de 3 anos gerenciando clusters Kubernetes em produção, incluindo deployments, serviços, ingress controllers, gerenciamento de recursos e troubleshooting.
Conhecimento em Sistemas Distribuídos: Compreensão profunda do teorema CAP, consistência eventual, idempotência, circuit breakers e design tolerante a falhas.
gRPC e Mensageria Assíncrona: Experiência prática com gRPC/Protocol Buffers e filas de mensagens (RabbitMQ, SQS, Kafka) em sistemas de produção.
Experiência em Plataformas de Nuvem: Forte experiência com serviços AWS (EKS, S3, DynamoDB, Lambda) ou provedores de nuvem equivalentes.
Mentalidade DevOps: Experiência com Docker, pipelines CI/CD, infraestrutura como código e fluxos de trabalho GitOps.
Idioma Falado: Comunicação confiante em inglês (nível C1); conhecimentos de alemão são um diferencial.

Porque nós?

Uma tarefa responsável e com significado: criamos software para digitalizar o setor de assistência social e, assim, permitir que os nossos clientes se concentrem em proporcionar uma vida melhor aos seus clientes, dando-lhes mais tempo para cuidar e apoiar
Um modelo de trabalho remoto para manter a sua vida quotidiana flexível
Tarefas emocionantes e desafiadoras num ambiente dinâmico e voltado para o futuro
Uma cultura de valorização e um ambiente de trabalho harmonioso numa empresa internacional em crescimento, com oportunidades de envolvimento.
Um ambiente de trabalho criativo, hierarquias planas e processos de tomada de decisão rápidos.
Modelos de remuneração atrativos, um contrato de trabalho permanente.

Informação de contacto

Se você se identifica com esta descrição, aguardamos o seu currículo, incluindo a data mais próxima possível para início, através do nosso formulário de candidatura online!

Auf diese Stelle bewerben

Sobre nós

Auf diese Stelle bewerben

Wir freuen uns auf dich!

Wir freuen uns über dein Interesse an der myneva Group. Bitte fülle das folgende kurze Formular aus. Solltest du Schwierigkeiten mit dem Upload deiner Daten haben, wende dich gerne per Email an jobs@myneva.eu