MA

Senior Site Reliability Engineer

Manychat
Amsterdam, Netherlandsfull_timePosted 25 Mar 2026

About the role

<p><strong>WHO WE ARE 🌍</strong></p> <p>We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement, more sales, and real, sustainable growth.</p> <p>With a diverse team of 350+ people spread across three continents, we’re building the leading Chat Marketing platform that is used — and loved — by more than 1.5 million customers worldwide.</p> <p><br><span class="notion-enable-hover" data-token-index="0"><strong>WHO WE'RE LOOKING FOR </strong>🌟</span></p> <p>We’re looking for a Senior Site Reliability Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high-impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform.</p> <p>We need you not just to maintain but to rethink and evolve our infrastructure, balancing hands-on operations with strategic improvements that future-proof our growing AI product landscape.<br><br>You’ll take over key responsibilities from our current Infra Lead who is transitioning to a software-focused role, giving you immediate ownership and space to shine.<br><br><strong>WHY THE ROLE IS SPECIAL</strong> <span class="notion-enable-hover" data-token-index="0">💡</span></p> <p>You won’t be a cog in a massive SRE org. You’ll be the bridge between Infrastructure and Engineering, shaping how we scale Kubernetes, how we approach platform reliability, and how developers ship fast without fear. You’ll get autonomy, ownership, and a smart, humble team excited to learn with you.<br><br><strong>WHAT YOU’LL DO 🤖</strong></p> <ul> <li>Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)</li> <li>Operate and evolve our EKS clusters powering Python-based AI services</li> <li>Migrate existing services to Kubernetes using Terraform and Helm</li> <li>Codify infrastructure with Terraform and manage host-level automation via Ansible</li> <li>Build and improve CI/CD pipelines with GitHub Actions</li> <li>Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness</li> <li>Support OS-level patching, certs, WAF rules, and general infra hygiene</li> <li>Partner with engineers to guide best practices and drive platform reliability</li> <li>Create clean, maintainable infrastructure documentation and playbooks</li> <li>Occasionally support rare off-hours incidents (don’t worry, really rare)</li> </ul> <h4>TO SHINE IN THIS ROLE 💥</h4&

Apply for this role

Generate a tailored application kit with a matched cover letter, interview prep, and CV highlights — in under 60 seconds.

Generate Application Kit

Free account required — sign up in 30s

Company

Manychat

View all open roles →