You're the engineer who maintains uptime across 50+ SaaS products while others are still figuring out the basics. We need DevOps professionals capable of exploring unknown AWS environments, restoring order from instability, and driving availability beyond 99.9% through real monitoring, automation, and root-cause analysis. You'll break down complex initiatives into single-day increments, deliver production-ready Python or JavaScript, and leverage AI as your assistant.
While most organizations talk about "the cloud" and hand-hold legacy systems, we're scaling reliability across an entire portfolio of acquired products, often without the original engineers or complete documentation. The challenge: you'll employ agents and contemporary tooling to explore unfamiliar systems 5–10× faster, document your findings, and automate solutions so recurring failures are eliminated. Rather than judging you on certifications and vendor badges, we'll observe how you diagnose issues in real time, author a genuine 5-Whys analysis that identifies a single preventable root cause, and construct automations that hold up under production load.
This isn't an L2 "execute the runbook" position. Here, you author the runbooks, architect the deployment path from development through staging to canary and full release—with soak periods and rollback criteria—and create the monitors that surface corner cases. You block risky changes before deployment. You distinguish infrastructure failures you control from application bugs owned by Engineering, and you route permanent remediation to the correct team.
You'll operate at the engineering heart of reliability, managing infrastructure initiatives, incident triage and RCAs, and change requests backed by copy-paste-ready runbooks. If you've previously owned a critical SaaS platform and want to extend that rigor across a fleet, join us. Bring advanced AWS expertise, production-quality development skills, strict scope discipline, and daily, essential use of AI tooling. If you're prepared to safeguard uptime, please apply.
Crossover's skill assessment process combines innovative AI power with decades of human research to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.
It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.
Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.
We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.