You're the engineer who ensures 50+ SaaS products stay online while others are still troubleshooting. We need DevOps professionals capable of stepping into unknown AWS environments, restoring stability, and driving uptime beyond 99.9% through robust monitoring, automation, and thorough root cause analysis. You'll break complex projects into single-day tasks, deliver production-ready Python or JavaScript, and leverage AI as a force multiplier.
Many organizations claim "cloud expertise" while manually nursing infrastructure. We're scaling reliability engineering across a portfolio of acquired products where original developers are gone and documentation is incomplete. The challenge: you'll deploy agents and current-generation tooling to explore unfamiliar systems 5–10× faster, document your findings, and automate solutions so incidents don't recur. Rather than judging you on certifications and vendor badges, we'll observe you troubleshoot in real time, author a genuine 5-Whys analysis that identifies a single preventable root cause, and construct automations that withstand production conditions.
This is not a tier-two "follow the runbook" position. Here, you author the runbooks, architect the deployment path from development through staged environments to a 10% rollout and then full rollout, with soak periods and rollback criteria at each stage, and create monitoring that detects corner cases. You reject risky changes before execution. You distinguish infrastructure failures you're accountable for from application bugs Engineering must resolve, and you route permanent remediation to the correct team.
You'll operate at the engineering center of reliability, responsible for infrastructure initiatives, incident management and root cause documentation, and change execution with copy-paste-ready runbooks. If you've previously owned a substantial SaaS product and want to extend that expertise across an entire fleet, join us. Bring expert AWS knowledge, production-grade development skills, disciplined scope management, and daily, mission-critical use of AI tooling. If you're prepared to maintain service continuity, please apply.
Crossover's skill assessment process combines innovative AI power with decades of human research to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.
It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.
Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.
We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.