Senior DevOps Engineer
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Not accepting applications on crossover.com at this time.

Description

You're the engineer who maintains uptime across 50+ SaaS products when nobody else knows where to start. We need DevOps professionals capable of entering unknown AWS environments, restoring order, and driving availability beyond 99.9% through genuine monitoring, automation, and root cause analysis. You'll break down complex projects into single-day increments, deliver production-ready Python or JavaScript, and leverage AI as your assistant.

Most organizations talk about "cloud infrastructure" while manually tending servers. We're systematizing reliability across a portfolio of acquired products whose original teams have departed and whose documentation is incomplete. That's where the challenge lies: you'll harness agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate solutions so recurring failures become impossible. Rather than judge you on certifications and vendor badges, we'll observe how you troubleshoot in real time, author a genuine 5-Whys that identifies one actionable root cause, and construct automations that endure in production.

This is not a tier-two "follow the runbook" position. Here, you author the runbooks, architect the deployment path from development through staging to 10% rollout to full release with soak periods and rollback conditions, and create the monitoring that captures corner cases. You block risky changes before execution. You distinguish between infrastructure failures under your ownership and application bugs owned by Engineering, then route permanent remediation to the correct team.

You'll operate at the engineering center of reliability, taking charge of infrastructure initiatives, incident response with RCAs, and change requests accompanied by copy-paste-ready runbooks. If you've already operated a substantial SaaS platform and want to apply that expertise across an entire fleet, join us. Bring deep AWS knowledge, production-quality coding ability, strict scope discipline, and daily, mission-critical use of AI tooling. If you're prepared to ensure continuous operation, please apply.

What you will be doing

  • Advanced infrastructure migrations, consolidations, production-quality automations, and monitoring enhancements
  • Investigating production incidents, deploying immediate remediations, and authoring root cause analyses that assign permanent fixes to responsible teams
  • Drafting, reviewing, and implementing production changes, including evaluating the safety of proposed modifications before execution

What you will NOT be doing

  • Spending time in Jira and perpetual status updates—we value individuals who deliver solutions, not merely document issues
  • Preserving legacy systems without end—you'll be authorized to pursue substantive improvements
  • Waiting on bureaucratic approval processes—you'll possess the authority to implement immediate fixes during incidents

Key responsibilities

  • Advance reliability and consistency of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Candidate requirements

  • Extensive AWS infrastructure knowledge (this is our core platform—experience with other clouds alone is insufficient)
  • Track record managing production infrastructure at a scale of hundreds of containers
  • Proficiency scripting in Python and Bash for routine administrative tasks
  • Experience administering and migrating production databases across multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Hands-on experience with infrastructure automation tools (Terraform, Ansible, or CloudFormation)

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Pass
proctored test.
STEP 5

Pass
proctored test.

Accept job offer.
STEP 6

Accept job offer.

Frequently asked questions

About the role

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of  AI first Remote WorkersAI-first remote workers.