Senior DevOps Engineer
$100,000 USD/year. Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks.

Worldwide
Fully-remote
Full-time (40 hrs/week)
Flexible schedule
Long-term role

Description

You're the engineer who maintains uptime for 50+ SaaS products while others are still figuring things out. We need DevOps engineers who can step into unknown AWS environments, restore order from disorder, and drive availability beyond 99.9% through genuine monitoring, genuine automation, and genuine root cause analysis. You'll break down complex projects into single-day increments, deliver production-ready Python or JavaScript, and leverage AI as your assistant.

Most organizations talk about "cloud" while manually tending servers. We're building industrial-scale reliability across dozens of acquired products where the founding engineers have departed and the documentation is incomplete. That's where it gets interesting: you'll employ agents and contemporary tools to understand unfamiliar systems 5–10x faster, document your findings, and automate solutions so the same incident never recurs. Rather than judging you on certifications and vendor badges, we'll observe you troubleshoot in real time, produce an actual 5-Whys that identifies one preventable root cause, and construct automations that withstand production conditions.

This is not a tier-2 "follow the runbook" position. In this role, you author the runbooks, design the deployment path from development to staging to 10% to full rollout with soak periods and rollback conditions, and create the monitors that detect corner cases. You block risky changes before someone executes them. You distinguish infrastructure failures that you control from application bugs that Engineering controls, and you route permanent remediation to the appropriate team.

You'll operate at the engineering center of reliability, taking ownership of infrastructure initiatives, incident management and root cause analyses, and change requests accompanied by copy-paste-executable runbooks. If you've previously owned a substantial SaaS product and wish to apply that expertise to an entire fleet, join us. Bring expert AWS knowledge, production-quality coding ability, disciplined scope management, and daily, essential use of AI tooling. If you're prepared to maintain operational continuity, please apply.

What you will be doing

  • Delivering advanced infrastructure migrations, consolidations, production-quality automations, and monitoring modifications
  • Diagnosing production incidents, deploying immediate remediation, and authoring root cause analyses with permanent solutions assigned to responsible teams
  • Designing, reviewing, and implementing production changes, including assessing whether a proposed change is safe to deploy

What you will NOT be doing

  • Spending your time in Jira and continuous status meetings - we value individuals who can deliver solutions, not simply document problems
  • Supporting legacy systems forever - you'll be authorized to implement substantial improvements
  • Waiting on bureaucratic approval processes - you'll possess the authority to deploy immediate remediation to address incidents

Key responsibilities

  • Lead reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Candidate requirements

  • Extensive AWS infrastructure expertise (this is our core platform - experience with other clouds alone won't suffice)
  • Track record owning substantial production infrastructure and resolving production outages autonomously (not merely executing a runbook)
  • Track record scripting with Python and Bash for routine administrative operations
  • Track record managing and migrating production databases across multiple engines (including MySQL, PostgreSQL, Oracle, and Microsoft SQL Server)
  • Track record with infrastructure automation (Terraform, Ansible, or CloudFormation)
  • Linux systems administration expertise

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

  • STEP 1: Chat-style screening interview.
  • STEP 2: Cognitive aptitude test.
  • STEP 3: Prove real-world job skills.
  • STEP 4: Interview with the hiring manager.
  • STEP 5: Pass proctored test.
  • STEP 6: Accept job offer.

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Join the world's largest community of AI-first remote workers.