DevOps Architect
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Worldwide
Fully-remote
full-time (40 hrs/week)
Flexible schedule
Long-term role

DevOps Architect   $100,000 USD/year

Description

You're the engineer who maintains uptime for 50+ SaaS products when others are still figuring things out. We need DevOps architects who can enter unknown AWS environments, restore stability, and drive availability beyond 99.9% through effective monitoring, automation, and rigorous root cause analysis. You'll break down complex projects into daily deliverables, deploy production-ready Python or JavaScript, and leverage AI as a force multiplier.

While most organizations tout "cloud expertise" while manually nursing their infrastructure, we're scaling reliability across dozens of acquisitions where original engineers have departed and documentation is incomplete. The challenge is clear: you'll use agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate them so repeat incidents are eliminated. Rather than judging you on certifications and tool badges, we'll observe how you troubleshoot in real time, produce a genuine 5-Whys analysis that identifies one actionable root cause, and construct automations that withstand production conditions.

This is not an L2 "follow the script" position. Here, you author the runbooks, architect the deployment pipeline from dev through staged to 10% and full rollout with soak periods and rollback logic, and create the monitoring that surfaces edge-case failures. You block risky changes before they go live. You distinguish infrastructure issues you own from application bugs that belong to Engineering, and you route permanent fixes to the appropriate team.

You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response with RCAs, and change tickets supported by copy-paste-executable instructions. If you've already operated a production SaaS platform and want to apply that rigor across a portfolio, join us. Bring advanced AWS knowledge, production-quality coding skills, strict scope discipline, and daily, essential use of AI tooling. If you're prepared to ensure continuous operations, apply now.

What you will be doing

  • Lead complex infrastructure migrations, consolidation efforts, production-grade automation development, and monitoring enhancements
  • Triage live production incidents, deploy immediate remediation, and author root cause analyses with permanent corrective actions assigned to owning teams
  • Author, review, and implement production changes, including assessing whether proposed changes meet safety criteria for execution

What you will NOT be doing

  • Spending hours in Jira and recurring status syncs - we prioritize people who deliver solutions, not just monitor tasks
  • Keeping legacy systems running forever - you'll be empowered to implement substantive enhancements
  • Waiting on multilayer approval processes - you'll possess the authority to deploy immediate fixes during incidents

Key responsibilities

  • Lead reliability and standardization initiatives for cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Candidate requirements

  • Advanced AWS infrastructure knowledge (this is our core platform - experience with other clouds alone is insufficient)
  • Track record managing production infrastructure running hundreds of containers
  • Proficiency scripting in Python and Bash for routine administrative tasks
  • Experience administering and migrating production databases across multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Hands-on experience with infrastructure automation tools (Terraform, Ansible, or CloudFormation)

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Pass
proctored test.
STEP 5

Pass
proctored test.

Accept job offer.
STEP 6

Accept job offer.

Frequently asked questions

About the role

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of  AI first Remote WorkersAI-first remote workers.