Cloud Architect
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Not accepting applications on crossover.com at this time.

Description

You're the engineer who keeps 50+ SaaS products running when everyone else is making educated guesses. We need DevOps engineers who can step into unknown AWS environments, restore order to chaotic systems, and drive uptime above 99.9% through real monitoring, real automation, and real root cause analyses. You'll break down complex projects into daily tasks, deliver production-ready Python or JavaScript, and leverage AI as your assistant.

Most organizations talk about "cloud" while manually tending servers. We're scaling reliability across dozens of acquired products where the original builders have departed and the documentation is incomplete. That's where it gets interesting: you'll leverage agents and contemporary tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate them so identical outages become impossible. Rather than judging you on certifications and vendor logos, we'll observe you troubleshoot in real time, produce a genuine 5-Whys that identifies one preventable root cause, and create automations that withstand production conditions.

This is not an L2 "follow the runbook" position. In this role, you author the runbooks, architect the deployment from dev to staged to 10% to 100% with soak periods and rollback criteria, and create the monitors that detect the corner cases. You reject risky changes before anyone executes them. You distinguish infrastructure failures you own from application bugs Engineering owns, and you route permanent fixes to the correct team.

You'll operate at the engineering center of reliability, owning infrastructure projects, incident response and RCAs, and change requests with copy-paste-executable runbooks. If you've already managed a substantial SaaS product and want to apply that discipline to an entire fleet, come join us. Bring expert-level AWS, production-grade coding skills, ruthless scope discipline, and daily, critical use of AI tools. If you're ready to keep the lights on, please apply.

What you will be doing

  • Complex infrastructure migrations, consolidations, production-grade automations, monitoring changes
  • Triaging production outages, implementing immediate fixes, and producing root cause analyses with permanent fixes assigned to the responsible teams
  • Creating, reviewing, and executing changes in production, including determining whether a proposed change is safe to execute

What you will NOT be doing

  • Living in Jira and endless status meetings - we value people who can drive solutions, not just track problems
  • Maintaining outdated systems indefinitely - you'll be empowered to drive meaningful improvements
  • Getting blocked by bureaucratic approval chains - you'll have the authority to execute immediate fixes to resolve incidents

Key responsibilities

  • Drive reliability and standardization of cloud infrastructure across our growing product portfolio by implementing robust monitoring, automation, and AWS best practices.

Candidate requirements

  • Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
  • Experience managing production infrastructure at a scale of hundreds of containers
  • Experience scripting with Python and Bash for day-to-day administration operations
  • Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Pass
proctored test.
STEP 5

Pass
proctored test.

Accept job offer.
STEP 6

Accept job offer.

Frequently asked questions

About the role

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of  AI first Remote WorkersAI-first remote workers.