Cloud Architect
$100,000 USD/year. Pay is set based on global value, not the local market. Most roles = hourly rate × 40 hrs × 50 weeks.

Worldwide
Fully-remote
Full-time (40 hrs/week)
Flexible schedule
Long-term role

Description

You're the engineer who ensures 50+ SaaS products stay online while others are still troubleshooting. We need DevOps professionals capable of stepping into unknown AWS environments, restoring stability, and driving uptime beyond 99.9% through robust monitoring, automation, and thorough root cause analysis. You'll break complex projects into single-day tasks, deliver production-ready Python or JavaScript, and leverage AI as a force multiplier.

Many organizations claim "cloud expertise" while manually nursing infrastructure. We're scaling reliability engineering across a portfolio of acquired products where original developers are gone and documentation is incomplete. The challenge: you'll deploy agents and current-generation tooling to explore unfamiliar systems 5–10× faster, document your findings, and automate solutions so incidents don't recur. Rather than judging you on certifications and vendor badges, we'll observe you troubleshoot in real time, author a genuine 5-Whys analysis that identifies a single preventable root cause, and construct automations that withstand production conditions.

This is not a tier-two "follow the runbook" position. Here, you author the runbooks, architect the deployment path from development through staged environments to 10% and full rollout with soak periods and rollback criteria, and create monitoring that detects corner cases. You reject risky changes before execution. You distinguish infrastructure failures you're accountable for from application bugs Engineering must resolve, and you route permanent remediation to the correct team.
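As a rough illustration of the deployment path described above (development, staged environments, 10%, then full rollout, with soak periods and rollback criteria), here is a minimal Python sketch. The stage names, soak durations, error-rate threshold, and the `get_error_rate` callback are all hypothetical and are not taken from any actual pipeline; a real implementation would wait out each soak period and read metrics from monitoring.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Stage:
    name: str           # hypothetical stage label
    traffic_pct: int    # share of production traffic on the new version
    soak_minutes: int   # how long to observe before promoting (not enforced here)


# Assumed stages, mirroring "development -> staged environments -> 10% -> full rollout".
STAGES: List[Stage] = [
    Stage("development", 0, 0),
    Stage("staging", 0, 30),
    Stage("canary-10", 10, 60),
    Stage("full", 100, 120),
]


def run_rollout(
    get_error_rate: Callable[[Stage], float],
    max_error_rate: float = 0.001,  # assumed rollback criterion
) -> List[str]:
    """Walk the stages in order and stop at the first stage whose
    observed error rate exceeds the rollback criterion."""
    log: List[str] = []
    for stage in STAGES:
        observed = get_error_rate(stage)  # stand-in for metrics read after the soak
        if observed > max_error_rate:
            log.append(f"rollback at {stage.name}")
            return log
        log.append(f"promoted past {stage.name}")
    return log
```

Calling `run_rollout` with a callback that reports a high error rate at the 10% stage stops the rollout there, so the change never reaches full traffic.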

You'll operate at the engineering center of reliability, responsible for infrastructure initiatives, incident management and root cause documentation, and change execution with copy-paste-ready runbooks. If you've previously owned a substantial SaaS product and want to extend that expertise across an entire fleet, join us. Bring expert AWS knowledge, production-grade development skills, disciplined scope management, and daily, mission-critical use of AI tooling. If you're prepared to maintain service continuity, please apply.

What you will be doing

  • Executing sophisticated infrastructure migrations, consolidations, production-quality automations, and monitoring enhancements
  • Responding to production incidents, deploying immediate remediation, and documenting root cause analyses with permanent corrective actions assigned to accountable teams
  • Authoring, reviewing, and implementing production changes, including assessing whether a proposed modification is safe for execution

What you will NOT be doing

  • Spending your day in Jira and recurring status calls - we prioritize individuals who deliver solutions, not just document issues
  • Keeping legacy systems running indefinitely - you'll be empowered to implement substantive upgrades
  • Waiting on bureaucratic approval processes - you'll possess the authority to deploy immediate fixes during incidents

Key responsibilities

  • Advance reliability and consistency of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Candidate requirements

  • Extensive AWS infrastructure expertise (this is our core platform - experience with other clouds alone is insufficient)
  • Experience operating production infrastructure at a scale of hundreds of containers
  • Experience writing scripts in Python and Bash for routine administrative tasks
  • Experience administering and migrating production databases across multiple engines (including MySQL, Postgres, Oracle, MS-SQL)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)

Meet a successful candidate

Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

STEP 1: Chat-style screening interview.
STEP 2: Cognitive aptitude test.
STEP 3: Prove real-world job skills.
STEP 4: Interview with the hiring manager.
STEP 5: Pass proctored test.
STEP 6: Accept job offer.

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Join the world's largest community of AI-first remote workers.