Staff Site Reliability Engineer – Production Engineering

Other Jobs To Apply

No other job posts for this day.

Job Description:

  • Define and evolve Dropbox’s company-wide technical reliability strategy to support the changing engineering environment created by AI-assisted and agentic software development.
  • Set multi-year reliability goals, standards, and roadmaps across observability, debugging, incident management, service health, and operational readiness.
  • Lead cross-team initiatives that reduce reliability risk as software delivery velocity, pull request volume, service complexity, and incident volume increase.
  • Partner with engineering leaders and platform teams to improve monitoring, alerting, debugging, SLOs, SLAs, and incident response systems at company scale.
  • Identify emerging reliability risks introduced by AI-enabled development workflows and design scalable systems, processes, and guardrails to mitigate them.
  • Provide technical leadership and mentorship to engineers across teams, raising engineering quality, reliability judgment, and operational excellence.
  • Drive clear communication and alignment with senior stakeholders on reliability priorities, tradeoffs, risks, and execution progress.

Requirements:

  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience.
  • 12+ years of experience in software engineering, site reliability engineering, infrastructure engineering, or related technical roles.
  • Proven ability to define and deliver multi-year, multi-team reliability, infrastructure, or platform strategies with measurable business and customer impact.
  • Deep experience with distributed systems, production operations, observability, incident response, SLOs/SLAs, debugging, and reliability risk management.
  • Demonstrated ability to diagnose complex technical problems, debug production systems, automate operational workflows, and design resilient software components.
  • Experience influencing engineering roadmaps across multiple teams and making technical decisions that optimize for the broader engineering organization.
  • Strong communication and collaboration skills, with the ability to align cross-functional stakeholders through ambiguity and drive execution across teams.

Benefits:

    Back to blog