← Back to results

Senior Site Reliability Engineer - Hiring Sprint

Join Airbyte as a Senior Site Reliability Engineer to enhance infrastructure and reliability for a data replication platform.

Location
San Francisco, CA, USA
Compensation
Not disclosed
Level
senior
Type
full time

AI in the day-to-day

Engineers actively use AI as a force multiplier to automate toil, augment incident response, and build smarter internal tooling.

Requirements

Experience
7+ years

Benefits

Health Insurance 401k Match Parental Leave Commuter Benefits Education Budget Flexible PTO Meals Provided

Joblaze summary

In this role, the Senior Site Reliability Engineer will focus on maintaining and enhancing the infrastructure for Airbyte's Data Replication platform, ensuring reliability and efficiency across millions of sync jobs. Key skills include expertise in Kubernetes, Terraform, and observability tools, along with a strong understanding of CI/CD processes. This position is ideal for seasoned professionals with a background in infrastructure or DevOps who thrive in fast-paced, startup environments. The team emphasizes the use of AI to streamline operations and improve tooling.

Quick facts

Is the Senior Site Reliability Engineer - Hiring Sprint role remote?
No — this is an on-site role in San Francisco, CA, USA.
How much experience is required?
At least 7 years of relevant experience for this Senior Site Reliability Engineer - Hiring Sprint role.
Where is the role based?
Airbyte is hiring for this position in San Francisco, CA, USA.
What's the tech stack?
Joblaze extracted these technologies from the posting: Terraform, Datadog, AWS, Grafana, Prometheus, GCP.
What seniority level is this role?
Airbyte targets senior candidates for this position.
Is this full-time or contract?
Full-time for this Senior Site Reliability Engineer - Hiring Sprint role at Airbyte.

From the original posting

Airbyte is the data and action layer for AI agents. We give agents fast, accurate, authenticated access to business data across hundreds of sources, so they can discover the entities that matter, reason over real-time context, and take action in the systems they read from, not just observe them.

We started as the open-source standard for data movement and proved the economics of data integration at scale: hundreds of connectors, thousands of companies, and, since 2020, have raised $181M from leading investors including Benchmark, Accel, Altimeter, Coatue, and Y Combinator. As our CEO Michel Tricot puts it, "the last ten years were all about structured data. The future is all about context." We're now building that context infrastructure for production-grade agents on the same open foundation, as agents become the primary consumers of enterprise data.

Our mission is unchanged: make data available and actionable to everyone, everywhere. That everyone now includes AI agents.

Engineering Hiring Sprint:

We're growing our engineering team and are accelerating hiring through a focused Engineering Hiring Sprint. Rather than stretching interviews over several weeks, we're bringing exceptional candidates through an expedited process and making hiring decisions quickly.

 

Interview process:

  1. Apply

  2. Technical Take-Home (Java or Python)

  3. Hiring Manager Interview

  4. In-Person Onsite (the week of July 20)

  5. Hiring decision by the end of the week

 

We're hiring across multiple engineering teams, including:

  • ⚙️ Platform Engineers

  • 🗄️ Database Engineers

  • ☁️ Site Reliability Engineers

  • 🔌 Extensibility API Engineers

  • 🤖 AI Agents Engineers

  • 👤 Engineering Managers

If you enjoy solving complex technical problems, moving quickly, embracing AI, and taking ownership of your work, we'd love to meet you.

 
 

The Role:

You'll be the infrastructure and reliability engineer on the Data Replication team - a full-stack product team running over 3 million sync jobs a week powering thousands of data use cases across multiple regions and clouds. You’ll build and maintain the infrastructure, set reliability standards, drive down incidents, and make it easier and safer for engineers to ship through tooling. You're equally comfortable in a Terraform file, a Kubernetes cluster, and a postmortem doc.


We expect engineers here to actively use AI as a force multiplier - agentic tools to automate toil, augment incident response, and build smarter internal tooling. If you're not already doing this, you should be excited to start. We care as much about how you work as what you build. Trust, directness, and craftsmanship matter here.

 

What You’ll Do:

  • Own the infrastructure underpinning the Data Replication platform - Kubernetes clusters, CI/CD pipelines, secrets management, networking, and cloud resource configuration across AWS and GCP.

  • Partner with product engineers to reliably integrate product features with infrastructure.

  • Maintain and enhance observability, alerting, and anomaly detection with an eye towards LLM automation.

  • Maintain and enhance AI-augmented release and internal tooling: canary deployments, progressive rollouts, automated release qualification, and rollback automation - with an eye towards LLM automation.

  • Set the infrastructure bar for the team - build self-serve tooling, write runbooks, and coach engineers to own more of their stack.

 

What You’ll Need:

  • 7+ years in infrastructure, platform engineering, SRE, or DevOps.

  • Hands-on ownership of Kubernetes, Helm, and Terraform in production environments.

  • Deep experience with observability stacks (Prometheus, Grafana, Datadog) and on-call operations.

  • Experience with CI/CD pipeline ownership and developer tooling.

  • Ability & willingness to read backend code to understand how systems break and instrument them correctly.

  • Fluency with AI tools - LLMs and agentic frameworks to automate, debug faster, and reduce toil.

  • A startup-ready mindset: comfortable with ambiguity, moving fast, and owning problems end-to-end.

 

Nice To Have:

  • Data pipelines, replication systems, or ETL/ELT platforms.

  • Control plane / data plane architectures or internal developer platforms.

  • Experience with Airbyte, CDKs, or connector-based architectures.

 

Location:

  • Onsite 4 days/week in San Francisco, CA

Why You'll Love Working at Airbyte:

At Airbyte, we believe great work happens when people feel supported, trusted, and empowered to grow. Our market-leading Total Rewards package is designed to help you thrive professionally and personally. Our benefits and perks include:

  • Flexible PTO with a culture that encourages at least 25 days off annually

  • 16 weeks fully paid parental leave for all parents

  • Comprehensive medical, dental, and vision coverage for employees and dependents

  • 401(k) retirement plan

  • Professional development budget, conference sponsorship, and book reimbursement

  • Commuter benefits and monthly internet reimbursement

  • Breakfast and lunch in our San Francisco office

  • A collaborative, in-person culture focused on learning, growth, and impact

If you find this role exciting, we encourage you to apply even if you think you don’t meet all of the requirements!

We are not accepting agency submissions or recruiting firm support for this role. Unsolicited resumes will not be considered.

Airbyte is an equal opportunity employer that does not discriminate on the basis of actual or perceived race, creed, color, religion, national origin, ancestry, age, physical or mental disability, pregnancy, genetic information, sex, sexual orientation, gender identity or expression, marital status, familial status, domestic violence victim status, veteran or military status, or any other legally recognized protected basis under federal, state or local laws. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Airbyte is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. Please let us know if you need assistance or accommodations due to a disability.

Similar positions

Ramp
Applied AI Engineer
Ramp · New York, NY (HQ)
Block
Senior Site Reliability Engineer
Block · Bay Area, CA, United States of America
Block
Senior Site Reliability Engineer
Block · New York, NY, United States of America
Ramp
Technical Program Manager
Ramp · New York, NY (HQ)
Fivetran
People Strategy & Operations Lead - Contract
Fivetran · Oakland, California, United States, AMER