Datasite

Head of Reliability

USA - NY - New York City - BlueFlame AI

Full time

Datasite and its associated businesses are the global center for facilitating economic value creation for companies worldwide, from data rooms to AI deal sourcing and more. Here you’ll find the finest technological pioneers: Datasite, Blueflame AI, Firmex, Grata, and Sherpany. Collectively, they define the future of business growth.

Apply for one position or as many as you like. Talent doesn’t always just go in one direction or fit in a single box. We’re happy to see whatever your superpower is and find the best place for it to flourish.

Get started now. We look forward to meeting you.

Job Description:

Blueflame AI for Datasite is looking for a Head of Reliability to own reliability, quality, and release assurance across the entire Blueflame AI platform.

This is not a support role — it’s a technical leadership position that combines QA and platform reliability ownership to ensure that every feature shipped is tested, stable, and trustworthy.

You’ll manage the reliability roadmap, set quality standards, and work closely with our engineering and product teams to make reliability a priority in everything we build.

Key Responsibilities

Quality Assurance (QA) Ownership

  • Lead the QA function — defining frameworks, tooling, and processes for automated and manual testing.
  • Ensure every release meets strict reliability and data integrity standards.
  • Work with engineering to build and maintain CI/CD-integrated test automation for frontend, backend, and model workflows.
  • Partner with product managers to define acceptance criteria, regression suites, and go/no-go release thresholds.

Reliability & Platform Resilience

  • Define and own Blueflame’s reliability strategy — uptime, latency, and system integrity across core services (API, search, context engine, data integrations).
  • Establish and manage SLOs/SLIs with engineering squads, ensuring proactive monitoring and error budgeting.
  • Review architectural designs for resilience, scalability, and recoverability.
  • Implement and manage monitoring and alerting across our platform, including within AWS.
  • Oversee the observability stack and monitoring pipelines (logs, metrics, traces, dashboards).
  • Establish real-time performance insights and alerting mechanisms.

Release Assurance & Continuous Improvement

  • Implement consistent release and rollback processes across environments.
  • Manage release readiness reviews and reliability audits.
  • Work with the support team on post-incident reviews and the implementation of long-term fixes.

Leadership & Culture

  • Build and lead a small, high-impact reliability engineering and QA team.
  • Champion quality-by-design principles within all engineering squads.
  • Assist with SOC 2 readiness.

Requirements

  • 8+ years in reliability, QA, or platform engineering roles, including 1+ years in a management role.
  • Strong experience designing and running QA and automated testing frameworks within CI/CD pipelines.
  • Hands-on experience with AWS cloud infrastructure and observability tools, including Datadog and the ELK stack.
  • Familiarity with LLM or AI-driven systems is a plus (especially testing non-deterministic or probabilistic outputs).
  • Track record of improving uptime, release quality, and user trust in production environments.
  • Excellent collaboration skills — able to work across Product, Engineering, and Security functions.

The base salary range represents the estimated low and high end for this position based on a good faith assessment of the role and market data at the time of posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on, but not limited to, your geographic region, skills, qualifications, and experience, along with the requirements of the position. This position may be eligible for bonuses, commissions, or overtime if applicable. Benefits include health insurance (medical, dental, vision), a retirement savings plan, paid time off, and other employee benefits. Specific details will be provided during the interview process. Datasite reserves the right to modify this pay range at any time.

$141,000.00 - $248,000.00

Our company is committed to fostering a diverse and inclusive workforce where all individuals are respected and valued. We are an equal opportunity employer and make all employment decisions without regard to race, color, religion, sex, gender identity, sexual orientation, age, national origin, disability, protected veteran status, or any other protected characteristic. We encourage applications from candidates of all backgrounds and are dedicated to building teams that reflect the diversity of our communities.