Staff Platform Engineer

Remote
Full Time
Senior Manager/Supervisor

About the Role


We’re hiring a Staff Platform Engineer to join Manifest, a new product being built in a high-autonomy, fast-moving environment.

This is a hands-on, staff-level role for someone who can own critical infrastructure, improve the developer experience, and partner closely with product engineers, DevOps leadership, and technical leads. We’re looking for someone who can not only operate production systems but also design the guardrails, patterns, and platform capabilities that allow the team to move faster and more safely over time.

This role is a strong fit for someone who enjoys working close to the product team, understands the realities of building in a startup-like environment, and can bring structure, reliability, and technical depth to a fast-moving team.

What You’ll Do

  • Work on a team with two other platform engineers.
  • Own and evolve the infrastructure that supports Manifest, including AWS environments, networking, compute, data services, observability, CI/CD, and operational tooling.
  • Work with Pulumi and TypeScript to define, maintain, and improve infrastructure as code across the platform.
  • Support and improve our containerized application platform, including deployment pipelines, rollback mechanisms, and runtime configuration.
  • Help operate and harden our data infrastructure, including connection pooling, backups, disaster recovery, replication, and safe schema-change practices.
  • Partner with engineers to improve the reliability and safety of releases, including database migrations, deployment workflows, environment management, and production readiness checks.
  • Improve CI/CD workflows so that builds, tests, infrastructure changes, and deployments are fast, reliable, and easy for engineers to understand.
  • Lead observability and incident readiness work, including alerting, dashboards, SLOs, runbooks, incident response practices, and post-incident follow-up.
  • Help ensure the platform is secure, cost-conscious, and maintainable as the product scales.
  • Mentor engineers on infrastructure, operations, reliability, and production ownership.

What We’re Looking For

We’re looking for someone who has operated meaningful production systems and can bring staff-level judgment to infrastructure, reliability, and developer experience.

Strong candidates will have:

  • Deep production experience with AWS, especially services such as ECS/Fargate, RDS/Aurora PostgreSQL, VPC networking, load balancing, IAM, KMS, Secrets Manager, CloudFront, WAF, and related managed services.
  • Experience designing and operating systems that serve a global user base, including multi-region availability and disaster recovery procedures.
  • A habit of treating reliability, scalability, performance, and observability as first-class design constraints, built into designs from the start rather than bolted on later.
  • Strong infrastructure-as-code experience. Pulumi with TypeScript is ideal, but deep experience with Terraform or another mature IaC approach is also valuable.
  • Strong operational knowledge of PostgreSQL, including performance investigation, connection pooling, backups, replication, locking, migrations, and safe schema-change patterns.
  • Experience designing and maintaining CI/CD systems, ideally with GitHub Actions, OIDC-based cloud authentication, container builds, environment promotion, required checks, and deployment gates.
  • Experience supporting containerized production workloads and improving deployment safety, rollback strategies, and runtime reliability.
  • Strong observability and incident response experience, including metrics, logs, traces, alerting, dashboards, runbooks, and post-incident learning.
  • The ability to work effectively in ambiguity, make pragmatic tradeoffs, and communicate clearly with both infrastructure specialists and product engineers.
  • A track record of raising the engineering bar through reusable patterns, documentation, automation, mentoring, and thoughtful technical leadership.


Our Environment

Manifest operates with a lean process and a high degree of ownership. Engineers are expected to work effectively in ambiguity, clarify requirements, collaborate directly across functions, and ship pragmatic, high-quality solutions.

The DevOps function is critical to that operating model. We need to move quickly, but we do not want speed to come at the expense of reliability, security, or maintainability, so resilient, well-planned infrastructure is essential. This role exists to help Manifest find that balance as the product moves toward launch and scale.

You’ll work closely with product engineers, technical leads, DevOps leadership, and other stakeholders to ensure the platform is ready for real customers, real traffic, and real operational demands.

Why Join

This is an opportunity to help shape the foundation for a new product at an important stage.

You’ll be joining early enough to have real influence over how Manifest operates, deploys, scales, and responds to incidents. You’ll work on meaningful infrastructure problems, partner with a highly autonomous engineering team, and help define the standards that will carry the product into production and beyond.

If you’re excited by the combination of hands-on infrastructure work, production reliability, developer experience, and staff-level technical leadership, we’d love to talk.


 