We are building the infrastructure that moves money at scale, and we need a Senior Platform Engineer to help us get it right. In this role you will own the design, delivery, and reliability of the core developer platform that every engineering team in our company depends on. You will set the standard for how we build, deploy, and observe distributed systems in a regulated financial environment, and you will mentor the next generation of engineers who carry that mission forward.
This is a high-impact, high-autonomy role for someone who has done this before: built platforms from 0→1, scaled them through hypergrowth, and cared deeply about developer experience without sacrificing security or compliance.
Design, build, and maintain cloud-native infrastructure (AWS/GCP) supporting payment processing, lending, and digital banking workloads
Own the internal developer platform: CI/CD pipelines, GitOps workflows (ArgoCD), IaC (Terraform), containerization (Docker/Kubernetes), and networking
Drive the architecture of microservices and event-driven systems, ensuring low-latency, high-throughput performance under financial transaction volumes
Lead platform migrations and modernization initiatives, balancing velocity with zero-downtime requirements
Define and own SLOs/SLAs for platform components; build error budgets and lead blameless post-incident reviews
Implement and mature observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry) across services
Champion Site Reliability Engineering principles including chaos engineering, load testing, and automated runbooks
Ensure 24/7 platform availability for systems processing real-money transactions
Embed security into the SDLC: secrets management (Vault), SAST/DAST tooling, dependency scanning, and container hardening
Partner with security and compliance teams to maintain PCI-DSS, SOC 2, and applicable regulatory standards
Implement identity and access controls, audit logging, and data encryption at rest and in transit
Build and maintain internal tooling, self-service platforms, and golden path templates that reduce friction for product engineering teams
Create and maintain comprehensive runbooks, architecture decision records (ADRs), and platform documentation
Evaluate and integrate emerging technologies that improve developer productivity without introducing undue risk
Serve as a technical lead and informal mentor to junior and mid-level platform engineers
Participate in architectural reviews, RFCs, and cross-team design discussions
Advocate for engineering best practices across the organization, driving adoption of platform standards
6+ years of software or infrastructure engineering experience, with at least 3 years focused on platform engineering, DevOps, or SRE
Deep expertise in one or more cloud platforms: AWS (preferred) or GCP — including managed Kubernetes (EKS/GKE), serverless, and networking
Strong proficiency in Infrastructure as Code, particularly Terraform
Experience with CI/CD platforms (GitHub Actions, ArgoCD, Jenkins, or equivalent) and GitOps workflows
Solid programming skills in Python, Go, or a JVM language for tooling, automation, and glue code
Hands-on experience with observability and monitoring at scale (Datadog, Prometheus, Grafana, OpenTelemetry)
Familiarity with financial services compliance requirements (PCI-DSS, SOC 2, GDPR) and how they shape infrastructure decisions
Experience designing and operating distributed, high-availability systems with strict latency and reliability requirements
A high bar for quality, a self-starter mindset, and strong interpersonal skills
Strong problem-solving skills: the ability to identify problems, determine their root cause, and see them through to resolution
Ability to balance business needs with technical solutions
Experience scaling backend infrastructure
Experience in fintech, payments, banking, or a similarly regulated industry
Familiarity with financial APIs and integrations (Plaid, Stripe, ACH/SWIFT)
Knowledge of service mesh technologies (e.g., Cilium) and zero-trust networking
Experience with AI/ML infrastructure or LLM integration workloads
Background with cost optimization at scale — FinOps practices, Reserved Instances, and cloud egress management
Contributions to open-source projects or published engineering writing
We own our systems end-to-end — from infrastructure to incident to remediation
We treat developer experience as a product, not an afterthought
We build for observability first — if you can’t measure it, you can’t improve it
We move fast without breaking customer trust — security and compliance are foundational, not constraints
We write things down: ADRs, runbooks, post-mortems, and PRDs are first-class artifacts