What you'd actually do

Design, build, and maintain the release infrastructure that powers Brex’s deployment pipelines and incident workflows

Drive technical strategy and architecture for release and observability systems, making them more scalable, reliable, and secure

Collaborate with product, engineering, and operations partners to ensure Brex’s releases are safe, predictable, and low-friction

Identify and deliver improvements to the end-to-end release process (from code merge to production) to reduce risk and cycle time

Build and evolve tooling for observability and incident response, enabling fast detection, triage, and resolution

What the JD emphasized

7+ years of professional experience designing, building, and operating backend or infrastructure systems in production

Strong proficiency in backend programming languages (e.g., Go, Java, Kotlin, or Python) with a focus on reliability and performance

Hands-on experience with CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins) including build, test, and deployment automation

Experience architecting and operating scalable, high-availability distributed systems on cloud platforms (e.g., AWS, GCP, Azure)

Deep familiarity with containerization and orchestration (e.g., Docker, Kubernetes) and infrastructure-as-code (e.g., Terraform, CloudFormation)

Experience designing and maintaining observability tooling (metrics, logs, tracing) and integrating it into incident response workflows

Strong understanding of reliability and SRE practices, including SLIs/SLOs, error budgets, and incident management best practices

Proven track record of improving release processes (e.g., reducing deployment risk, increasing deployment frequency, automating rollbacks)

Why join us

Brex is the intelligent finance platform that enables companies to spend smarter and move faster in more than 200 markets. By combining global corporate cards and banking with intuitive spend management, bill pay, and travel software, Brex enables founders and finance teams to accelerate operations, gain real-time visibility, and control spend effortlessly. Brex’s AI-native automation and world-class service eliminate manual expense and accounting tasks for customers so they can focus on what matters most. Tens of thousands of the world's best companies run on Brex, including DoorDash, Coinbase, Robinhood, Zoom, Plaid, Reddit, and SeatGeek.

Working at Brex allows you to push your limits, challenge the status quo, and collaborate with some of the brightest minds in the industry. We’re committed to building a diverse team and inclusive culture and believe your potential should only be limited by how big you can dream. We make this a reality by empowering you with the tools, resources, and support you need to grow your career.

Engineering at Brex

Engineering at Brex is about building systems that scale with speed and intention. Our teams span Software, Data, Security, and IT, and operate with high autonomy and deep collaboration. We tackle hard technical problems, own our outcomes, and push for excellence at every level — from architecture to deployment. It’s an environment where engineering is a craft, and builders become leaders.

What you’ll do

As a Senior Software Engineer, Infrastructure (Release Engineering) at Brex, you will design, build, and operate the core systems that power Brex’s release, observability, and incident management processes. You will partner closely with product, platform, and operations teams to ensure releases are safe, fast, and reliable, and that our infrastructure scales securely as Brex grows.

Where you’ll work

This role will be based in our São Paulo office. We are a hybrid environment that combines the energy and connections of being in the office with the benefits and flexibility of working from home. We currently require a minimum of three coordinated days in the office per week on Monday, Wednesday and Thursday. As a perk, we also have up to four weeks per year of fully remote work!

Responsibilities

Design, build, and maintain the release infrastructure that powers Brex’s deployment pipelines and incident workflows
Drive technical strategy and architecture for release and observability systems, making them more scalable, reliable, and secure
Collaborate with product, engineering, and operations partners to ensure Brex’s releases are safe, predictable, and low-friction
Identify and deliver improvements to the end-to-end release process (from code merge to production) to reduce risk and cycle time
Build and evolve tooling for observability and incident response, enabling fast detection, triage, and resolution
Proactively identify and mitigate risks in our release and infrastructure stack, including performance, reliability, and security concerns
Define, instrument, and monitor key metrics for release engineering (e.g., deployment frequency, change failure rate, MTTR) and use them to guide improvements
Partner with other infrastructure and product teams to debug complex production issues and drive long-term fixes
Contribute to and champion best practices in release engineering, reliability, and operational excellence across the organization
Mentor other engineers on the team, providing technical guidance and code reviews to elevate the overall quality of our infrastructure
Stay up-to-date on emerging tools and practices in release engineering, observability, and SRE, and bring relevant ideas into Brex’s stack

Requirements

7+ years of professional experience designing, building, and operating backend or infrastructure systems in production
Strong proficiency in backend programming languages (e.g., Go, Java, Kotlin, or Python) with a focus on reliability and performance
Hands-on experience with CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins) including build, test, and deployment automation
Experience architecting and operating scalable, high-availability distributed systems on cloud platforms (e.g., AWS, GCP, Azure)
Deep familiarity with containerization and orchestration (e.g., Docker, Kubernetes) and infrastructure-as-code (e.g., Terraform, CloudFormation)
Experience designing and maintaining observability tooling (metrics, logs, tracing) and integrating it into incident response workflows
Strong understanding of reliability and SRE practices, including SLIs/SLOs, error budgets, and incident management best practices
Experience designing and optimizing data storage systems (SQL and/or NoSQL) for operational and observability use cases
Proven track record of improving release processes (e.g., reducing deployment risk, increasing deployment frequency, automating rollbacks)
Comfort working cross-functionally with product and other engineering teams to debug complex production issues and ship changes safely
Strong communication and collaboration skills, including writing clear design docs and driving technical decisions across teams

Brex LLC is a wholly owned subsidiary of Capital One, N.A.

Please be aware, job-seekers may be at risk of targeting by malicious actors looking for personal data. Brex recruiters will only reach out via LinkedIn or email with a brex.com domain. Any outreach claiming to be from Brex via other sources should be ignored.

Why join us

Engineering at Brex

What you’ll do

Where you’ll work

Responsibilities

Design, build, and maintain the release infrastructure that powers Brex’s deployment pipelines and incident workflows
Drive technical strategy and architecture for release and observability systems, making them more scalable, reliable, and secure
Collaborate with product, engineering, and operations partners to ensure Brex’s releases are safe, predictable, and low-friction
Identify and deliver improvements to the end-to-end release process (from code merge to production) to reduce risk and cycle time
Build and evolve tooling for observability and incident response, enabling fast detection, triage, and resolution
Proactively identify and mitigate risks in our release and infrastructure stack, including performance, reliability, and security concerns
Define, instrument, and monitor key metrics for release engineering (e.g., deployment frequency, change failure rate, MTTR) and use them to guide improvements
Partner with other infrastructure and product teams to debug complex production issues and drive long-term fixes
Contribute to and champion best practices in release engineering, reliability, and operational excellence across the organization
Mentor other engineers on the team, providing technical guidance and code reviews to elevate the overall quality of our infrastructure
Stay up-to-date on emerging tools and practices in release engineering, observability, and SRE, and bring relevant ideas into Brex’s stack

Requirements

7+ years of professional experience designing, building, and operating backend or infrastructure systems in production
Strong proficiency in backend programming languages (e.g., Go, Java, Kotlin, or Python) with a focus on reliability and performance
Hands-on experience with CI/CD and release pipelines (e.g., GitHub Actions, CircleCI, Buildkite, Argo, Spinnaker, Jenkins) including build, test, and deployment automation
Experience architecting and operating scalable, high-availability distributed systems on cloud platforms (e.g., AWS, GCP, Azure)
Deep familiarity with containerization and orchestration (e.g., Docker, Kubernetes) and infrastructure-as-code (e.g., Terraform, CloudFormation)
Experience designing and maintaining observability tooling (metrics, logs, tracing) and integrating it into incident response workflows
Strong understanding of reliability and SRE practices, including SLIs/SLOs, error budgets, and incident management best practices
Experience designing and optimizing data storage systems (SQL and/or NoSQL) for operational and observability use cases
Proven track record of improving release processes (e.g., reducing deployment risk, increasing deployment frequency, automating rollbacks)
Comfort working cross-functionally with product and other engineering teams to debug complex production issues and ship changes safely
Strong communication and collaboration skills, including writing clear design docs and driving technical decisions across teams

Brex LLC is a wholly owned subsidiary of Capital One, N.A.

Senior Software Engineer, Release Infra

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized