What you'd actually do

creating and evolving systems to automatically run our suite of products and services reliably and consistently

help define service level objectives (SLOs) that determine success and build systems to achieve those objectives

utilize your strong background in deploying, managing, and maintaining production systems, working with developers to operate and monitor large-scale services with complex distributed systems and data integrations

incorporate observability tools (monitoring, telemetry, tracing, alerting), perform incident management, conduct root cause analyses, eliminate single points of failure, build reliability and redundancy into our infrastructure, establish and test our recoverability, mitigate failures, and do all of these things through automation and tools

take independent responsibility for building and managing large subsets of our systems

Skills

Required

Linux/UNIX-based systems
cloud environments (GCP & AWS)
Reliability Engineering, DevOps, or infrastructure role
infrastructure-as-code tools (e.g. Terraform, Puppet, Ansible, Chef)
containers and orchestration platforms, including Kubernetes and Docker
infrastructure systems, networking, and security
operational reliability, scalability, recoverability (backups, disaster recovery, failover), and capacity planning
operational activities including batch processing, system backups, maintenance, monitoring, and providing first-tier on-call support and being part of a 24/7 response team
distributed, scalable microservices and event-driven architectures
data storage, replication, caching, and search technologies, such as PostgreSQL, MySQL, MS SQL Server, Amazon RDS, GCP CloudSQL, Redis, Elasticsearch, and Lucene/Solr
professional certification in AWS or GCP (DevOps or SysOps Engineer preferred)
Microsoft Office suite including in-depth knowledge of Outlook, Word, and Excel with the ability to pick up new systems and software easily

Nice to have

Master's degree

At PitchBook, a Morningstar company, we are always looking forward. We continue to innovate, evolve, and invest in ourselves to bring out the best in everyone. We’re deeply collaborative and thrive on the excitement, energy, and fun that reverberates throughout the company.

Our extensive learning programs and mentorship opportunities help us create a culture of curiosity that pushes us to always find new solutions and better ways of doing things. The combination of a rapidly evolving industry and our high ambitions means there’s going to be some ambiguity along the way, but we excel when we challenge ourselves. We’re willing to take risks, fail fast, and do it all over again in the pursuit of excellence.

If you have a good attitude and are willing to roll up your sleeves to get things done, PitchBook is the place for you.

About the Role:

As a member of the Product and Engineering team at PitchBook, you will be part of a team of big thinkers, innovators, and problem solvers who strive to deepen the positive impact we have on our customers and our company every day. We value curiosity and the drive to find better ways of doing things. We thrive on customer empathy, which remains our focus when creating excellent customer experiences through product innovation.

We know that greatness is achieved through collaboration and diverse points of view, so we work closely with partners around the globe. As a team, we assume positive intent in each other’s words and actions, value constructive discussions, and foster a respectful working environment built on integrity, growth, and business value. We invest heavily in our people, who are eager to learn and constantly improve. Join our team and grow with us!

As a Sr. Site Reliability Engineer (SRE) in PitchBook’s engineering division, you will be creating and evolving systems to automatically run our suite of products and services reliably and consistently. As part of a team of site reliability engineers and platform engineers and in conjunction with group leadership, you will help define service level objectives (SLOs) that determine success and build systems to achieve those objectives.

You will utilize your strong background in deploying, managing, and maintaining production systems, working with developers to operate and monitor large-scale services with complex distributed systems and data integrations. You will incorporate observability tools (monitoring, telemetry, tracing, alerting), perform incident management, conduct root cause analyses, eliminate single points of failure, build reliability and redundancy into our infrastructure, establish and test our recoverability, mitigate failures, and do all of these things through automation and tools.

As a Sr. Site Reliability Engineer, you will take independent responsibility for building and managing large subsets of our systems. You will help build our best practices for infrastructure-as-code and your code will exemplify our quality controls. You will mentor and train other Site Reliability Engineers, platform engineers, and software engineers in reliability topics.

Your ability to collaborate with colleagues, exhibit poise and adaptability in stressful situations, communicate effectively, and build resilient systems that can be consistently relied upon will be critical to your success. You will solicit feedback, learn constantly, engage others with empathy, and help create a culture of belonging, teamwork, and purpose.

If you love building customer-centric solutions, strive for excellence every day, are adaptable and focused, and believe work should be fun, come join us!

Primary Job Responsibilities:

Bachelor's in Computer Science, Software Engineering, or related (Master's preferred)
5+ years of experience building and maintaining Linux/UNIX-based systems, primarily in cloud environments (preferably GCP & AWS)
5+ years of experience in a Reliability Engineering, DevOps, or infrastructure role, where infrastructure-as-code tools (e.g. Terraform, Puppet, Ansible, Chef) were used as a primary job function
2+ years of experience with containers and orchestration platforms, including Kubernetes and Docker
Deep knowledge of infrastructure systems, networking, and security, including in a cloud environment
Experience owning operational reliability, scalability, recoverability (backups, disaster recovery, failover), and capacity planning
Experience performing operational activities including batch processing, system backups, maintenance, monitoring, and providing first-tier on-call support and being part of a
24/7 response team
Experience with distributed, scalable microservices and event-driven architectures
Experience with data storage, replication, caching, and search technologies, such as PostgreSQL, MySQL, MS SQL Server, Amazon RDS, GCP CloudSQL, Redis, Elasticsearch, and Lucene/Solr
Hold at least one professional certification in AWS or GCP (DevOps or SysOps Engineer preferred)
Proficiency with the Microsoft Office suite including in-depth knowledge of Outlook, Word, and Excel with the ability to pick up new systems and software easily
Support the vision and values of the company through role modeling and encouraging desired behaviors
Participate in various company initiatives and projects as requested

Skills and Qualifications:

Bachelor's in Computer Science, Software Engineering, or related (Master's preferred)
5+ years of experience building and maintaining Linux/UNIX-based systems, primarily in cloud environments (preferably GCP & AWS)
5+ years of experience in a Reliability Engineering, DevOps, or infrastructure role, where infrastructure-as-code tools (e.g. Terraform, Puppet, Ansible, Chef) were used as a primary job function
2+ years of experience with containers and orchestration platforms, including Kubernetes and Docker
Deep knowledge of infrastructure systems, networking, and security, including in a cloud environment
Experience owning operational reliability, scalability, recoverability (backups, disaster recovery, failover), and capacity planning
Experience performing operational activities including batch processing, system backups, maintenance, monitoring, and providing first-tier on-call support and being part of a 24/7 response team
Experience with distributed, scalable microservices and event-driven architectures
Experience with data storage, replication, caching, and search technologies, such as PostgreSQL, MySQL, MS SQL Server, Amazon RDS, GCP CloudSQL, Redis, Elasticsearch, and Lucene/Solr
Hold at least one professional certification in AWS or GCP (DevOps or SysOps Engineer preferred)
Proficiency with the Microsoft Office suite including in-depth knowledge of Outlook, Word, and Excel with the ability to pick up new systems and software easily
Must be authorized to work in the United States without the need for visa sponsorship now or in the future

Benefits + Compensation at PitchBook:

Physical Health

Comprehensive health benefits
Additional medical wellness incentives
STD, LTD, AD&D, and life insurance

Emotional Health

Paid sabbatical program after four years
Paid family and paternity leave
Annual educational stipend
Ability to apply for tuition reimbursement
CFA exam stipend
Robust training programs on industry and soft skills
Employee assistance program
Generous allotment of vacation days, sick days, and volunteer days

Social Health

Matching gifts program
Employee resource groups
Subsidized emergency childcare
Dependent Care FSA
Company-wide events
Employee referral bonus program
Quarterly team building events

Financial Health

401k match
Shared ownership employee stock program
Monthly transportation stipend

*Please be aware the above PitchBook benefit and perk offerings are subject to corresponding plan and policy documents and may change during the course of your employment.

Compensation

Annual base salary: $175,000-$200,000
Target annual bonus percentage: 10%

Working Conditions:

At the heart of our company is a belief in the power of in-person collaboration. Being together in the office fuels our creativity, strengthens our connections, and drives the innovation that sets us apart. Our culture is built on spontaneous moments—those hallway conversations, whiteboard brainstorms, and shared celebrations in each of our global offices—that simply can’t be replicated remotely. This role is expected to be in the office 5 days a week.

The job conditions for this position are in a standard office setting. Employees in this position use PC and phone on an on-going basis throughout the day. Limited corporate travel may be required to remote offices or other business meetings and events.

We are excited to get to know you and your background. Concerned that you might not meet every requirement? We encourage you to still apply as you might be the right candidate for the role or other roles at PitchBook.

#LI-MS1

#LI-Onsite