What you'd actually do

Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms. Join in on-call shift to quickly respond to and resolve issues.

Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery.

Analyze and optimize the performance of data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, an improve processing speed.

Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability.

Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly. Support Freewheel powered Live events.

Skills

Required

3+ years of experience as an SRE, DevOps or Operations Engineer
Experience with an automation tool or framework such as Ansible, Terraform, Kubernetes, Docker for automating system deployment
Proficient in at least one programming language, such as Python, Go, Java, or Scala
Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools
Excellent communication skills

Nice to have

Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure)
Hands-on experience with Terraform and infrastructure as code principle

FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can insert advertisements around the world.

Job Summary

FreeWheel is seeking an SRE to join Freewheel OPS team based in Denver, CO or Chicago, IL. As a member of the Global Operation team, you will be responsible for ensuring the reliability, scalability, and performance of Freewheel systems. Working closely with engineers and other operation sub-teams, you will manage infrastructure, optimize system reliability, automate daily operations, and resolve technical issues that impact upstream/downstream platform.

Job Description

Qualifications:

3+ years of experience as an SRE, DevOps or Operations Engineer.
Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure) is a plus.
Hands-on experience with Terraform and infrastructure as code principle is a huge plus.
Experience with an automation tool or framework such as Ansible, Terraform, Kubernetes, Docker for automating system deployment.
Programming Skills: Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools.
System Monitoring and Log Management: Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools.
Team Collaboration and Communication: Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders.
Proactive learner eager to grow in operations and governance.
Education: Bachelor’s degree or higher in Computer Science, Software Engineering, or a related field.

Key Responsibilities:

System Monitoring and Optimization: Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms. Join in on-call shift to quickly respond to and resolve issues.
Automation and Tool Development: Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery.
Performance Optimization: Analyze and optimize the performance of data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, an improve processing speed.
Incident Response and Troubleshooting: Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability.
Capacity Planning and Scaling: Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly. Support Freewheel powered Live events.
Documentation and Knowledge Sharing: Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team and providing relevant training.
Security and Compliance: Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse.
Cross-Team Collaboration: Collaborate with engineering team, product team, and project management team to support product design and implementation, solving reliability-related issues.

Core Responsibilities Engineering technical solutions for infrastructure and application management, monitoring, and operations with standardization and automation focus Collaborating with cross-functional teams to identify and address reliability and performance issues Providing cybersecurity support such as vulnerability cleanup, secure server configuration, testing and validation, technical controls implementation and incident remediation Working closely with developers to ensure software releases are well-designed, planned, implemented, released, and monitored Measuring and improving reliability, quality and efficiency of platforms Working on-call shift, and support incident prevention, response, and retrospect Performing a variety of complex analytical duties in the planning, deployment, testing and evaluation of products Contributing to the design and implementation of reliable and scalable infrastructure solutions with best practices, tool use, and quality assurance Monitoring system performance and implementing improvements to optimize reliability, availability, production quality, operational efficiency, and engineering productivity Developing and maintaining tools for monitoring, deployment, and operations Consistent exercise of independent judgment and discretion in matters of significance. Regular, consistent and punctual attendance. Must be able to work nights and weekends, variable schedule(s) as necessary. Other duties and responsibilities as assigned. Employees at all levels are expected to:

Understand our Operating Principles; make them the guidelines for how you do your job. Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services. Know your stuff - be enthusiastic learners, users and advocates of our game-changing technology, products and services, especially our digital tools and experiences. Win as a team - make big things happen by working together and being open to new ideas. Be an active part of the Net Promoter System - a way of working that brings more employee and customer feedback into the company - by joining huddles, making call backs and helping us elevate opportunities to do better for our customers. Drive results and growth. Support a culture of inclusion in how you work and lead. Do what's right for each other, our customers, investors and our communities.

Disclaimer: This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications.

Skills

Collaborating, Design, Net Promoter Score (NPS)

Compensation

Primary Location Pay Range: $99,684.63 - $149,526.95

This job can be performed in Denver Campus with a Pay Range of $99,684.63 - $156,647.28

Comcast intends to offer the selected candidate base pay within this range, dependent on job-related, non-discriminatory factors such as experience. The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.

Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That’s why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality – to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.

Education

Bachelor's Degree

While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.

Certifications (if applicable)

Relevant Work Experience

5-7 Years

Comcast is an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.