What you'd actually do

Define and drive the technical strategy for AI/ML platform infrastructure supporting generative AI applications, LLM integrations, model routing, and enterprise AI services.

Architect, build, and operate scalable cloud platforms using AWS services such as EKS, ECS Fargate, Lambda, DynamoDB, S3, OpenSearch, Secrets Manager, CloudWatch, ALB, and MWAA.

Establish reusable infrastructure patterns using CloudFormation, Helm, and Terraform to support reliable multi-environment and multi-region deployments.

Lead CI/CD architecture using GitHub Actions, reusable workflows, OIDC-based AWS authentication, automated quality gates, deployment promotion, and environment approvals.

Design and improve observability across AI platforms, including CloudWatch dashboards, logs, alarms, Prometheus/Grafana, OpenSearch, Langfuse, and LLM-specific operational metrics.

Skills

Required

7+ years of experience in DevOps, platform engineering, cloud infrastructure, site reliability engineering, or software engineering roles.
Strong hands-on experience with AWS/Azure/GCP infrastructure and services, including container, serverless, networking, storage, observability, and security services.
Experience designing and operating production systems on Kubernetes, ECS/Fargate, or comparable container orchestration platforms.
Proficiency with infrastructure-as-code, especially CloudFormation, Terraform, Helm, or similar tooling.
Strong CI/CD experience with GitHub Actions or similar platforms, including reusable workflows, automated testing, deployment gates, and cloud authentication.
Experience building and operating observability solutions using CloudWatch, Prometheus/Grafana, OpenSearch, or similar tools.
Strong understanding of cloud security practices, IAM, secrets management, least-privilege access, audit logging, and compliance requirements.
Experience supporting distributed systems, microservices, APIs, asynchronous workloads, and multi-environment deployments.
Demonstrated ability to lead technical design, mentor engineers, and influence engineering practices across teams.

Nice to have

Experience supporting AI/ML or generative AI platforms, including LLM gateways, model routing, prompt observability, token metering, or model failover.
Experience operating platforms in regulated enterprise environments, ideally healthcare, pharmaceutical, finance, or life sciences.
Experience with multi-account, multi-region AWS architectures and enterprise governance patterns.
Experience with cost optimization, autoscaling strategies, capacity planning, and cloud budget monitoring.
Experience with load testing and performance validation using tools such as Locust or comparable frameworks.
Strong Python or scripting skills for platform automation, operational tooling, and CI/CD extensions.
Ability to communicate complex technical decisions clearly to engineering, security, operations, and leadership audiences.

What the JD emphasized

enterprise-scale generative AI applications

AI/ML platform infrastructure

generative AI applications

LLM integrations

enterprise AI services

scalable cloud platforms

multi-environment and multi-region deployments

AI platforms

GenAI workloads

deployment reliability

security and compliance practices

cost optimization

capacity planning

operational resilience

regulated enterprise environments

Other signals

Provide technical leadership for cloud platforms, deployment systems, and operational foundations that power enterprise-scale generative AI applications.

Staff Platform Engineer, AI/ML Infrastructure

Department: AI Software & Operations

Role Summary

The Staff Platform Engineer, AI/ML Infrastructure will provide technical leadership for the cloud platforms, deployment systems, and operational foundations that power enterprise-scale generative AI applications. This role will define and evolve the infrastructure architecture for AI/ML platforms running across AWS, Kubernetes, serverless, and containerized environments. The engineer will lead platform standards for reliability, scalability, observability, CI/CD, security, and developer enablement, while partnering closely with software engineering, AI engineering, security, and operations teams.

The ideal candidate combines deep hands-on cloud engineering experience with staff-level technical influence. They are comfortable designing infrastructure patterns, writing infrastructure-as-code, improving delivery pipelines, mentoring engineers, and making architectural decisions that raise the operational maturity of AI platforms across multiple teams.

Key Responsibilities

Define and drive the technical strategy for AI/ML platform infrastructure supporting generative AI applications, LLM integrations, model routing, and enterprise AI services.
Architect, build, and operate scalable cloud platforms using AWS services such as EKS, ECS Fargate, Lambda, DynamoDB, S3, OpenSearch, Secrets Manager, CloudWatch, ALB, and MWAA.
Establish reusable infrastructure patterns using CloudFormation, Helm, and Terraform to support reliable multi-environment and multi-region deployments.
Lead CI/CD architecture using GitHub Actions, reusable workflows, OIDC-based AWS authentication, automated quality gates, deployment promotion, and environment approvals.
Design and improve observability across AI platforms, including CloudWatch dashboards, logs, alarms, Prometheus/Grafana, OpenSearch, Langfuse, and LLM-specific operational metrics.
Build platform capabilities for GenAI workloads, including model availability monitoring.
Partner with software engineering teams to improve deployment reliability, rollback strategies, health checks, autoscaling, load testing, and runtime performance.
Define and enforce security and compliance practices for infrastructure, including IAM permission boundaries, Secrets Manager usage, secret scanning, audit logging, tagging standards, and change-management controls.
Provide technical leadership for cost optimization, capacity planning, environment standardization, and operational resilience across development, test, production, and sandbox environments.
Mentor engineers, review architecture and infrastructure designs, and influence platform engineering practices across teams.

Basic Qualifications

Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related technical field, or equivalent practical experience.
7+ years of experience in DevOps, platform engineering, cloud infrastructure, site reliability engineering, or software engineering roles.
Strong hands-on experience with AWS/Azure/GCP infrastructure and services, including container, serverless, networking, storage, observability, and security services.
Experience designing and operating production systems on Kubernetes, ECS/Fargate, or comparable container orchestration platforms.
Proficiency with infrastructure-as-code, especially CloudFormation, Terraform, Helm, or similar tooling.
Strong CI/CD experience with GitHub Actions or similar platforms, including reusable workflows, automated testing, deployment gates, and cloud authentication.
Experience building and operating observability solutions using CloudWatch, Prometheus/Grafana, OpenSearch, or similar tools.
Strong understanding of cloud security practices, IAM, secrets management, least-privilege access, audit logging, and compliance requirements.
Experience supporting distributed systems, microservices, APIs, asynchronous workloads, and multi-environment deployments.
Demonstrated ability to lead technical design, mentor engineers, and influence engineering practices across teams.

Preferred Qualifications

Experience supporting AI/ML or generative AI platforms, including LLM gateways, model routing, prompt observability, token metering, or model failover.
Experience operating platforms in regulated enterprise environments, ideally healthcare, pharmaceutical, finance, or life sciences.
Experience with multi-account, multi-region AWS architectures and enterprise governance patterns.
Experience with cost optimization, autoscaling strategies, capacity planning, and cloud budget monitoring.
Experience with load testing and performance validation using tools such as Locust or comparable frameworks.
Strong Python or scripting skills for platform automation, operational tooling, and CI/CD extensions.
Ability to communicate complex technical decisions clearly to engineering, security, operations, and leadership audiences.

Technical Environment

This role works across a modern AI platform ecosystem including:

Cloud: AWS EKS, ECS Fargate, Lambda, DynamoDB, S3, OpenSearch, CloudWatch, Secrets Manager, ALB, VPC, IAM
Infrastructure-as-Code: CloudFormation, Helm, Terraform
CI/CD: GitHub Actions, reusable workflows, OIDC federation, environment approvals, automated release promotion
AI/ML Platform: AWS Bedrock, Azure OpenAI, LiteLLM, Langfuse
Observability: CloudWatch dashboards and alarms, Prometheus, Grafana, OpenSearch, Langfuse, custom metrics
Security & Governance: IAM permission boundaries, secret scanning, audit logging, tagging compliance, change-management automation
Engineering Practices: Docker, Python, pre-commit, automated testing, load testing, code quality gates, monorepo service standards

Leadership Expectations

As a J090 Staff-level engineer, this role is expected to operate beyond individual delivery. The engineer will identify systemic platform gaps, define technical direction, create reusable standards, and raise engineering maturity across multiple teams.

Success in this role requires strong judgment, ownership, and communication. The engineer should be able to balance hands-on implementation with architectural leadership, guide teams through ambiguous technical decisions, and build platform capabilities that make AI product teams faster, safer, and more reliable.

Work location assignment: Remote

The annual base salary for this position ranges from €65.250,00 to €108.750,00. This salary range applies to the location France - Rives de Paris. We also offer a range of benefits and programs to meet colleagues’ needs. Benefits vary by location and can include health care coverage, retirement savings plans, insurance benefits, an Employee Assistance Program, wellness benefits and more. Additional details about total compensation and benefits will be provided during the hiring process. Pfizer compensation structures and benefit packages are aligned based on the location of hire. Final compensation will be determined based on the successful candidate’s relevant skills, experience, and qualifications, in accordance with pay equity principles and applicable employment laws. This role is posted in multiple locations. If you are applying for the role in a secondary job posting location where pay transparency regulations apply, your Talent Advisor will share the local pay information with you during the first interview.

Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates.

Equal Opportunity & Employment

We believe that diverse and inclusive teams are essential to a company's success. As an employer, Pfizer is committed to valuing diversity and inclusion in all its forms. This diversity is also reflected in the patients and communities we serve. Together, let us continue to build a culture that encourages, supports, and empowers our employees.

Disability & Inclusion

Our mission is to unlock the potential of our colleagues, and we are proud to be an inclusive employer for people with disabilities, ensuring equal employment opportunity for all candidates. We encourage you to give your best, knowing that we will make all reasonable adjustments to support your application and your future career. Your experience with Pfizer starts here!

Pfizer endeavors to make www.pfizer.com/careers accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process and/or interviewing, please email disabilityrecruitment@pfizer.com. This is to be used solely for accommodation requests with respect to the accessibility of our website, online application process and/or interviewing. Requests for any other reason will not be returned.

To better understand the permitted and prohibited uses of artificial intelligence throughout the recruitment process, we invite you to consult our best practices on the use of AI by candidates on Pfizer Careers.

Information & Business Tech