Platform Engineer - Infrastructure

Ford · Auto · Dearborn, MI +1 · Ford Credit Services

This Platform Engineer role focuses on building and operating shared cloud infrastructure and reliability practices for product teams at Ford Credit Company. The work involves designing, building, and automating cloud platforms, defining reliability targets, and creating self-service workflows, with a strong emphasis on SRE principles and software development skills.

What you'd actually do

  1. Design, build, and operate cloud infrastructure and platform capabilities (networking, compute, Kubernetes, CI/CD, secrets, certificates, identity).
  2. Define and improve reliability using service-level indicators (SLIs), service-level objectives (SLOs), and error budgets.
  3. Implement observability (metrics, logs, traces) with actionable alerting focused on user impact.
  4. Create self-service workflows and automation (infrastructure as code, GitOps, build/release pipelines) that reduce toil.
  5. Improve security and compliance through least-privilege access, secure defaults, policy-as-code, and continuous hardening.

Skills

Required

  • Bachelor of Science degree
  • 5 years of experience in software development
  • Experience operating production cloud platforms and services (e.g., GCP/AWS/Azure) with an SRE mindset.
  • Strong fundamentals in Linux, networking, distributed systems, and debugging complex production issues.
  • Proficiency with infrastructure as code and automation (e.g., Terraform, Helm/Kustomize, GitOps tooling).
  • Experience with containers and orchestration (Docker, Kubernetes) and modern CI/CD.
  • Programming and scripting ability (e.g., Go, Python, Java, TypeScript) to build tooling and automate workflows.
  • Clear communication, effective incident leadership, and a customer-focused approach to platform work.

Nice to have

  • 7+ years of experience in software development
  • Experience defining SLIs/SLOs and implementing SLO-based alerting and dashboards.
  • Observability platform experience (e.g., Prometheus/Grafana, OpenTelemetry, centralized logging).
  • Policy-as-code and supply chain security (e.g., OPA/Rego, SLSA concepts, SBOMs, artifact signing).
  • Experience building golden paths (container images, templates, reference architectures, paved pipelines) adopted by multiple teams.
  • Cost optimization experience (FinOps practices, capacity forecasting, right-sizing, multi-tenant platform controls).

What the JD emphasized

  • operating production cloud platforms and services
  • SRE mindset
  • Linux, networking, distributed systems, and debugging complex production issues
  • infrastructure as code and automation
  • containers and orchestration
  • modern CI/CD
  • Programming and scripting ability
  • Clear communication, effective incident leadership, and a customer-focused approach