Platform Engineer — Cloud Infrastructure (smts)

Salesforce · Enterprise · Redwood City, CA

Salesforce is seeking a Senior Member of Technical Staff (SMTS) for their Platform Engineering team within Cloud Infrastructure. This role focuses on applying AI/ML solutions to infrastructure and operations problems, building intelligent, self-healing platform tools. The engineer will write core platform services in Go and Python, design multi-agent workflows for automation, build RAG systems for documentation, and act as an AI amplifier for the engineering organization. The primary focus is on integrating AI, LLMs, and autonomous agents into multi-cloud platform services to improve reliability, reduce toil, and enhance developer experience.

What you'd actually do

Design, build, and operate platform services and infrastructure automation in Go and Python, embedding AI capabilities directly into the core platform software.
Architect and implement intelligent, closed-loop automation systems (AIOps) that leverage LLMs and autonomous agents to detect anomalies, perform root-cause analysis, and execute self-healing remediation playbooks.
Build and maintain Retrieval-Augmented Generation (RAG) applications over internal platform documentation, runbooks, and historical incident data to drastically reduce engineering MTTR.
Develop custom tools, CLI plugins, and Model Context Protocol (MCP) integrations that connect our cloud infrastructure APIs to agentic coding tools (like Claude Code), turning standard automation into autonomous workflows.
Partner with SRE, security, and platform specialists to identify highly repetitive operational work and build agentic solutions that delegate that toil to AI.

Skills

Required

5+ years of professional experience in software engineering, platform engineering, or DevOps, with a recent, heavy focus on building and implementing AI solutions.
Strong understanding of core AI and ML concepts applied practically to software engineering, including LLM context window optimization, embedding models, semantic search, vector databases, and prompt engineering/tuning.
Experience building with agentic frameworks and LLM orchestration tooling to execute multi-step, autonomous tasks.
Good programming skills in Golang and Python, with the ability to build production-grade backend services, APIs, and microservices.
Solid fundamental knowledge of cloud-native infrastructure, with hands-on experience in Kubernetes and multi-cloud environments (AWS, Azure, GCP, or OCI).
Familiarity with continuous deployment and infrastructure-as-code concepts (GitOps with Flux/Argo CD, Pulumi, or Terraform).
Demonstrated agentic and automation mindset — you have a proven track record of using AI to automate complex workflows and can speak deeply on how you design AI systems to handle edge cases, tool-calling errors, and non-deterministic outputs.
Strong communication and collaboration skills, with a passion for teaching, raising the team’s AI literacy, and evangelizing AI solutions across engineering boundaries.

Nice to have

Hands-on experience building custom extensions, plugins, or Model Context Protocol (MCP) servers for agentic developer tools like Claude Code or GitHub Copilot.
Experience applying AI specifically to observability data (parsing logs, analyzing metrics, or correlating distributed traces) for predictive scaling or automated alerting.
Deep experience working with vector databases (e.g., Pinecone, Qdrant, Milvus, pgvector) inside platform applications.
Experience operating AI-driven tools within compliance-driven environments (FedRAMP, SOC 2), ensuring strong data privacy boundaries, LLM guardrails, and secure handling of sensitive cloud credentials.
Experience with internal developer platforms (IDPs), platform APIs, or building developer experience

What the JD emphasized

strong AI/ML software engineering expertise
applying AI solutions directly to infrastructure and operations problems
design multi-agent workflows to automate complex operational tasks
build RAG systems over engineering documentation
act as the core AI amplifier
architecting the intelligent systems that multiply the entire engineering organization's output
embedding AI capabilities directly into the core platform software
leverage LLMs and autonomous agents
agentic coding tools
build agentic solutions that delegate that toil to AI
proven track record of using AI to automate complex workflows
design AI systems to handle edge cases, tool-calling errors, and non-deterministic outputs

Other signals

AI/ML software engineering expertise
applying AI solutions directly to infrastructure and operations problems
design multi-agent workflows to automate complex operational tasks
build RAG systems over engineering documentation
architecting the intelligent systems that multiply the entire engineering organization's output

Read full job description

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.

Ready to level-up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce.

Platform Engineering — Cloud Infrastructure Overview of the Role The SMTS role is part of our Platform Engineering team within the Cloud Infrastructure organization. Platform Engineering is made up of platform engineers, SREs, and DevOps specialists who design, build, and operate the internal developer platform powering hundreds of Kubernetes clusters across AWS, Azure, GCP, and OCI. Whether we are automating cluster lifecycle management, hardening our GitOps delivery pipelines, or architecting autonomous agents to manage production systems, we strive to give every product team a fast, secure, and reliable path to production.

We are looking for a Senior Member of Technical Staff with strong AI/ML software engineering expertise to build the next generation of intelligent, self-healing platform tools. Instead of managing GPU hardware, your focus will be applying AI solutions directly to infrastructure and operations problems. You will write core platform services in Go and Python, design multi-agent workflows to automate complex operational tasks, build RAG systems over engineering documentation, and act as the core AI amplifier — architecting the intelligent systems that multiply the entire engineering organization's output.

What You’ll Actually Be Doing Success will be measured by how effectively you integrate AI, LLMs, and autonomous agents into our multi-cloud platform services to improve system reliability, reduce operational toil, and elevate the developer experience.

Design, build, and operate platform services and infrastructure automation in Go and Python, embedding AI capabilities directly into the core platform software.

Architect and implement intelligent, closed-loop automation systems (AIOps) that leverage LLMs and autonomous agents to detect anomalies, perform root-cause analysis, and execute self-healing remediation playbooks.

Build and maintain Retrieval-Augmented Generation (RAG) applications over internal platform documentation, runbooks, and historical incident data to drastically reduce engineering MTTR.

Develop custom tools, CLI plugins, and Model Context Protocol (MCP) integrations that connect our cloud infrastructure APIs to agentic coding tools (like Claude Code), turning standard automation into autonomous workflows.

Partner with SRE, security, and platform specialists to identify highly repetitive operational work and build agentic solutions that delegate that toil to AI.

Maintain and improve standard continuous deployment pipelines using GitOps tooling (Flux, Argo CD) and infrastructure-as-code frameworks (Pulumi, Terraform) to ensure safe, repeatable delivery of both traditional platform code and AI-driven solutions.

Participate in design reviews, write clear technical documentation and RFCs, and mentor traditional platform engineers on AI/ML concepts, prompt engineering, and agentic design patterns.

Contribute to on-call rotations and continuously bring an AI-first perspective to improving incident management and platform post-mortems.

You’re Our Person If… 5+ years of professional experience in software engineering, platform engineering, or DevOps, with a recent, heavy focus on building and implementing AI solutions.

Strong understanding of core AI and ML concepts applied practically to software engineering, including LLM context window optimization, embedding models, semantic search, vector databases, and prompt engineering/tuning.

Experience building with agentic frameworks and LLM orchestration tooling to execute multi-step, autonomous tasks.

Good programming skills in Golang and Python, with the ability to build production-grade backend services, APIs, and microservices.

Solid fundamental knowledge of cloud-native infrastructure, with hands-on experience in Kubernetes and multi-cloud environments (AWS, Azure, GCP, or OCI).

Familiarity with continuous deployment and infrastructure-as-code concepts (GitOps with Flux/Argo CD, Pulumi, or Terraform).

Demonstrated agentic and automation mindset — you have a proven track record of using AI to automate complex workflows and can speak deeply on how you design AI systems to handle edge cases, tool-calling errors, and non-deterministic outputs.

Strong communication and collaboration skills, with a passion for teaching, raising the team’s AI literacy, and evangelizing AI solutions across engineering boundaries.

Even Better If… Hands-on experience building custom extensions, plugins, or Model Context Protocol (MCP) servers for agentic developer tools like Claude Code or GitHub Copilot.

Experience applying AI specifically to observability data (parsing logs, analyzing metrics, or correlating distributed traces) for predictive scaling or automated alerting.

Deep experience working with vector databases (e.g., Pinecone, Qdrant, Milvus, pgvector) inside platform applications.

Experience operating AI-driven tools within compliance-driven environments (FedRAMP, SOC 2), ensuring strong data privacy boundaries, LLM guardrails, and secure handling of sensitive cloud credentials.

Experience with internal developer platforms (IDPs), platform APIs, or building developer experience (DevEx) tooling.

Contributions to open-source projects is a plus

Unleash Your Potential

When you join Salesforce, you’ll be limitless in all areas of your life. Our benefits and resources support you to find balance and be your best, and our AI agents accelerate your impact so you can do your best. Together, we’ll bring the power of Agentforce to organizations of all sizes and deliver amazing experiences that customers love. Apply today to not only shape the future — but to redefine what’s possible — for yourself, for AI, and the world.

Accommodations

If you need a reasonable accommodation during the application or the recruiting process, please submit a request via this Accommodations Request Form.

Please note that Salesforce uses artificial intelligence (AI) tools to help our recruiters assess and evaluate candidates’ resumes and qualifications throughout the recruiting process. Humans will always make any candidate selection and hiring decisions. Please see our Candidate Privacy Statement for more information about how we use your personal data and your rights, including with regard to use of AI tools and opt out options.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that’s inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com.

At Salesforce, we believe in equitable compensation practices that reflect the dynamic nature of labor markets across various regions. The typical base salary range for this position is $148,500 - $223,900 annually. In select cities within the San Francisco and New York City metropolitan area, the base salary range for this role is $178,900 - $246,000 annually. The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable.