HPC Solution Architect – AI Infrastructure (S2S)

Designs and drives deployment of GPU-accelerated AI factories and high-performance computing infrastructure, partnering with AI specialists and ecosystem partners to shape end-to-end solutions for clients. Focuses on technical solution strategy, architecture, and pre-sales for private AI assets.

What you'd actually do

  1. Leading architecture for pursuits and active opportunities, including discovery, requirements, constraints, and target-state design
  2. Defining reference architectures for on-premises, cloud, and hybrid GPU platforms across compute, network, storage, security, software, and operations
  3. Driving architecture trade-offs and decisions across performance, scalability, reliability, locality, total cost of ownership, time-to-value, and risk
  4. Owning the technical solution strategy in proposals and RFPs, including architecture narrative, assumptions, dependencies, sizing guidance, and delivery approach
  5. Facilitating client workshops and technical reviews, and translating engineering detail into executive-ready communications
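The sizing guidance this role owns usually starts from simple capacity arithmetic before any vendor tooling gets involved. A minimal sketch of that back-of-envelope math (node size, GPU memory, and bytes-per-parameter figures below are illustrative assumptions, not prescriptions):

```python
import math

def nodes_needed(required_gpus: int, gpus_per_node: int = 8) -> int:
    """Round a raw GPU requirement up to whole nodes (e.g. 8-GPU HGX-class servers)."""
    return math.ceil(required_gpus / gpus_per_node)

def training_gpu_estimate(model_params_b: float, gpu_mem_gb: int = 80,
                          bytes_per_param: int = 16) -> int:
    """Very rough minimum GPU count to hold model weights plus optimizer state.

    bytes_per_param=16 approximates mixed-precision Adam (weights, gradients,
    optimizer moments); real sizing must also account for activations,
    parallelism strategy, and framework overhead.
    """
    total_gb = model_params_b * bytes_per_param  # params (billions) * bytes/param -> GB
    return math.ceil(total_gb / gpu_mem_gb)
```

For example, a 70B-parameter model under these assumptions needs roughly 14 × 80 GB GPUs, which rounds up to two 8-GPU nodes; the real answer then gets stress-tested against the performance, TCO, and time-to-value trade-offs listed above.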

Skills

Required

  • 10+ years of experience in infrastructure architecture or engineering for large-scale platforms, including design, implementation, operations, and optimization
  • 4+ years designing or delivering GPU-accelerated platforms for AI, ML, or high-performance computing
  • 3+ years Linux system administration in production environments
  • 3+ years designing or operating distributed compute clusters for AI/HPC in hybrid cloud setups, including multi-GPU topologies, partitioning, scheduler integration, and scalability for edge-to-cloud workloads
  • 2+ years with high-performance networking or storage for AI/HPC
  • 2+ years building containerized platforms using Kubernetes or Red Hat OpenShift, including GPU operators/drivers, the NVIDIA container runtime, and cluster lifecycle automation
  • 2+ years automating infrastructure as code (IaC) with tools like Terraform and Ansible
  • At least 2 end-to-end deployments of reference architectures in the cloud or on-prem, including variants with security controls, network segmentation, operational runbooks, and validation testing
  • Experience in pre-sales or sales engineering, including discovery, solution demonstrations, and proposal/RFP contributions

Nice to have

  • 2+ years implementing AI/HPC cluster scheduling (Slurm and Kubernetes), including multi-tenant queues, quotas, and GPU-aware policies
  • 2+ years supporting generative AI infrastructure patterns, including multi-node distributed training
  • Experience with AI agents and frameworks
  • Experience with high-throughput storage for AI/HPC
  • Experience executing NVIDIA co-sell motions with OEMs (Dell, HPE, Lenovo), CSPs (AWS, Azure, Google Cloud), or independent software vendors (Run:ai, OpenShift, Weights & Biases)
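The multi-tenant queues and quotas mentioned above (in the spirit of Slurm accounts or Kubernetes ResourceQuota objects) boil down to an admission check against a per-tenant ceiling. A toy sketch, with tenant names and limits purely hypothetical:

```python
from dataclasses import dataclass

@dataclass
class TenantQuota:
    """Per-tenant GPU quota, loosely modeled on Slurm account limits
    or a Kubernetes ResourceQuota on an extended resource."""
    gpu_limit: int
    gpu_in_use: int = 0

    def admit(self, requested_gpus: int) -> bool:
        """Admit a job only if it fits under the tenant's GPU ceiling."""
        if self.gpu_in_use + requested_gpus > self.gpu_limit:
            return False
        self.gpu_in_use += requested_gpus
        return True

# Hypothetical tenants sharing one cluster.
quotas = {"research": TenantQuota(gpu_limit=16), "prod": TenantQuota(gpu_limit=32)}
```

Real schedulers layer preemption, fair-share, and GPU-aware placement (topology, MIG partitions) on top of this basic check, but the quota model is the same.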

What the JD emphasized

  • GPU-accelerated platforms for AI
  • distributed compute clusters for AI/HPC
  • high-performance networking or storage for AI/HPC
  • generative AI infrastructure patterns
  • AI agents and frameworks

Other signals

  • GPU-accelerated AI factories
  • high-performance computing infrastructure
  • private AI assets
  • large-scale, private AI and data platforms