What you'd actually do

Drive infrastructure automation, CI/CD, and monitoring/alerting pipelines.

Collaborate with Field Engineering teams to support PoCs and platform deployments in customer cloud VPCs and on-prem.

Deploy, scale, and optimize ML/NLP workloads, especially model inference.

Lead initiatives to improve system reliability, scalability, and developer experience.

Contribute to architecture and infrastructure decisions as we scale our platform.

Skills

Required

5+ years of experience as a software engineer with a focus on backend systems and platform engineering.
Deep experience across computing environments (GCP, AWS, Azure, or on-prem).
Strong understanding of containerization and orchestration (Docker, Kubernetes).
Experience with observability tools (Prometheus, Grafana, ELK/EFK, etc.).
Proficiency in languages like Go, Python, or Java; experience with infrastructure-as-code (Terraform, Pulumi, etc.).

Nice to have

Experience working on ML platforms or supporting ML workloads in production.
Familiarity with data infrastructure (e.g., Kafka, Spark, Airflow).
Experience providing technical support to customers for custom-developed systems.

Vectara provides a scalable platform to deploy your Enterprise AI Agents and AI Assistants with Accuracy, Security, and Explainability like no other solution. Our enterprise RAG and Agentic AI Platform offers unparalleled Accuracy, Security, and Explainability by leveraging the strongest models for retrieval, embedding, reranking, and reasoning, an optimized LLM trained for quality, and advanced Hallucination Mitigation. We are the developers of the Hughes Hallucination Evaluation Model and Correction model, core to ensuring accuracy, quality, and responsible AI that is production-ready. These innovations have been cited in the New York Times, Visual Capitalist, and many other leading publications. This platform has earned us more than 100 enterprise clients, including large US high-tech companies, military organizations, and companies in financial services, healthcare, and manufacturing.

Our founding team includes industry veterans and experts in neural information retrieval and distributed systems from Google. Join us as we pursue our mission to help the world find meaning. People at Vectara are passionate about ensuring customers take advantage of breakthroughs in applied Artificial Intelligence (AI) to solve real-world technology and business problems today. Our team is a group of unquestionable all-stars in their respective fields of computer science and business from Google, Cloudera, Splunk, MongoDB, Elastic, and more.

**Role Overview:** Vectara is seeking a Senior Platform Software Engineer with strong experience in modern DevOps practices and backend development. In this role, you’ll develop infrastructure-as-code (IaC) and Helm charts for deploying and managing the core infrastructure that powers our retrieval-augmented generation and agentic AI platform, and you’ll design and develop features in the platform. The design and deployment of the platform require high availability, scalability, and automation. You will be part of our on-call rotation for addressing customer tickets. The role also requires basic platform and application development skills.

You will be an integral part of our Forward Deployed Engineering function, working with our Field Engineering team to build and support everything from PoCs to production applications used by millions of people. In doing so, you will contribute to customer satisfaction and sales outcomes for our business.

Key Responsibilities:

Use AI-assisted coding proficiently: handle multiple tasks in parallel and manage AI agents to execute quickly.
Drive infrastructure automation, CI/CD, and monitoring/alerting pipelines.
Collaborate with Field Engineering teams to support PoCs and platform deployments in customer cloud VPCs and on-prem.
Deploy, scale, and optimize ML/NLP workloads, especially model inference.
Lead initiatives to improve system reliability, scalability, and developer experience.
Contribute to architecture and infrastructure decisions as we scale our platform.
Champion best practices in code quality, testing, and DevOps culture across the team.
Design, implement, and maintain scalable and secure backend services and platform components.

Qualifications:

5+ years of experience as a software engineer with a focus on backend systems and platform engineering.
Deep experience across computing environments (GCP, AWS, Azure, or on-prem).
Strong understanding of containerization and orchestration (Docker, Kubernetes).
Experience with observability tools (Prometheus, Grafana, ELK/EFK, etc.).
Experience working on ML platforms or supporting ML workloads in production.
Familiarity with data infrastructure (e.g., Kafka, Spark, Airflow).
Proficiency in languages like Go, Python, or Java; experience with infrastructure-as-code (Terraform, Pulumi, etc.).
Experience providing technical support to customers for custom-developed systems.

**Equity and Salary Range:**

Salary is just one component of Vectara’s employee compensation. Our full-time employees are also equity owners in the company, which although not an immediate cash component, can have positive impacts on long-term total compensation for each participating employee. We would be remiss if we didn’t highlight and celebrate our focus on engaging many of our employees in being economic co-owners of the business.

Vectara welcomes all. We value the collective wisdom of people from different backgrounds, experiences, abilities and perspectives. We never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Vectara has a positive and supportive culture—we look for people who are inventive and work to be a little better every single day. We seek to be smart, humble, hardworking and, above all, curious. After all, we are on a mission to find meaning.