Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.

What is The Role:

As part of the Platform Engineering department, the Network Infrastructure team is crafting, building, and improving the multi-cloud platform at scale for Elastic CloudHosted andServerless. We grow and mature our distributed large-scale network infrastructure that spans across multiple cloud service providers to support our cloud services . We are built on Kubernetes, Go, and custom orchestration architectures. In your daily life with us, you will participate in coding, innovating technical designs, crafting solutions, improving resilience, and prioritizing security, bug fixes, and features. For example,Debugging Azure Networking for Elastic Cloud Serverless is part of our efforts, and we want your experience to contribute to a truly exceptional customer experience!

What You Will Be Doing:

Taking an engineering approach in leading technical initiatives for designing, building and automating network infrastructure and services to guarantee the reliability of the global Elastic network infrastructure. Focusing on Layer 2/3/4 of the TCP/IP stack (Ethernet and/or IP encapsulation, routing, firewalling, load balancing).
Growing our global Platform network infrastructure to meet the increasing scaling demands by Developing and maintaining software, codebases, tooling and automations to serve our Network Infrastructure as Code principle.
Collaborating in an environment with an inclusive approach, and focusing on operational excellence which uplifts others.
Preventing repeated customer impact in response to major incidents and prioritised problem management. Our on call rotation is spread well, and we address complex customer concerns too.

What You Bring:

Excellent networking skills, with knowledge of protocols such as IP/IPv6, TCP/UDP, BGP, DNS.
Strong technical depth for building and automating networks (Terraform, Ansible) in collaboration with other engineers as an authority in identifying, implementing and delivering solutions.
Good knowledge of public CSP network components (Load balancers, VPC peering/Transit gateways, VPN connectivity, Direct Connects)
Success and lessons of experiences from striving for 'progress not perfection' in the name of Platform reliability. We want to hear about your customer first approach in solving operational problems for both today and the future.
Passion for developing solutions that involve inclusive communication methods to grow and strengthen partner and team relationships. Examples of working in distributed teams or working remotely is desirable.
Site-Reliability Engineering experience. We tackle problems with code, but fundamentally we keep things working and have proven success in operational excellence. Responding to and preventing repeated customer impact in response to major incidents and prioritized problem management. Our on call rotation uses follow-the-sun model where everyone participates in it in (mostly) their working hours.

Bonus Points:

You have operated a SaaS product in a public cloud ideally built using Infrastructure-as-Code tooling such as Crossplane or Terraform.
You have designed and/or operated large network topologies that dynamic routing is based on BGP.
You have operated network topologies based on software routers.
You have experience in IP address management (IPAM) and you have used relevant tools for automated IP allocations.
You have designed and /or operated overlay networks with use of encapsulation protocols such as IPSec, GRE and VXLANYou have built or operated a Kubernetes-at-scale infrastructure, ideally across multiple cloud providers, with knowledge of the Cilium CNI.
You have written non-trivial programs in Golang or other programming languages.
You have worked with containerized services (such as Docker.)
You have proven experience in leading and improving alerting and major incident management standard processes metrics systems (e.g. Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts to present to others at varying level of the organization.
You have experience in system and network administration with professional skills in Linux on distributed systems at scale.
You have diagnosed or designed, implemented and created solutions with the Elastic Stack.
You are experienced in thriving in a self-organizing and sharing in a globally distributed team environment.
You strengthen team members in bringing out the best of each other by uplifting others with coaching and mentoring.

Compensation for this role is in the form of base salary. This role does not have a variable compensation component.

The typical starting salary range for new hires in this role is listed below. In select locations (including Seattle WA, Los Angeles CA, the San Francisco Bay Area CA, and the New York City Metro Area), an alternate range may apply as specified below.

These ranges represent the lowest to highest salary we reasonably and in good faith believe we would pay for this role at the time of this posting. We may ultimately pay more or less than the posted range, and the ranges may be modified in the future.

An employee's position within the salary range will be based on several factors including, but not limited to, relevant education, qualifications, certifications, experience, skills, geographic location, performance, and business or organizational needs.

Elastic believes that employees should have the opportunity to share in the value that we create together for our shareholders. Therefore, in addition to cash compensation, this role is currently eligible to participate in Elastic's stock program. Our total rewards package also includes a company-matched 401k with dollar-for-dollar matching up to 6% of eligible earnings, along with a range of other benefits offered with a holistic emphasis on employee well-being.

The typical starting salary range for this role is:

$179,800—$232,900 USD

The typical starting salary range for this role in the select locations listed above is:

$179,800—$232,900 USD

Additional Information - We Take Care of Our People

As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need you for what you can do.

We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

Competitive pay based on the work you do here and not your previous salary
Health coverage for you and your family in many locations
Ability to craft your calendar with flexible locations and schedules for many roles
Generous number of vacation days each year
Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
Up to 40 hours each year to use toward volunteer projects you love
Embracing parenthood with minimum of 16 weeks of parental leave

Different people approach problems differently. We need that. Elastic is an equal opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state or local law, ordinance or regulation.

We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals. To request an accommodation during the application or the recruiting process, please email candidate_accessibility@elastic.co. We will reply to your request within 24 business hours of submission.

Applicants have rights under Federal Employment Laws, view posters linked below: Family and Medical Leave Act (FMLA) Poster; Pay Transparency Nondiscrimination Provision Poster; Employee Polygraph Protection Act (EPPA) Poster and Know Your Rights (Poster)

Elasticsearch develops and distributes technology and information that is subject to U.S. and other countries’ export controls and licensing requirements for individuals who are located in or are nationals of the following sanctioned countries and regions: Belarus, Cuba, Iran, North Korea, Syria, or Russia, including the Ukrainian territories annexed by Russia (The Crimea region of Ukraine, The Donetsk People's Republic (DNR), The Luhansk People's Republic (LNR), Kherson or Zaporizhzhia). If you are located in or are a national of one of the listed countries or regions, an export license may be required as a condition of your employment in this role. Please note that national origin and/or nationality do not affect eligibility for employment with Elastic.

Please see here for our Privacy Statement.

What is The Role:

What You Will Be Doing:

Taking an engineering approach in leading technical initiatives for designing, building and automating network infrastructure and services to guarantee the reliability of the global Elastic network infrastructure. Focusing on Layer 2/3/4 of the TCP/IP stack (Ethernet and/or IP encapsulation, routing, firewalling, load balancing).
Growing our global Platform network infrastructure to meet the increasing scaling demands by Developing and maintaining software, codebases, tooling and automations to serve our Network Infrastructure as Code principle.
Collaborating in an environment with an inclusive approach, and focusing on operational excellence which uplifts others.
Preventing repeated customer impact in response to major incidents and prioritised problem management. Our on call rotation is spread well, and we address complex customer concerns too.

What You Bring:

Excellent networking skills, with knowledge of protocols such as IP/IPv6, TCP/UDP, BGP, DNS.
Strong technical depth for building and automating networks (Terraform, Ansible) in collaboration with other engineers as an authority in identifying, implementing and delivering solutions.
Good knowledge of public CSP network components (Load balancers, VPC peering/Transit gateways, VPN connectivity, Direct Connects)
Success and lessons of experiences from striving for 'progress not perfection' in the name of Platform reliability. We want to hear about your customer first approach in solving operational problems for both today and the future.
Passion for developing solutions that involve inclusive communication methods to grow and strengthen partner and team relationships. Examples of working in distributed teams or working remotely is desirable.
Site-Reliability Engineering experience. We tackle problems with code, but fundamentally we keep things working and have proven success in operational excellence. Responding to and preventing repeated customer impact in response to major incidents and prioritized problem management. Our on call rotation uses follow-the-sun model where everyone participates in it in (mostly) their working hours.

Bonus Points:

You have operated a SaaS product in a public cloud ideally built using Infrastructure-as-Code tooling such as Crossplane or Terraform.
You have designed and/or operated large network topologies that dynamic routing is based on BGP.
You have operated network topologies based on software routers.
You have experience in IP address management (IPAM) and you have used relevant tools for automated IP allocations.
You have designed and /or operated overlay networks with use of encapsulation protocols such as IPSec, GRE and VXLANYou have built or operated a Kubernetes-at-scale infrastructure, ideally across multiple cloud providers, with knowledge of the Cilium CNI.
You have written non-trivial programs in Golang or other programming languages.
You have worked with containerized services (such as Docker.)
You have proven experience in leading and improving alerting and major incident management standard processes metrics systems (e.g. Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts to present to others at varying level of the organization.
You have experience in system and network administration with professional skills in Linux on distributed systems at scale.
You have diagnosed or designed, implemented and created solutions with the Elastic Stack.
You are experienced in thriving in a self-organizing and sharing in a globally distributed team environment.
You strengthen team members in bringing out the best of each other by uplifting others with coaching and mentoring.

Compensation for this role is in the form of base salary. This role does not have a variable compensation component.

The typical starting salary range for this role is:

$179,800—$232,900 USD

The typical starting salary range for this role in the select locations listed above is:

$179,800—$232,900 USD

Additional Information - We Take Care of Our People

We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

Competitive pay based on the work you do here and not your previous salary
Health coverage for you and your family in many locations
Ability to craft your calendar with flexible locations and schedules for many roles
Generous number of vacation days each year
Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
Up to 40 hours each year to use toward volunteer projects you love
Embracing parenthood with minimum of 16 weeks of parental leave

Please see here for our Privacy Statement.

Principal Sre (networking) - Platform Control Plane

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

What is The Role:

What You Will Be Doing:

What You Bring:

Bonus Points:

Additional Information - We Take Care of Our People

What is The Role:

What You Will Be Doing:

What You Bring:

Bonus Points:

Additional Information - We Take Care of Our People