Sr. Infrastructure Engineer - Observability and Monitoring

Intel Intel · Semiconductors · Arizona, Phoenix, United States +1

This role focuses on deploying, configuring, and managing enterprise observability and monitoring infrastructure, specifically Elastic, Logstash, Kibana, and Kafka environments. It involves implementing automation scripts for efficiency and reliability, optimizing Elasticsearch performance, designing data ingestion pipelines, and creating dashboards for operational insights. The position requires expertise in infrastructure engineering and scripting languages.

What you'd actually do

  1. Manage the full lifecycle of Elastic, Logstash, Kibana, Kafka and MS System Center Operations Manager (SCOM)
  2. Implement automation scripts (e.g., Bash, Python, Powershell, vbs, c#) to configure advanced observability scenarios and streamline repetitive tasks, maintain layered applications, data collection, system maintenance and improve system reliability
  3. Deploy, configure, and maintain Elasticsearch clusters for scalable log storage and search capabilities. Optimize Elasticsearch performance, manage indices, and implement data retention policies
  4. Design and implement Logstash pipelines for data ingestion, parsing, and transformation from multiple sources
  5. Create and maintain Kibana dashboards, visualizations, and alerts for real-time operational insights

Skills

Required

  • 5+ years of experience in Observability, Monitoring and/or manageability of infrastructure engineering
  • 2+ years' experience with Elastic, Logstash, Kafka, and Kibana
  • 2+ years' experience with a scripting language (such as Powershell, vbs, c#, Bash etc.)

Nice to have

  • Deep expertise in Elastic, Logstash, Kabana, System Center Operations Manager environments, including configuration, tuning, and support for various layered applications
  • Proficiency in automation and scripting, particularly in Powershell, vbs, c#, Bash or Python (other scripting languages a plus), to streamline OS and application management
  • Strong troubleshooting skills and experience in diagnosing issues across OS, middleware, enterprise applications, and security tools