Senior System Engineering

AT&T AT&T · Telecom · Plano, TX

Senior System Engineering role focused on providing 24x7 Tier 1 support for agent-facing applications like Salesforce, Mulesoft, etc. Responsibilities include incident management, triaging, resolution, developing knowledge bases, creating runbooks, building observability (alerts, monitoring, dashboards), and optimizing the on-call process. The role also involves working with Release Management and Product Development for risk mitigation and knowledge transfer. Requires technical leadership and strong communication skills.

What you'd actually do

  1. Provide 24x7 Tier 1 support for agent facing applications –Salesforce, Salesforce Marketing Cloud, Mulesoft, OPUS, Customer Account Lookup & Management (CALM).
  2. Manage escalated issues, incidents and outages, triage and driving prompt resolution.
  3. Provide prompt visibility and status of escalated issues, incidents and outages to leadership, business partners and other key stakeholders.
  4. Responsible for Site Reliability Engineering aspects such as developing functional and technical knowledge-base of the application, creation of run books, developing observability of the application in terms of alerts, monitoring and dashboards that enable proactive incident and problem detection, triaging of the incidents and helping Tier 2conduct blameless post-mortems (after action reviews).
  5. Work with Release Management related to upcoming changes to production to identify risks and mitigate them.

Skills

Required

  • Bachelor’s degree in Electronics Engineering, Computer Science or Engineering
  • 2 Years of experience in the job offered or 2 Years of experience in a related occupation
  • leadership and building cross-organizational consensus
  • building and managing high-performing teams
  • Incident Management
  • Incident response
  • managing Tier 1 Production Operations team
  • supporting large scale applications in production –ERP, CRM in a leadership capacity
  • Salesforce Development (Apex, Visualforce, Lightning)
  • Salesforce Sales Cloud & Service Cloud
  • MuleSoft
  • Dynatrace
  • ELK (Elastic, Logstash, Kibana) for monitoring and logging
  • Customer Experience Analytics &Session Based tools - Quantum Metric and Tealeaf
  • Synthetic Monitoring tools (Catchpoint)
  • Application Performance Monitoring tools (Dynatrace, AppDynamics, Introscope, etc.)
  • Kibana and Grafana visualization tools
  • Creation of Dashboards on Dynatrace, ELK and Grafana
  • great communication skills

Nice to have

  • technical leadership

What the JD emphasized

  • 24x7 Tier 1 support
  • escalated issues, incidents and outages
  • incident management
  • incident response
  • managing Tier 1 Production Operations team
  • supporting large scale applications in production