Home » Service Reliability Engineering Lead Needed At Safaricom

Service Reliability Engineering Lead Needed At Safaricom

Apply for Open Vacancies at Safaricom Kenya, Latest Recruitment at Safaricom Kenya, Vacant Positions at Safaricom Kenya Apply!, Safaricom Kenya Recruitment: Careers and How to Apply, Safaricom Kenya Recruitment: Careers and How to Apply, Job Openings at Safaricom Kenya,  Internships Program At Safaricom 2023/24 Open For Application, Latest Career Openings at Safaricom Kenya, Employment Opportunities at Safaricom Kenya, Jobs at Safaricom Kenya, Career Opportunities at Safaricom, Latest Job Openings at Safaricom Kenya, Job Openings at Safaricom Kenya, Product Owner - Wealth Management at Safaricom Kenya, Job Openings at Safaricom Kenya, Process and System Excellence Analyst at Safaricom Kenya. Senior Officer - Business Resilience at Safaricom Kenya, Sales Relationship Manager- Public Sector at Safaricom Kenya, Brand & Marketing Director Jobs at Safaricom PLC, Job Vacancies at Safaricom Kenya, Engineer – Enterprise Integration & BPM Support Jobs at Safaricom Kenya, Service Availability DevOps Engineer Jobs at Safaricom Kenya, Safaricom Discover Graduate Program Open For Application, Job Vacancies at Safaricom Kenya, Senior Officer (Supply Chain Performance Management) Jobs at Safaricom, Vacancies Open at Safaricom, Programme Analyst Jobs at Safaricom, Data Scientist Jobs, Jobs in Kenya, Latest Jobs in Kenya, Jobs Advertised Today, Government Jobs, NGO Jobs UN Jobs, Online Jobs, Freelance Jobs, Part Time Jobs, Full Time Jobs, Safaricom Annual Internship 2022/23, Jobs, Jobs in Kenya, Latest Jobs in Kenya, Jobs Advertised Today, Government Jobs, NGO Jobs UN Jobs, Online Jobs, Freelance Jobs, Part Time Jobs, Full Time Jobs

Job Identification 146

Posting Date 09/28/2023, 10:33 AM

Apply Before 10/05/2023, 10:33 AM

Degree Level Bachelor’s Degree

Job Schedule Full time

Locations Safaricom House, Waiyaki Way, Westlands, P.O Box 66827, 00800, KE

Similar HR Manager Needed At PactWorld

Brief Description

Reporting to the Senior Manager – Systems Engineering, the SRE team lead will be responsible for championing and driving operational excellence through driving the adoption of SRE best practices and ensuring system availability, performance, efficiency, change management, monitoring, emergency response, security and capacity planning.

RESPONSIBILITIES

Key Responsibilities:

  • Oversee and lead the implementation of the SRE frameworks and practices within the organization using the systems operations tool chain. Foster a collaborative and inclusive team culture that emphasizes reliability, innovation, and continuous improvement.
  • Team Management: Ensure team performance management while fostering an environment of trust, learning, collaboration and cultivate a culture of high performance.
  • Build, recruit, retain, manage and develop a world class SRE team.
  • Operational Excellence – Define, measure, monitor and report key SRE performance indicators and escalate breaches and violations. This will help in informing the maturity level of the team as well as to inform the Backlog and related decisions.Collaborate with cross-functional teams to identify, prioritize, and address reliability issues.
  • Stakeholder Engagement by engaging the business teams and promoting a culture of participation and collaboration to enhance effective and informed decision making.
  • Define, measure, monitor and report key systems reliability performance indicators and escalate breaches and violations.
  • Problem and Incident management – lead incident response efforts, ensuring that incidents are resolved quickly and effectively while minimizing downtime and customer impact. Conduct post-incident reviews to identify root causes and implement preventive measures.
  • Capacity Planning – Monitor system resource utilization and plan for capacity upgrades as needed to support business growth. Optimize resource allocation and cost-efficiency.
  • Security and Compliance: Collaborate with security teams to ensure the reliability and security of systems and applications. Ensure compliance with relevant industry standards and regulations.
  • Drivecontinuous improvement of applications through planned chaos simulations, AIOPs, automation and proactive alerting strategies.
  • Documenting “tribal” knowledge and constant upkeep of the playbooks, runbooks to ensure teams get the information they need right when they need it.
  • Champion and lead implementation of machine learning, self-healing and drive the organization towards a no-ops model.

QUALIFICATIONS

Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (Master’s degree preferred).
  • Several years of experience in SRE or a related field, with a proven track record of improving system reliability.
  • Strong leadership and team management skills.
  • Proficiency in programming/scripting languages (e.g., Python, Go, Ruby).
  • Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Knowledge of cloud computing platforms (e.g., AWS, Azure, Google Cloud).
  • Familiarity with monitoring and alerting tools (e.g., Prometheus, Grafana, ELK Stack).
  • Excellent problem-solving and communication skills.
  • Ability to work in a fast-paced, dynamic environment and handle high-pressure situations effectively.

How to Apply

If you feel that you are up to the challenge and possess the necessary qualification and experience, kindly proceed to update your candidate profileon the recruitment portal and then Click on the apply button. Remember to attach your resume.

Apply Here

Leave a Reply

Your email address will not be published. Required fields are marked *