Senior Site Reliability Engineer – CPT

Engineering/Technical
Cape Town – Western Cape – South Africa

ENVIRONMENT:
A globally recognized brand with a strong strategic vision, dedicated to enhancing people’s lives through innovative technology, is seeking a Senior Site Reliability Engineer. This role is ideal for someone passionate about technology and experienced in developing and managing advanced environment monitoring solutions. You will leverage software and automation to address challenges, optimize performance, and oversee production systems, ensuring reliability and efficiency in a cutting-edge technological landscape. You will need Matric/Grade 12, suitable certifications such as Oracle, Cloud and DevOps, and 10+ years of Software Development experience, of which 6+ years should preferably be in SRE, DevOps, or System Engineering.
 
DUTIES:
  • Master multiple scripting and programming languages to achieve advanced proficiency and deliver robust solutions.
  • Drive the design and implementation of sophisticated automation tools and processes for managing large-scale systems.
  • Lead critical incident responses with composure and efficiency, followed by thorough post-incident reviews to implement preventative measures.
  • Shape system architecture and design, bringing your vision and expertise to influence high-impact decisions.
  • Champion the creation and adherence to reliability standards, ensuring scalable and sustainable system operations.
  • Demonstrate strong strategic thinking and planning abilities to drive team and organizational success.
  • Exhibit exceptional leadership skills, with the capacity to influence key technical decisions and inspire cross-functional teams.
  • Mentor and coach junior and intermediate team members, fostering a collaborative and growth-oriented environment.
 
REQUIREMENTS:
Minimum Requirements:
  • Matric/Grade 12.
  • 10+ years in Software Development, of which 6+ years should preferably be in SRE, DevOps, or System Engineering.
  • Proficiency in scripting languages.
  • Relevant certification such as Oracle, Cloud, DevOps.
 
Technical Skills:
  • Continuous delivery
  • Cloud skills and best practices
  • Observability (System and Application Performance Monitoring)
  • Infrastructure as Code
  • Configuration Management (Infrastructure as a Service)
  • Containers
  • Automation
  • Collaboration and Communication
  • Coding and Scripting
  • Azure DevOps
  • General system uptime
  • SLO (Service-Level Objectives)
  • Latency
  • Incident and Outage Management
  • Change Management
  • Capacity Planning
 
ATTRIBUTES:
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
  • Strong troubleshooting skills.
  • Self-disciplined and self-motivated.
  • Ability to learn quickly and share knowledge with others.
  • Able to work well both in a team and independently.
  • Accountable and responsible.
  • Attention to detail, accurate and analytical.
  • Good reporting and documentation skills.
  • Excellent communication skills.

+ 27 (0) 21 741 0400