- Job Type Full Time
- Qualification Bachelors
- Experience 5 years
- Location Gauteng
- Job Field ICT / Computer 
Site Reliability Engineer at Momentum Metropolitan Holdings Limited
Site Reliability Engineer
Role Purpose
- This position requires implementing, enabling and facilitating DevOps practices and infrastructure within teams.
Requirements
- Computer Science Degree
- Min 5 years of relevant experience
- Solid Experience and Understanding architecture and implementation
Technical Mandatory:
- WebSphere Application Server ND 8.5 or higher
- WebSphere Liberty
- Microsoft IIS
- Springboot
- Unix experience
- Cloud technologies (AWS / Azure)
- Platform Development (Ansible, Python, bash, wsadmin, GIT, Jenkins, Flyway, Maven, Python)
- Nginx
- Docker, Kubernetes
- Artifactory, Sonatype Nexus, Gitlab
- Monitoring (Datadog, Prometheus, Kibana and Grafana
Advantageous:
- Websphere MQ
- Tivoli LDAP 6.3 or higher
- Additional requirements: Exposure to the following is an added benefit:
- IBM WebSphere Portal
- Monitoring (Alerta, Sentry, Chronograf, Selenium)
Duties & Responsibilities
- Research and review relevant IT trends and best practices
- Implement and maintain the infrastructure required for implementing DevOps practices.
- Enable automated deployment of applications and configurations.
- Enable automated monitoring and alerting.
- Enable automated end-to-end testing.
- Enable continuous release processes, practices, and pipelines.
- Enable change management and audit requirements for release pipelines.
- Interest in designing, analyzing, and troubleshooting large-scale distributed systems.
- Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
- Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C#, TypeScript
- Ability to debug and optimize code and automate routine tasks.
- Scale systems sustainably through mechanisms such as easy-to-use tooling and automation
- Practice sustainable incident response and drive root case analysis.
- Collaborate with development teams and other stakeholders to identify potential risks and remediation
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
- Platform Development (Pulumi, Terraform, CloudFormation, Resource Manager, Ansible, Python, wsadmin, GIT, Gitlab CI, Jenkins)
Competencies
- Excellent problem-solving, organizational and communication skills;
- System analysis skills;
- Must be willing to work after hours and perform standby duties on multiple applications;
- Must be able to work under pressure;
- Must be a team player and able to build relationships;
- Must be detail-oriented
Deadline:10th June,2025
Method of Application
Interested and qualified? Go to Momentum Metropolitan Holdings Limited on momentumgroupltd.erecruit.co to apply
Leave a Comment