IBM Cloud Site Reliability Engineer - HPCS
- Published on
About the Role
The IBM Cloud Site Reliability Engineering (SRE) team is working on providing infrastructure and operations solutions to maintain scalable, highly reliable, and highly secure cloud-based software infrastructures to enable our clients to meet their on-demand IT and security needs to disrupt their industries including Financial, Manufacturing, and Insurance. We seek creative, enthusiastic applicants ready to thrive in a collaborative environment.
Your Role and Responsibilities
As a Site Reliability Engineer, you will work in an agile, collaborative setting to build, deploy, configure, and maintain systems for the IBM client business. Your primary responsibilities include:
- 24x7 Observability: Monitor the health of production systems and services continuously, ensuring optimal customer experience.
- Cross-Functional Troubleshooting: Collaborate with engineering teams to troubleshoot and resolve production issues effectively.
- Deployment and Configuration: Utilize Continuous Delivery (CI/CD) tools for deploying services at enterprise scale.
- Security and Compliance Implementation: Enact security measures that meet industry regulations such as GDPR and HIPAA.
- Maintenance and Support: Apply security patches and assist in customer issue resolutions.
This role will require shift rotations, working Sunday to Thursday or Tuesday to Saturday.
Required Education and Technical Expertise
While formal education is not mandatory, the ideal candidate will have technical expertise including:
- Design and develop tooling for cloud service availability and efficiency.
- Manage infrastructure and services within IBM's Cloud ecosystem.
- Handle incidents timely and participate in agile team processes.
- Strong debugging and problem-solving skills are essential.
Preferred Experience
The ideal candidate holds a Bachelor's Degree in Computer Science and possesses experience with Linux, GitHub, Docker, Kubernetes, and similar technologies. A strong communicator and team player, you will thrive in a cross-functional setting, learning and sharing knowledge as you go.
About the Company
IBM Systems helps IT leaders think differently about cloud infrastructure, emphasizing innovation and security. We build technology designed for the future, optimizing it for cognitive business and cloud computing.