Skip to main content

Site Reliability Engineer (SRE)

Responsibilities: Engage in and enhance the entire lifecycle of large-scale distributed systems, spanning system design consulting, launch reviews, deployment, operation, and refinement. Establish service availability across multiple global data centers. Develop tools and software to enhance service reliability, scalability, and operability. Measure and monitor availability, latency, and overall service health. Implement sustainable incident response protocols and conduct thorough postmortems. Participate in on-call rotations spanning multiple continents. Key Requirements: Bachelor's degree inputer Science with minimum 3 years of experience in a related field Expertise in Unix/Linux operating systems and IP networking. Proficiency in programming in at least one of the following languages: C, C++, Java, Python, Perl, or Go. Experience in problem-solving, resolving application issues, or managing production operations. Experience in automating routine tasks. Strongmunication skills with a sense of ownership and drive. Preferred experience in designing, analyzing, and troubleshooting large-scale distributed systems.

Site Reliability Engineer (SRE)

Morgan McKinley
Emory, TX 75440
Full time

Published on 06/28/2024

Share this job now