Skip to main content

Staff Engineer - SRE

Do work which matters! CommBank is recognized as leading the industry in IT and operations with its world-class platforms and processes, agile IT infrastructure, and innovation in everything from payments to internet banking and mobile apps. Our teams utilize process excellence principles to drive timely, error free processing which is an essential component of the value proposition we offer our customers. Each of us puts the customer at the centre of everything we do, and we measure our performance against the Group’s external customer satisfaction measures.See yourself in our team.We are looking for a highly skilled and experienced Staff SRE (Site Reliability Engineer) to join our team. As a Staff SRE Engineer, you will play a crucial leadership role in ensuring the reliability, availability, and performance of our applications that support online channels. You will work closely with cross-functional teams to implement SRE principles and practices, troubleshoot complex issues, and build robust system engineering solutions.We support our people with the flexibility to balance where work is done with at least half your time each month connecting in office. We also have many other flexible working options available including changing start and finish times, part-time arrangements and job share to name a few. Talk to us about how these arrangements might work for you.We’re interested in hearing from people whoLead the implementation and advocacy for SRE principles to improve the reliability and availability of our applications.Oversee the monitoring of system performance and reliability, and proactively identify and address potential issues.Collaborate with development and operations teams to design and build scalable, reliable systems.Develop and maintain advanced automation scripts and tools to streamline operations and reduce manual intervention.Perform root cause analysis of critical incidents and implement corrective actions to prevent recurrence.Participate in and lead on-call rotations to provide 24/7 support for critical systems.Continuously improve incident response processes and procedures, setting high standards for the team.Conduct and facilitate post-incident reviews and implement lessons learned to enhance system reliability.Develop and maintain comprehensive documentation for system configurations, processes, and procedures.Mentor and guide junior SRE team members, fostering a culture of continuous learning and improvement.Stay up-to-date with industry best practices and emerging technologies in SRE and DevOps. Tech Skills:We use a broad range of tools, languages, and frameworks. We don’t expect you to know them all but experience or exposure with some of these (or equivalents) will set you up for success in this team. PostgreSQL development - schema design, performance tuning, PostgreSQL managementAWS RDS Aurora PostgreSQL managementLinux, PythonSource control, GitHub, GitHub actions, Automation tools such as Ansible, TerraformMongoDB, MS SQL database development, Oracle database development.AWS foundation services (EC2, Lambda, Cloudformation, System Manager, RDS)Extensive experience as an SRE Engineer or in a similar role, with a proven track record of leadership.Strong understanding of SRE principles and practices.Proficiency in troubleshooting complex issues and exceptional problem-solving skills.Deep knowledge of a wide array of software applications and infrastructure.Experience with monitoring and observability tools (e.g., Prometheus, Grafana, AppDynamics, Splunk, PagerDuty).Proficiency in scripting and automation (e.g., Python, Bash, Ansible).Familiarity with cloud platforms (e.g., AWS, Azure) and containerization technologies (e.g., Docker, Kubernetes).Excellent communication and collaboration skills.Ability to work in a fast-paced, dynamic environment.Strong attention to detail and a commitment to delivering high-quality results.Specific Technical SkillsAbility to debug and troubleshoot .NET and Java applications.Experience building observability solutions with Grafana and Prometheus.Proficiency in using Splunk for log management and analysis.Familiarity with CI/CD tools and practices.Preferred QualificationsExperience in the banking or financial services industry.Certification in relevant technologies (e.g., AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer).Knowledge of security best practices and compliance requirements.Working with us: Whether you’re passionate about customer service, driven by data, or called by creativity, a career with CommBank is for you. Our people bring their diverse backgrounds and unique perspectives to build a respectful, inclusive, and flexible workplace with flexible work locations. One where we’re driven by our values, and supported to share ideas, initiatives, and energy. One where making a positive impact for customers, communities and each other is part of our every day. Here, you’ll thrive. You’ll be supported when faced with challenges and empowered to tackle new opportunities. We’re hiring engineers from across Australia and have opened technology hubs in Melbourne and Perth. We really love working here, and we think you will too. If this sounds like the role for you then we would love to hear from you. Apply today!If you're already part of the Commonwealth Bank Group (including Bankwest, x15ventures), you'll need to apply through to submit a valid application. We’re keen to support you with the next step in your career.We're aware of some accessibility issues on this site, particularly for screen reader users. We want to make finding your dream job as easy as possible, so if you require additional support please contact HR Direct on 1800 989 696.Advertising End Date: 06/09/2024

Staff Engineer - SRE

Commonwealth Bank
Lindfield NSW 2070
Full time

Published on 08/30/2024

Share this job now