Skip to main content

Staff Site Reliability Engineer - PRE

header-funct-publicResponsibilities: Support critical applications and ensure the stability of the applications by performing proactive maintenance activities. Engage in automation activities. Supporting application and infrastructure based on new technologies like Kubernetes containers, Kafka, Graphana, Prometheus, Elastic etc. Perform root cause analysis and remediation. Good knowledge on Cloud and VM ware infrastructure. Good knowledge on F5 Load Balancer, TCP layer architecture. Good Experience on Kubernetes and Docker (preferable OpenShift, MKE vendor products). Basic knowledge of ansible and YAML scripting. Requires working knowledge of production support processes such as incident/change/problem management, call triaging, escalation procedures and such. Ability to write and maintain scripts to monitor system activity including application smoke test activities during pre and postproduction implementations. Monitor application performance (e.g. memory, logging, latency). Writing SQL queries for data analytics. Code release into Test and Production environments using industry standard deployment tools. Support application deployment using chef/Jenkins. Support Client escalated issues specific to applications. (e.g. increased latency, transactional issues, features not working as expected etc. ). Implement and maintain Performance monitoring dashboards using industry standard tools (Splunk, Thousand Eyes, Keynote, Runscope, Ghost inspector, Evolven, Graphite etc..).Experience: • 6 or more years of work experience with a Bachelors Degree or 4 or more years of relevant experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or up to 3 years of relevant experience with a PhD • Experience with application support organization working in 24*7 environments. • Experience in working with RDBMS DBs, Non-SQL DBs, MySQL DML/DDL, Oracle • Poses exceptional analytical, problem solving skills, oral and written communication skills • Basic level knowledge on Active/Active setup Application Experience in Production support working in a globally distributed team. Working experience on Java, J2EE and Python technologies. Experience with Service Now and ticketing workflows is preferred. Working experience with monitoring tools like SPLUNK or any other monitoring tools/processes will be advantageous. Prior working experience with Card and transaction domains will be advantageous. Should have a technical and business mindset ISO 9000 and ITIL experience will be advantageous. Understanding of core networking concepts such as routing, protocols, subnets, DNS, Certificates, Load balancer and firewall. Demonstrated proficiency in troubleshooting, root cause analysis, application design, and implementing major components for large projects. Offer: Anual bonus Pension plan Life Assurance Lunch Allowance Medical Insurance Health and fitness financial bonus Eye care reimbursement Stable employment conditions based on an employment contract A wide training package (soft and technical training offer, access to the e-learning platform, possibility of co-financing courses and certification) and moreCompany DescriptionExperis to światowy lider rekrutacji specjalistów i kadry zarządzającej w kluczowych obszarach IT. Z nami znajdziesz konkurencyjne oferty zatrudnienia oraz ciekawe projekty IT skierowane zarówno do ekspertów z wieloletnim doświadczeniem, jak i osób, które dopiero zaczynają swoją przygodę w branży IT.job-detail-footer

Staff Site Reliability Engineer - PRE

Experis
Warsaw
Full time

Published on 06/28/2024

Share this job now