The team is looking for a seasoned SRE to come in on an “on-call” basis and be in charge of operating and scaling their data platform, running production services, and identifying and resolving critical incidents.
THE OPPORTUNITY FOR YOU
The SRE will have the opportunity to work closely with development and engineering teams to improve the data platform services, which support over 100 teams, and create better approaches to handling production issues. This person will need to have a strong understanding of how data services operate and interact, be able to respond quickly to incidents and outages, apply strategic approaches to monitoring and troubleshooting, and have the ability to immediately resolve and reconfigure any issues that are reported by this platforms many external and internal users.
KEY SUCCESS FACTORS
3+ years of experience with Hadoop & Spark
Strong understanding of how these services operate and interact, not simply experience using them
3+ years of programming experience with Python or Golang (highly preferable), or Java
Experience with AWS (EKS) and Kubernetes
#LI-LM1
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.
Thank you!
Your submission has been received, and we’ll be in touch with you shortly.
Oops! Something went wrong while submitting the form.
By clicking “Accept”, you agree to the storing of cookies on your device to enhance site navigation, store user preferences, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.