Cloud Operations Engineer

This Jobot Job is hosted by Dee Nguyen Are you a fit? Easy Apply now by clicking the “Apply Now” button and sending us your resume. A bit about us Our client has been helping service provider customers deliver the very best TV experiences to as many of their subscribers as possible. We constantly leverage the latest, technological innovations to help improve performance – and future-proof CapEx-heavy CPE investments. Why join us? Our people are a smart, nimble and talented bunch who work hard and have fun. We offer frequent all-hands meeting with the CEO, company sponsored events, celebrations and rewards as well as the opportunity to work in a small but dynamic environment that is changing the world of virtualization. Our vision is any experience, any network, any device. We are uniquely positioned to make it happen, and we are always looking for the best talent that is key to our innovations. Job Details About the role As a Cloud Operations Engineer you will use leading edge technologies to build, deploy, operate, and maintain configuration management and orchestration routines to deliver and scale applications and services in virtualized environments and in the cloud as part of a small, geographically distributed Cloud Operations team. We are committed to delivering best-in-class system uptime and operations observability through automation and instrumentation. This is a key technical role within the Cloud Platform and Operations organization that interfaces closely with the Cloud Infrastructure, Engineering, Product and Customer Engagement teams. The ideal candidate is a self-starter with a strong focus on collaboration, automation, continuous improvement, and an innovative mindset that will lead to operational efficiencies, and increased compliance. The candidate must have a track-record of managing live productions environments with strict availability targets and extensive experience building automation to support continuous software releases. What you’ll do Deliver configuration management and orchestration routines to deploy and scale applications and services in virtualized and cloud environments operate and maintain these routines in production Support product development teams in the delivery of continuous integration, continuous deployment, providing templates and patterns to follow to ensure code produced by product development teams can be deployed and scaled on standardized technologies and platforms Perform root cause analysis for production issues where the root cause is in infrastructure, environment, configuration, or deployment routines understand when to escalate to product development teams remediate root causes and implement preventative actions Participate in on-call rotation and afterhours maintenance when necessary, respond to major incidents, and participate in bridge calls when called upon in support of initiatives and incident response Actively collaborate with the product, engineering and QA teams to build automated testing and monitoring of deployments Revamp and continuously optimize application release cycles as production environments and product suite scales Participate in Change Management activities which include reviews, approvals, rollback plans, and live operations transition Applying automation where possible to reduce manual and repetitive tasks Architect and implement monitoring, reporting and centralized dashboarding solutions with visibility to internal and external customers What you’ll need 3 + years of experience as a DevOpsSRE Engineer operating an Public Cloud platform 2+ years experience working with configuration management and orchestration technologies such as Cloud Formation, Ansible or comparable Knowledge of application performance monitoring Knowledge of cloud infrastructure principles (load balancing, high availability, server-based and serverless architecture, database configurations) Extensive knowledge of troubleshooting in a Linux environment Experience managing cloud-native applications in Docker containers with Kubernetes orchestration In-depth knowledge of Bamboo, Jenkins, Artifactory or similar CICD tools Proficiency in Python, bash or other programming language Ability to quickly learn new and existing technologies Experience troubleshooting using monitoring and logging tools such as Splunk, DataDog, NewRelic, etc in complex cloud-based environments Ability to work in fast paced and dynamic environment Strong written and verbal communication skills AWS Certified – Associate Certification Interested in hearing more? Easy Apply now by clicking the “Apply Now” button.

Related Post