Senior DevOps/Site Reliability Engineer

Based in San Francisco, MyTime is a fully integrated scheduling, payments, and automated marketing platform, specializing in large multi-location chains and franchises. Our mission-critical software (in-store scheduling and online booking, client record management, email and SMS marketing, and a full point of sale for handling payments) is used in every aspect of the customer journey. We are backed by Upfront and Khosla Ventures and are just getting started. We aspire to be the “operating system” of our customers’ businesses. Come help us realize our vision of becoming the world’s leading online scheduling and local commerce platform!

Job Description

As the functional leader of the DevOps environment for MyTime, you will ensure services and infrastructure are fast, stable, and scalable. You will build services and tooling that are not already available as open-source software. You will also own operational work: infrastructure, systems administration, configuration management, and the build and release lifecycle.

The ideal candidate has a strong software development background along with a solid understanding of systems, database architecture, and data integrity. If you have a passion for programming and automation, and actively look for opportunities to build tools that streamline and simplify development, we would love to speak with you!

Key Responsibilities:

  • Oversee and manage the release process
  • Investigate and recommend best practices for maintaining code quality, including the development of code metrics, code review workflows, code coverage measurement, and the efficient use of static and dynamic analysis 
  • Actively mentor junior developers and train experienced engineers, improving their skills, their knowledge of our systems, and their ability to get things done!
  • Manage Backup and Recovery procedures, in accordance with our Disaster Recovery and Continuity policies
  • Coordinate with the appropriate teams to resolve high-severity or escalated incidents
  • Take part in on-call rotations for emergencies (resolving network, storage, DB, or memory issues)
  • Keep critical systems secure and compliant with company security policy, using best-in-class cloud security solutions
  • Maintain documentation and troubleshooting runbooks
  • Build and maintain tools for release, infrastructure, and application monitoring and operations
  • Evaluate open-source technologies to continually enhance our stack and processes

Qualifications

  • 5 years of infrastructure operations experience, including architecting databases and web servers for scalability and high availability
  • 5 years of experience with Amazon Web Services, MySQL and NoSQL databases, Docker containers, Elasticsearch clusters, Nginx web servers, and HAProxy load balancers strongly preferred
  • 5 years of experience building and maintaining a CI system, preferably with working knowledge of Jenkins
  • Familiarity with systems, networking, and software development (operating systems, firewalls, load balancers, web servers, application servers, etc.)
  • Familiarity with tools for monitoring (esp. CloudWatch, Grafana) and logging (esp. Kibana, Logstash) strongly preferred
  • A collaborative work style that can be effective in an Agile environment

Additional Information

MyTime provides a competitive salary and benefits package, early-stage equity, and a transparent and exciting startup culture that is singularly focused on empowering people to make an impact in their jobs. We’re growing fast and solving a big problem, giving our employees the opportunity to make the kind of impact that leads to true professional fulfillment. If you’re interested, we’d love to talk!
