Cassandra Site Reliability Engineer at Grubhub
Chicago, IL, US / New York City, NY, US / Philadelphia, PA, US

Grubhub is dedicated to connecting hungry diners with our wide network of restaurants across the country. Our innovative technology, easy-to-use platforms and streamlined delivery capabilities make us an industry leader today, and in the future of online food ordering. 

 

We strive to create a workplace that reflects the diversity of our customers and the communities we serve. When you join our team, you become part of a community that works together to innovate, solve problems, take risks, grow, work hard and have a ton of fun in the process!

 

Why Work For Us

We have a fast-paced environment and that is what our teams thrive on. Grubhub believes in empowering people and offering opportunities for development, as well as professional growth.  We value strong, positive relationships in all areas: with each other, our customers and our greater community.  Want to be a part of a team of diverse collaborators in an authentically fun culture?  If so, we want to talk to you - and hear what’s your favorite restaurant for food delivery!

 

More About the Role

As a Site Reliability Engineer on the Platform Engineering team, it’s our job to provide the supporting infrastructure, automation as well as systems design knowledge to the team. You’ll be designing, building and testing Cassandra database infrastructure in all areas of the Grubhub infrastructure.  This core role owns the subject matter expertise in Cassandra and how to leverage it best.  You will also be called on to export knowledge and training to the developers and other engineers that leverage this platform on a day-to-day basis.  

 

We pride ourselves on a highly available, highly scalable infrastructure, and datastores are no exception.As a Cassandra Site Reliability Engineer on the Platform Engineering team, it’s our job to provide the supporting infrastructure, automation as well as systems design knowledge to the team. You’ll be designing, building and testing Cassandra database infrastructure in all areas of the Grubhub infrastructure. As a Cassandra Site Reliability Engineer on the Platform Engineering team, it’s our job to provide the supporting infrastructure, automation as well as systems design knowledge to the team. You’ll be designing, building and testing Cassandra database infrastructure in all areas of the Grubhub infrastructure. 

Some Challenges You’ll Tackle

  • Multi-region, high-availability database clusters using Cassandra
    • And wondering why is the speed of light so slow
  • Real time data warehouse needs
  • RPO/RTO - Ensure backups and restores work perfectly
  • Monitoring - Build and manage the metrics and alerting for these critical systems
  • Build - Write code to automate operations, build services and tools that make our company more effective

 

Below is a list of some of the tools we’re either currently using or have made the decision to use and will begin implementing shortly, hopefully with your help.

 

  • Puppet
  • Ansible
  • Linux (CentOS, Ubuntu), Apache HTTPD, Tomcat, Flask
  • Python, Scala, Java
  • CollectD, ELK Stack, Splunk, DataDog
  • All the AWS things
  • Cassandra, MySQL, Redis
  • Stash/Git, Confluence, Jira, Jenkins

You Should Have

Soft Skills

  • A passion for accuracy
  • Be gutsy, and someone who isn’t afraid to try new things
  • Are you someone who is comfortable failing, learning and trying again?
  • Must be data obsessed - measure, optimize, repeat
  • Someone with an automation mindset
  • Be eager to share and document your knowledge to share with everyone

 

Hard Skills

  • Demonstrated experience designing, building, and operating Cassandra clusters
    • Bonus if you can explain paxos
  • Coding:  Ability to write python code for automation.  Bonus for application development in python.
  • Ability to analyze metric data for capacity planning
  • Linux - Must be able to explain IPC
  • Experience building and running CI pipelines
  • Experience with RDBMS such as MySQL, Postgres, or Oracle.
  • Knowledge of distributed systems design and constraints

 

We’ve got a lot of challenges ahead of us and a lot of really awesome things we intend to get done. If you’re looking for a new challenge and want to join an awesome team, we’d love to hear from you. Thanks!

And Of Course, Perks!

  • Flexible PTO. Grubhub employees are provided a generous amount of time to recharge their batteries. 
  • Health and Wellness. We provide programs that support your overall well-being such as generous medical benefits, employee network groups, company-wide fitness challenges, and a comfortable and casual workplace! We also support our parents by offering 8 weeks of paid parent bonding time, a 4-week returnship program, and 6-8 weeks paid medical leave.
  • Learning and Career Growth. Your personal and professional development is a priority at Grubhub. From day one, we empower you to lead and be an active participant in your career growth.  We provide continuous learning opportunities, training, and coaching and mentorship programs.  
  • MealPerks. Who’s ready for some lunch? We provide our employees with a weekly Grubhub credit  to enjoy and support local restaurants. We also offer company-wide meals several times a year to bring our Grubhub family together.
  • Fun. Every Grubhub office has an employee-led Culture Crew that connects people through fun, meaningful events and initiatives. Some of our popular past events include: Wing-eating contests, Grubtoberfest, 5k Runs, Bring Your Child to Work Day, regular happy hours, and more!
  • Social Impact. We believe in the importance of serving the communities that support our business. In addition, employees are given paid time off each year to support the causes that are important to them.