 
 
CarGurus // online platform for used cars
 
Engineering, Full Time    Cambridge, MA    Posted: Monday, May 16, 2022
 
   
 
 
 
JOB DETAILS
 

SRE is responsible for the infrastructure, processes, & products that empower product engineering at CarGurus to build, scale, & operate their services reliably & autonomously, without the need to distribute operational knowledge around the organization. SRE owns reliability metrics, drives best practices, builds a culture around tracking & leveraging SLOs & error budgets in engineering decision making, owns the incident management process & reporting, & ensures our infrastructure scales up & down as efficiently as possible.

What You'll Do:
Collaborating with Engineering & Product Managers to define SLOs & monitor well-designed SLIs
Embedding with Engineering teams & independently addressing, or collaborating to complete, architectural improvements
Being the primary escalation point for major incidents involving assigned services
Participating in an on-call rotation
Owning our Incident Response Process, including conducting blameless Postmortems
Increasing robustness through workflow automation, process improvements, CI/CD pipelines, & integration of modern toolsets
Refusing to accept manual work as a solution to areas of weakness
Partnering with Engineering teams to ensure new services are production ready
Championing our organizational standards for designing, deploying, & scaling our products
Making data-driven decisions to drive continuous improvement
Evolving our tooling, logging, monitoring & alerting systems to increase observability & transparency

Who You Are:

A demonstrably strong background in software engineering with multiple languages & a firm belief in continuous testing & delivery, or significant relevant operational experience running services at scale

A bias for action, with sufficient emotional intelligence to approach colleagues with positive regard & to understand their challenges & decisions
Curiosity & the acceptance that there are always ways to learn & grow
The desire to be an active contributor in a collaborative & fast-paced environment
Excitement in solving puzzles & discovering how a new service or tool works by identifying the individual components, libraries, & relationships it is built upon
Understanding of technologies beyond coding such as Systems Engineering, Load Balancing, Configuration Management, Networking, Operating Systems, Troubleshooting, & Monitoring
Comfort in dealing with Incidents & Availability Issues
Familiarity with Cloud & Bare Metal infrastructure
Exposure to industry-standard observability tools & services

We recognize that flexibility plays a critical role in enabling our people to thrive in both their personal & professional lives. We currently welcome Gurus into our Cambridge, MA office on a voluntary basis but do not require employees to be physically in the office. We will adopt a hybrid working model when health experts & government officials in our local communities deem it safe to do so. Specific arrangements within this model will be at team leaders' discretion; we encourage you to discuss your questions & needs during the interview process.

All US CarGurus employees are required to provide proof of full vaccination against COVID-19, unless they have an approved medical or religious accommodation. This helps us to safeguard the health of our employees & their families, our customers & visitors, & the community at large.


 
 
 
 
 
 
 
 
© 2021 GarysGuide