Senior Software Engineer, Site Reliability at Auth0
United States of America
Auth0, a global leader in Identity-as-a-Service (IDaaS), provides thousands of enterprise customers with a Universal Identity Platform for their web, mobile, IoT, and internal applications. Its extensible platform seamlessly authenticates and secures more than 1.5B logins per month, making it loved by developers and trusted by global enterprises. Auth0 has raised more than $110 million to date and continues its global growth at a rapid pace. We are consistently recognized as a great place to work based our outstanding leadership and dedication to company culture, and are looking for the best people to join our incredible team spread across more than 35 countries!
 
Auth0 gives companies simple, powerful and developer friendly building blocks so they can free up resources to focus on innovation. We strive to be the identity platform of choice of developers and Enterprises. We take our culture very seriously and are looking for people who are drawn to both our mission and our culture.
 
The Auth0 platform processes thousands of requests per second (1.5 billion logins per month) for customers all around the world - and we're growing very fast! The Site Reliability team is a new initiative aimed at improving reliability and uptime in a data-driven way to support our customers' needs.
 
We are looking for software engineers with good understanding of how systems fail and a desire to learn about infrastructure.

You are a good fit if you...

    • Have initiative and can "unblock" yourself to get things done.
    • Tend to deliver work incrementally to get feedback and iterate over solutions.
    • Can mentor junior people and pair with other teams: education is a very important part of this role.
    • Like to get your hands dirty by debugging and fixing issues in production.
    • Understand the real problems by reading between the lines and asking good questions.
    • Are easy to work with: you communicate well, take feedback in a positive way and are OK not always doing the most glamorous tasks.

Responsibilities

    • Collaborate with other Engineering teams to support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
    • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
    • Scale systems sustainably through automation, and evolve systems by pushing for changes that improve reliability and velocity.
    • Be on-call for services that the SRE team on-boards.
    • Practice sustainable incident response and blameless postmortems.

Requirements

    • You are interested in designing, analyzing and troubleshooting large-scale distributed systems.
    • You have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
    • You have a great ability to debug and optimize code, and automate routine tasks.
    • You have designed applications and systems that scale, are resilient to failure, and are observable
    • Timezone: we are giving preference to candidates located in GMT-8 to GMT+2.

Extra Points

    • Experience with Amazon Web Services
    • Experience with Linux
    • Experience with Node.js, Golang, Python or any other application development language
    • Experience with MongoDB
    • Experience working in a remote-first , async environment
Auth0 is an Equal Employment Opportunity employer. Auth0 conducts all employment-related activities without regard to race, religion, color, national origin, age, sex, marital status, sexual orientation, disability, citizenship status, genetics, or status as a Vietnam-era special disabled and other covered veteran status, or any other characteristic protected by law. Auth0 participates in E-Verify and will confirm work authorization for candidates residing in the United States.