Senior Site Reliability Engineer
United States | Remote
About Upstart
Upstart is a leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than two-thirds of Upstart loans are approved instantly and are fully automated.
Upstart is a digital-first company, which means that most Upstarters live and work anywhere in the United States. However, we also have offices in San Mateo, California; Columbus, Ohio; and Austin, Texas.
Most Upstarters join us because they connect with our mission of enabling access to effortless credit based on true risk. If you are energized by the impact you can make at Upstart, we'd love to hear from you!
The Team
Upstart's Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart's production systems. The SRE team builds tooling and automation to monitor the health of our infrastructure and create a fast, reliable, and productive environment for other engineers and a world-class experience for our customers. SRE defines Upstart's strategy for technology operations risk mitigation, which includes disaster planning and on-call procedures. We use data-driven approaches to drive our decisions, and provide reports and insights to the business to improve visibility into the system and customer experience.
As a Senior Site Reliability Engineer, your work will directly impact the success of the SRE team. Your expertise will inform the team's direction, and your work with other SREs and Upstart engineers will make Upstart's systems as effective as possible for our customers. SRE at Upstart is ever-changing, and you will be a primary contributor in shaping our future path.
How you'll make an impact
- Embody and share SRE principles at Upstart
- Exercise state-of-the-art SRE practices throughout the company
- Uphold a culture of visibility, ownership, and responsibility around service reliability
- Implement standards for monitoring microservices, web apps, mobile apps, databases, Kubernetes clusters, and machine learning platforms in a fast-paced environment
- Improve incident response practices, both within SRE and throughout the company
- Automate away toil wherever automation makes sense
Minimum Qualifications
- Minimum of 5 years as an SRE or DevOps Engineer
- Minimum of 8 years in software or high tech industries
- Software engineering background, including CI/CD, TDD, or other agile development techniques
- Experience with on-call and incident management …
Job Profile
Benefits
- Annual wellness, technology & ergonomic reimbursement programs
- Catered lunches + snacks & drinks
- Competitive compensation
- Comprehensive medical, dental, and vision coverage
- Employee stock purchase plan
- Generous holiday, vacation, sick and safety leave
- Health Savings Account contributions
- Life and Disability insurance
- Social activities
- Supportive parental, family care, and military leave programs
Skills
Agile Development, AI, Automation, CI/CD, Datadog, Data-driven, DevOps, Digital-first, Engineering, Go, Grafana, Incident Response, Kubernetes, Machine Learning, Marketplace, Microservices, Monitoring, Operations, Python, Software Engineering, SRE, TDD, Terraform
Experience
5 years
Timezones
America/Anchorage, America/Chicago, America/Denver, America/Los_Angeles, America/New_York, Pacific/Honolulu, UTC-10, UTC-5, UTC-6, UTC-7, UTC-8, UTC-9