Staff Software Engineer - O11y, Asserts
United States (Remote)
This is a full-time remote position, and we're seeking candidates in the US or Canada.
What do we do?
Asserts.ai team joined Grafana Labs as part of an acquisition and is now growing inside the larger Observability group. The backend team is responsible for building and operating a set of low-latency microservices built in Java and databases like Graph DB. The services provide automated Root Cause Insights by actively querying the Grafana Cloud Metrics, Traces, and Logs databases and integrating with the Grafana platform components like Dashboarding and Alerting. The Asserts services are multi-tenant, highly available and process and store data in Graph and Time Series DB. We will soon be rolling it out to thousands of self-service customers, and it's critical that as we grow, we improve our performance, increase our reliability, and do it all more efficiently.
We're looking for a senior or staff level engineer with a strong distributed systems background to both build and lead the Asserts backend initiatives.
As a company we are remote-first and global, we embrace people of different experiences and backgrounds to build diverse teams where every person brings a unique perspective to the software.
What will you be doing?
- Drive technical and business strategy in the Observability department.
- Influence the product roadmap. Drive innovations from ideation to customer adoption.
- Drive system design. Create design documents, collaborate within and across teams.
- Work with your team to deliver new features, then use the results to iterate and improve.
- Build and operate critical systems. Own their reliability, performance, and availability.
- Participate in on-call rotations.
- Mentor and support other team members.
- Strive to become a subject matter expert for observability products and systems.
- Gain a deeper understanding of our cloud product, our customers and get to know the codebase of a large distributed system.
What are we looking for in you?
- You are a motivated self starter with a bias towards action.
- You are customer focused. We build everything with our users in mind. You have a passion for creating intuitive products that fit customers’ needs.
- You have experience delivering projects from gathering requirements, brainstorming ideas all the way to shipping a product to the customer’s hands in a self-driven way.
- You have experience in building and deploying SaaS software on any one of the cloud providers like AWS, GCP or Azure.
- You have strong experience in Java programming and have built and deployed Java based microservices.
- You have experience with Kubernetes.
- You have been responsible for operating production services and organizing/participating in on-call rotations.
- You actively mentor other team members, identifying areas for focus and improvement.
- You like to share your knowledge by creating blog posts, giving tech talks at meetups and conferences.
- You’re curious and enjoy learning new programming languages and frameworks, setting up examples, and figuring out how things work.
Nice to haves:
- Been a power user of Grafana and Prometheus in operational roles (including on-call for your team at a previous employer or just using these tools on hobby/homelab projects)
Why Join Us
- Be part of a remote-only, globally distributed team.
- Collaborate with passionate professionals across Grafana Labs.
- Participate in company-wide hack-a-thons that help drive the direction of real-world innovation.
- Attend cross-functional team off-sites
- Contribute to open-source projects and make a real impact.
- Enjoy a culture that values work-life balance, personal growth, and professional development.
About Grafana Labs
Grafana Labs is an open-source SaaS company specializing in observability and monitoring solutions. We help organizations make sense of their data through open, composable, and enterprise-ready monitoring and analytics tools. As a remote-only company, we embrace a global team culture that values collaboration, innovation, and personal growth.
Grafana operates in 32+ countries. We try to operate as one team and focus on global benefits which our whole team can enjoy. Inevitably there are some regional variations and we discuss the benefits offered in your country of residence through our interview process.
In the US, the Base compensation range for this role is USD 168,256 - USD 210,320. Actual compensation may vary based on level, experience, and skillset as assessed in the interview process. Benefits include equity, bonus (if applicable) and other benefits listed here.
*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market’s defined pay range & benefits at the beginning of the process
About Grafana Labs: There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a NASA launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps more than 3,000 companies -- including Bloomberg, JPMorgan Chase, and eBay -- manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo). Benefits: For more information about the perks and benefits of working at Grafana, please check out our careers page. Equal Opportunity Employer: At Grafana Labs we’re building a company where a diverse mix of talented people want to come, stay, and do their best work. We know that our company runs on the hard work and the dedication of our passionate and creative employees. If you're excited about this role but your experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique. We believe that equality and diversity builds a strong organization and we’re working hard to make sure that’s the foundation of our organization as we grow. For information about how your personal data is used once you’ve applied to a job, check out our privacy policy. ApplyJob Profile
Candidates must be in the US or Canada Remote Remote position
Benefits/PerksBonus Diverse teams Diversity Equal opportunity employer Equity Global team Hack-a-thons Open source contributions Other benefits Other benefits listed Professional development Remote-first company Remote work Work-life balance
Tasks- Build and operate systems
- Contribute to open-source projects
- Create design documents
- Deliver new features
- Drive technical strategy
- Influence product roadmap
- Iterate and improve
- Mentor team members
- On-call rotations
- Participate in on-call rotations
- Participating in on-call rotations
AI Alerting Analytics AWS Azure Cloud Cloud Computing Cloud providers Collaboration Dashboards Databases Design Distributed Systems GCP Grafana Grafana Cloud Grafana Loki Grafana Mimir Grafana Tempo Graph DB Java Kubernetes Logs Loki Metrics Microservices Mimir Monitoring Observability Observability products Open Source Product Roadmap Programming languages Prometheus SaaS SaaS Software Strategy Support System design Tech Tempo Time Series DB Traces Visualization
Education TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9