Senior Software Engineer II, Unified Alerting

Redwood City, CA

Location: open to 100% remote

The proliferation of machine log data has the potential to give organizations unprecedented real-time visibility into their infrastructure and operations. With this opportunity come tremendous technical challenges around ingesting, managing, and understanding high-volume streams of heterogeneous data.

As a Senior Software Engineer - Machine Learning, you will contribute to and lead the design and development of innovative ML-powered capabilities for alerting and remediation products. You will work through the entire feature lifecycle, including ideation, dataset construction, experimental validation, prototyping, production implementation, deployment, and operations.

Responsibilities:

  • Identifying and validating opportunities for the application of ML or data-driven techniques
  • Driving technical delivery through the full feature lifecycle, from idea to production and operations
  • Collaborating within and beyond the team to identify problems and deliver solutions
  • Assessing requirements and approaches for large-scale data and ML platform components
  • Collaborating with UX/UI teammates on the usability of product features
  • Owning the uptime and reliability of delivered services and capabilities, including on-call rotation
  • Developing supporting tooling, automation, and microservices to accelerate the team

Requirements:

  • B.S. / M.S. / Ph.D. in Computer Science or related disciplines
  • 8+ years of industry experience with a proven track record of ownership and delivery
  • Excellent collaboration and communication skills
  • Experience with software engineering of production-grade services in cloud environments
  • Experience formulating use cases as ML problems and putting ML models into production
  • Knowledge of and/or curiosity to learn about specific Sumo Logic customer problem domains
  • Operational excellence orientation: SLIs/SLOs, monitoring and troubleshooting, on-call rotations
  • Solid grounding in core ML concepts, basic statistics, and the judicious use of abstraction
  • Familiarity with LLM core concepts, libraries, and application design patterns

Desirable:

  • Cloud-based application and infrastructure deployment and management
  • Common ML libraries (e.g., scikit-learn, PyTorch) and components (e.g., Airflow, MLflow)
  • Relevant cloud provider services (e.g., AWS SageMaker)

About Us

Sumo Logic, Inc. empowers the people who power modern, digital business. Sumo Logic enables customers to deliver reliable and secure cloud-native applications through its Sumo Logic SaaS Analytics Log Platform, which helps practitioners and developers ensure application reliability, secure and protect against modern security threats, and gain insights into their cloud infrastructures. Customers worldwide rely on Sumo Logic to get powerful real-time analytics and insights across observability …