FreshRemote.Work

Site Reliability Engineer - Remote

About Us

Onebrief makes military planning seamless and represents a shift in paradigm for future military decisions. It is an all-in-one tool that supports both the creative and process-oriented aspects of military planning. In Onebrief, planners use maps, boards, diagrams, timelines, slides and written products to create their plans—all while sharing a common database. Everything stays in sync, in real time. Our approach has been refined and validated through hundreds of user experiments.

Our product is currently in broad use at 8 of the largest military headquarters in the world. 3 of the 4 biggest operational plans in the US are currently built with Onebrief. Last year, we achieved 100% gross retention and 158% net retention—our revenue grew 4x and reached double-digit millions. We are backed by Y Combinator (S21) and top-tier VCs, including Caffeinated Capital (Affirm, Docker, Notion, and more) and Human Capital (Andruil, Brex, Snowflake, and more), and have raised a total of $44.6M in venture capital. Our elite team combines the best of tech and military talent, including education and experience at Google, Twitter, Adobe, MIT, Harvard, Delta Force, TOPGUN, and more.

What you will achieve

You’ll be the first line of support for our mission critical deployments, and responsible for ensuring best in class service quality and issue resolution. You’ll work in both on-premise DoD environments and AWS cloud environments. Your lessons from the field will shape how our team works, from policy to implementation.

In addition to working at the customer, you’ll contribute directly to solutions that increase stability, performance, and security of our deployments, and improve the overall experience of deploying and managing Onebrief on premise.

This role sits in our Technology Operations organization.

Core skills and technologies: VMWare, Kubernetes, Docker, Helm, Ansible, Terraform, Linux, AWS, DoD compliance

About You

This is an opportunity for candidates who are flexible to travel and work outside regular business hours with moderate frequency.

Qualifications

  • You are a trained and experienced (3+ years) SRE engineer, automating software delivery and deployment, and providing documentation and self-service tools to engineering teams and customers

  • You have an active security clearance, are familiar with the DoD IT environment, and have first-hand experience managing mission critical systems inside DoD’s air gapped networks.

  • You are experienced working in DoD on-premise environments and AWS cloud environments.

  • You understand Linux, containers, virtual machines, and Kubernetes, and know how to harden them in accordance with RMF security controls and STIGs/SRGs

  • You area proficient using VMWare, Docker, Helm, Ansible, and Terraform

  • You are at home at the command line

  • You have experience setting up backups, logging, and alerting at various layers of the OSI model to ensure SLAs meet customer satisfaction levels

  • You are experienced with system monitoring and can pro-actively identify future issues and/or needs for increased capacity

  • You have a strong understanding of incident response processes and how to conduct root cause analysis

  • You are creative and self-reliant, able to operate in air-gapped environments with limited tools and little external help

  • You work well with developers to ensure efficient and secure development and operations

  • You have a Security+ certification or another DoD 8570.01 approved security certificate.

  • You are obsessed with creating value for real users

  • You are committed to performing up to your potential

  • You are ambitious, scrappy, and a creative problem-solver

  • You learn quickly, work iteratively, and naturally seek collaboration

  • You approach your work with integrity, intellectual honesty, and a low ego

  • You communicate frankly, clearly, and succinctly

  • You thrive as a self-starter, embracing autonomy and ambiguity

  • You are a U.S. citizen

About Us

Our team shares a mission, seeks excellence, and plays to win, with the seriousness and camaraderie of an Olympic team. We are in this together, not just because Onebrief will one day be 10x the size of Palantir. We are here to radically improve the future of military decisions, so that the coming decades don’t repeat the previous ones.

Here’s what our team members value most about working here:

  • Founders' transparency

  • Product and vision

  • Nimble leadership

  • Challenges and learning opportunities

  • Ownership and autonomy

  • No non-sense policies and procedures

  • Remote-first

What's in it for you
  • Reimbursement for relocation expenses up to $10k

  • Equity

  • Remote work, flex time, and unlimited PTO

  • Health, dental, vision, and life insurance

  • 401k

  • Parental leave

  • Company summit trips, at least once a year

  • $1,000/year budget for home office improvements

Apply

Job Profile

Skills

Ansible AWS Docker Helm Kubernetes Linux Terraform VMWare

Tasks
  • Contribute to stability, performance, and security improvements
  • Ensure service quality and issue resolution
  • Manage deployments on premise
  • Support mission critical deployments
Experience

3+ years

Certifications

Security+

Restrictions

Flexible to travel and work outside regular business hours U.S. Citizen