Senior Bioinformatics Data Engineer
Remote
BioAge Labs is a venture-backed biopharmaceutical company headquartered in Richmond, CA. Because aging is the driver of many highly morbid, chronic medical conditions that reduce the length and quality of life, we are developing therapeutics that target the fundamental mechanisms of aging, with the goal of treating disease and increasing healthy lifespan.
We are seeking a Senior Bioinformatics Data Engineer to lead the design and development of robust, scalable data architecture and pipelines that enable the ingestion, integration, and harmonization of large, diverse biological data sets.
The Job
This role is pivotal in setting the standards, protocols, and best practices for our data architecture and infrastructure, ensuring that all systems are efficient, traceable, scalable, and maintainable. Embedded within the Data Science team, you will collaborate with engineers, data scientists, and domain experts to establish and maintain cutting-edge systems that drive the company's mission of leveraging omics, drug, target, and pathway data for biological discovery and therapeutic innovation. This is a high-impact role where you will not only build and optimize technical solutions but also set the foundation for long-term success by defining and implementing best practices for data infrastructure across the organization.
Key Responsibilities
Infrastructure Development & Standards
- Lead the design, development, and implementation of scalable, high-performance data infrastructure to support diverse biological data sets and downstream applications.
- Establish and enforce best practices, protocols, and policies for data management, processing, and infrastructure maintenance.
- Create standards for system architecture to ensure traceability, usability, integrity, and scalability of all data systems.
- Proactively identify opportunities to optimize and future-proof infrastructure for evolving data needs.
Data Architecture & Pipelines
- Design, implement, and maintain efficient ETL pipelines to process and harmonize internal and external biological data sources.
- Integrate large-scale datasets, such as OpenTargets, StringDB, and the Human Protein Atlas, into unified, accessible formats.
- Build infrastructure to support advanced applications, such as knowledge graphs and AI-driven tools, ensuring seamless interoperability with analytical workflows.
Collaboration & Leadership
- Partner with data scientists, engineers, and bioinformaticians to ensure infrastructure aligns with analytical and research objectives.
- Mentor and guide other engineers, instilling best practices in infrastructure design, data management, and software development.
- Foster a culture of collaboration, continuous learning, and high standards within the team.
Qualifications
Education
- BS in Computer Science, Bioinformatics, or a related field; MS or PhD preferred.
Experience
- 5+ years of experience in bioinformatics or software development in biotech, pharmaceuticals, or a related field.
- Demonstrated success in designing and deploying scalable ETL pipelines and infrastructure for large-scale biological datasets.
- Proven track record of establishing standards and best practices for data architecture and processing systems.
- Experience working in an agile development environment with test-driven development methodologies.
Technical Expertise
- Proficient in data engineering programming languages such as Python, R, SQL/no-SQL, and languages for big data and cloud platforms.
- Proficiency with Google Cloud Platform (GCP) and experience in managing cloud-based systems.
- Familiarity with public domain data sets and tools, such as OpenTargets, STRING DB, and the Human Protein Atlas.
Skills
- Strong leadership and mentoring capabilities, with the ability to guide teams in adopting best practices.
- Excellent organizational skills, with a focus on creating maintainable and scalable systems.
- Exceptional communication skills, enabling effective collaboration across cross-functional teams.
Preferred
- Experience with knowledge graphs or similar advanced data structures is a plus.
Our company
BioAge is a platform-driven, clinical-stage biotechnology company that is mapping human longevity to change the nature of aging and extend healthy lifespan. Our growing portfolio of therapeutics for immune, muscle, and brain aging includes four drug programs, two first-in-class and two first-in-indication. Our vision is “growing older without aging” - a future in which aging allows us to pursue our goals, accumulate new experiences and accomplishments, and actively contribute to society without disease, physical disability, or loss of independence and connection. A leading company in the emerging longevity biotech sector, BioAge has raised $127M from Andreessen Horowitz, Kaiser Foundation Hospitals, and others.
Our workplace
BioAge offers competitive salary, a comprehensive compensation package, and generous paid time off in addition to company-observed holidays. We provide comprehensive health and wellness benefits (medical, dental, and vision insurance) and a 401(k) retirement savings plan with matching employer contributions, and we support families through childcare and fertility benefits. We also pride ourselves in giving employees many opportunities for career development, including a generous annual budget for continued learning and a dedication to training and skill development. Many positions (including this one) are remote, allowing our team members to work from anywhere. The salary for this role is expected to be approximately $170,000 - $200,000 for someone who meets all of the minimum qualifications as listed for the role, and may be adjusted accordingly for someone with more or less experience. We are open to the concept that different experience levels add value to the team in different ways, and therefore will consider a variety of experience and offer commensurate pay.
At BioAge, we embrace diversity and differences while we learn from each other, and we believe that every team member has an important role to play. We are an equal opportunity employer. BioAge prohibits discrimination and harassment of any type and affords equal employment opportunities to employees and applicants without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. BioAge conforms to the spirit as well as to the letter of all applicable laws and regulations.
ApplyJob Profile
Tasks- Collaborate with teams
- Design data infrastructure
- Establish best practices
- Implement ETL pipelines
- Mentor engineers
Agile Development AI tools Bioinformatics Biological datasets BioTech Cloud systems Collaboration Communication Data Architecture Data engineering Data Management ETL GCP Google Cloud Platform Knowledge graphs NoSQL Python R Scalable Infrastructure SQL Test Driven Development
Experience5 years
Education Timezones