Staff/Senior Staff Data Scientist, Machine Learning
Remote
About Us
Valo Health is a technology company that is integrating human-centric data and AI-powered technology to accelerate the creation of life-changing drugs for more patients faster. Valo was created with the belief that the drug discovery and development process can and should be faster and less expensive, with a much higher probability of success. We are using models early to fail less often, executing clinical trials to add valuation to the company, and generating fit-for-purpose data to feed back into Valo’s Opal Computational Platform™ as we reinvent drug discovery and development from the ground up. Disease doesn’t wait, so neither can we.
We are a multi-disciplinary team of experts in science, technology, and pharmaceuticals united in our mission to achieve better drugs for patients faster. Valo is committed to hiring diverse talent, prioritizing growth and development, fostering an inclusive environment, and creating opportunities to bring together a group of different experiences, backgrounds, and voices to work together. We achieve the widest-ranging impact when we leverage our broad backgrounds and perspectives to accelerate a new frontier in health. Valo seeks to become the catalyst for the pharmaceutical industry and drive the digital transformation of the industry. Are you ready to join us?
About the Role
As a Senior Staff Data Scientist, Machine Learning or Staff Data Scientist, Machine Learning you will be a core member of a team of data scientists and engineers building a powerful computational platform for advancing the research and development of new medicines. As part of the Translational Platform Engineering team, you will help design, develop, and apply machine learning (ML) models, methods, and pipelines for scientific problems involving clinical and biomedical data. Successful candidates will work with a diverse set of data scientists, biological scientists, epidemiologists, and software engineers in ways that cut across traditional industry boundaries.
What You’ll Do…
- Provide technical leadership to propose, design, develop, and evaluate innovative deep learning models for learning patient representations from high dimensional electronic health records and omics data leveraging Valo’s proprietary platform (data assets and computational capabilities).
- Design, develop, and own deep learning pipelines to solve scientific problems.
- Propose and perform hands-on deep learning-based modeling of high-dimensional longitudinal data to generate fit-for-purpose evidence for projects.
- Contribute to planning, execution, interpretation, and communication of results.
- Collaborate with cross-functional teams and stakeholders to derive user requirements, maintain alignment, and ensure the relevance and impact of models, analyses, and pipelines.
- Be an active team member in code, design, and analysis review.
What You Bring...
- Degree in a quantitative field with the following years of post-degree experience or equivalent
-
- Senior Staff: 9+ (BS), 7+ (MS), or 5+ (PhD)
-
- Staff: 7+ (BS), 5+ (MS), or 3+ (PhD)
- Demonstrated experience designing, developing, applying, and evaluating the performance of deep learning approaches such as representation learning, transformers, sequence models, and self-supervised learning on high dimensional or multimodal data and approaches for explainability (Staff: 3+ years required; Senior Staff: 5+ years required).
- Demonstrated experience with ML on electronic health records (3+ years required).
- Proficient in Python (5+ years required) and with developing models using deep learning frameworks (e.g., pytorch) in cloud environments (e.g., AWS).
- Experience with collaborative software development using source control management (e.g., git, unit testing, code review, CI/CD) (3+ years required).
- Experience with MLops methodology such as workflow orchestration (e.g., Airflow, Prefect), experiment tracking (e.g., MLflow), containerization (e.g., Docker), and reproducible research.
- Experience with statistical methods such as hypothesis testing, longitudinal modeling, and time to event analysis.
- Strong work ethic with a bias for execution and an ability to manage multiple priorities, ambiguity, and tight timelines. Ability to work effectively in teams or independently.
- Experience with omics data is a plus.
- Familiarity with the drug discovery and development process is a plus.
More on Valo
Valo Health, LLC (“Valo”) is a technology company built to transform the drug discovery and development process using human-centric data and artificial intelligence-driven computation. As a digitally native company, Valo aims to fully integrate human-centric data across the entire drug development life cycle into a single unified architecture, thereby accelerating the discovery and development of life-changing drugs while simultaneously reducing costs, time, and failure rates. The company’s Opal Computational Platform™ is an integrated set of capabilities designed to transform data into valuable insights that may accelerate discoveries and enable Valo to advance a robust pipeline of programs across cardiovascular metabolic renal, oncology, and neurodegenerative diseases. Founded by Flagship Pioneering, Valo is headquartered in Lexington, MA with tissue engineering research based in New York, NY. To learn more, visit www.valohealth.com.
Staff Data Scientist Salary Range: $175,000 - $235,000
Senior Staff Data Scientist Salary Range: $215,000 - $280,000
Apply
Job Profile
Benefits/Perks Tasks- Collaborate with cross functional teams
- Design and develop deep learning models
- Evaluate model performance
- Maintain code and design reviews
AWS Biomedical data CI/CD Cloud environments Data analysis Data engineering Deep Learning Drug Discovery Electronic Health Records Git Machine Learning Python PyTorch
Experience5 years
Education