Staff Data Engineer
US Remote
At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this, our teams harness the power of data and AI technology to unlock groundbreaking medical insights and convert those insights into action that results in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle. Visit h1.co to learn more about us.
Data Engineering is responsible for the development and delivery of our most important asset - our data. Looking across thousands of data sources from across the globe, the data engineering team is responsible for making sense out of that data to create the world’s most extensive and comprehensive knowledge base of healthcare stakeholders and the ecosystem they influence. It is our job to ensure that only accurate, normalized data flows through to our customers, and at a velocity that keeps up with the changes in the real world.
WHAT YOU'LL DO AT H1
As a Staff Data Engineer on the H1DN Team, you will be a hands-on technical leader, directly contributing to the development of scalable and efficient data architectures. You will collaborate closely with two IC team members, guiding them while actively participating in the work. This role centers on data ingestion and enrichment workflows, ensuring seamless integration of client data from various sources (CSV, Parquet, JSON, APIs) and addressing scalability, data quality, and standardization challenges.
You will:
- Lead the development of new features within our client data ingestion platform, transforming, standardizing, and enriching client data with the H1 dataset to meet business needs, including customizable solutions for high-stakes client integrations.
- Focus on optimizing infrastructure to deliver product- and client-ready insights efficiently, ensuring enriched data integrates seamlessly into broader pipelines.
- Collaborate with clients, subject matter experts, and product teams to drive critical integrations and shape the evolution of data workflows.
- Establish best practices for data quality, system reliability, and scalable processing to support growing datasets.
- Mentor and guide engineers on the team, promoting best practices and fostering a culture of technical excellence.
- Advocate for engineering improvements, including scalable designs, quality assurance, and technical documentation standards.
- Serve as a cultural leader within the engineering team, promoting high standards of excellence and continuous improvement in engineering practices.
- Ensure the projects you work on deliver clear end-user impact and align with strategic goals, and take accountability for meaningful outcomes.
ABOUT YOU
You have strong hands-on technical skills and substantial experience in data engineering, with a proven track record of building and maintaining scalable data systems and pipelines. As a proactive and visionary technical leader, you excel at solving complex data engineering challenges and driving innovative solutions.
- Proven ability to lead the development of complex data workflows, applying business logic for data enrichment and resolving challenges with creative solutions.
- Expertise in building and scaling data infrastructure, including integrating with core platforms.
- Strong experience addressing data quality challenges and implementing robust validation mechanisms.
- Self-motivated with the ability to manage tasks and projects independently.
- Able to understand and align with broader organizational goals and strategies.
- Proactively identifies potential risks and mitigates them early in the project lifecycle.
- Passionate about mentoring junior engineers and fostering a collaborative, high-performing team culture.
REQUIREMENTS
- 8+ years of experience in data engineering, specializing in building scalable data pipelines and enrichment processes, with a strong track record of working with large datasets, including ingestion, transformation, and optimization.
- Proficiency in Spark, Python, and SQL for building scalable data processing pipelines.
- Hands-on experience with Kubernetes for container orchestration and deployment.
- Strong background in AWS, including services such as S3, Lambda, ECS, and RDS for data infrastructure.
- Strong SQL skills, including the ability to write optimized complex queries for large datasets using advanced SQL operators such as GROUP BY, HAVING, window functions, and complex joins.
- Experience with EMR and Databricks to optimize large-scale data workflows.
- In-depth understanding of optimizing LLM usage in production, with experience integrating LLMs into real-world applications and applying LLM-powered insights within data pipelines or customer-facing solutions.
COMPENSATION
This role pays $155,000 to $175,000 per year, based on experience, in addition to stock options.
Anticipated role close date: 05/28/2025
H1 OFFERS
- Full suite of health insurance options, in addition to generous paid time off
- Pre-planned company-wide wellness holidays
- Retirement options
- Health & charitable donation stipends
- Impactful Business Resource Groups
- Flexible work hours & the opportunity to work from anywhere
- The opportunity to work with leading biotech and life sciences companies in an innovative industry with a mission to improve healthcare around the globe
H1 is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, ancestry, national origin, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law. H1 is committed to working with and providing access and reasonable accommodation to applicants with mental and/or physical disabilities. If you require an accommodation, please reach out to your recruiter once you've begun the interview process. All requests for accommodations are treated discreetly and confidentially, as practical and permitted by law.
Job Profile
Flexible work hours, Fully remote, Health insurance, Paid Time Off, Retirement options, Stock options, Work from Anywhere
Tasks
- Advocate for engineering improvements
- Collaborate with clients and teams
- Ensure project alignment with goals
- Establish data quality practices
- Lead data ingestion platform development
- Mentor engineers
- Optimize infrastructure for insights
APIs, AWS, Best Practices, CSV, Data, Data Architecture, Data engineering, Data ingestion, Data Pipelines, Data Quality, Data standardization, JSON, Life sciences, Mentoring, Parquet, Scalability, Technical Leadership
Experience
5 years
Timezones
America/Anchorage, America/Chicago, America/Denver, America/Los_Angeles, America/New_York, Pacific/Honolulu, UTC-10, UTC-5, UTC-6, UTC-7, UTC-8, UTC-9