Data Engineer


Verana Health, a digital health company that delivers quality drug lifecycle and medical practice insights from an exclusive real-world data network, recently secured a $150 million Series E led by Johnson & Johnson Innovation – JJDC, Inc. (JJDC) and Novo Growth, the growth-stage investment arm of Novo Holdings. 

Existing Verana Health investors GV (formerly Google Ventures), Casdin Capital, and Brook Byers also joined the round, as well as notable new investors, including the Merck Global Health Innovation Fund, THVC, and Breyer Capital.

We are driven to create quality real-world data in ophthalmology, neurology and urology to accelerate quality insights across the drug lifecycle and within medical practices. Additionally, we are driven to advance the quality of care and quality of life for patients. DRIVE defines our internal purpose and is the galvanizing force that helps ground us in a shared corporate culture. DRIVE is: Diversity, Responsibility, Integrity, Voice-of-Customer and End-Results.

Our headquarters are located in San Francisco, and we have additional offices in Knoxville, TN and New York City, with employees working remotely in AZ, CA, CO, CT, FL, GA, IL, LA, MA, NC, NJ, NY, OH, OR, PA, TN, TX, UT, VA, WA, WI. All employees are required to have permanent residency in one of these states. Candidates who are willing to relocate are also encouraged to apply.

*Applicants must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of an employment visa at this time.

Job Title: Data Engineer

Job Intro:

The Data Engineering team is dedicated to building the end-to-end engineering platform at Verana Health using cloud-based solutions that are engineered for scale, availability and high performance in a distributed data processing environment.

Job Duties and Responsibilities:

We are looking for a self-starter to join our Data Engineering team. You will work in a fast-paced environment where you will get an opportunity to build and contribute to the full lifecycle development and maintenance of the data engineering platform at Verana Health.

With the Data Engineering team you will get an opportunity to:

  • Design and implement data engineering solutions that are scalable, reliable and secure in a cloud environment
  • Understand and translate Verana Health’s business needs into data engineering solutions
  • Build large-scale data pipelines that can handle big data sets, using distributed data processing techniques that support the efforts of the data science and data application teams
  • Partner with cross-functional stakeholders, including Product Managers, Architects, Data Quality Engineers, and Application and Quantitative Science end users, to deliver engineering solutions
  • Contribute to defining data governance across the Verana Health data platform

Basic Requirements:

  • A minimum of a BS degree in computer science, software engineering, or related scientific discipline is desired
  • 3+ years of work experience in building scalable and robust data engineering solutions
  • Strong understanding of object-oriented programming and proficiency with programming in Python (TDD) and PySpark to build scalable algorithms
  • 3+ years of experience in distributed computing and big data processing using the Apache Spark framework including Spark optimization techniques
  • 2+ years of experience with Databricks, Delta tables, Unity Catalog, Delta Sharing, Delta Live Tables (DLT), and incremental data processing
  • Experience with Delta Lake and Unity Catalog
  • Advanced SQL coding and query optimization experience including the ability to write analytical and nested queries
  • 3+ years of experience in building scalable ETL/ELT data pipelines on Databricks and AWS (EMR)
  • 2+ years of experience orchestrating data pipelines using Apache Airflow/MWAA
  • Understanding of and experience with AWS services, including ADX, EC2, and S3
  • 3+ years of experience with data modeling techniques for structured/unstructured datasets
  • Experience with relational/columnar databases (Redshift, RDS) and interactive querying services (Athena/Redshift Spectrum)
  • Passion for healthcare and improving patient outcomes
  • Demonstrated analytical thinking and strong problem-solving skills
  • Ability to stay on top of emerging technologies and a willingness to learn

Preferred Qualifications:

  • Experience with Agile environment
  • Experience operating in a CI/CD environment
  • Experience building HTTP/REST APIs using popular frameworks
  • Healthcare experience

Benefits:

  • We provide health, vision, and dental coverage for employees
    • For our PPO plans, Verana pays 100% of employee insurance coverage and 90% of family
    • For our HDHP plans, Verana offers an additional monthly $100 individual/$200 HSA contribution
  • 401(k) match - 3% match paid at year end, up to $1000/year
  • A generous parental leave policy and family building support through the Cleo Family
  • Flexible vacation plans
  • $700 Learning & Wellness stipend
  • $25/wk in DoorDash credit
  • Spring Health mental health support

Final note:

You do not need to match every listed expectation to apply for this position. Here at Verana, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.





Verana Health is committed to complying with all applicable pay transparency laws and supports equitable pay practices. We pay based on a market-based approach, supported with robust data and in alignment with the compensation of our existing team. We construct our compensation ranges based on the US national average, but your pay may vary depending on your location and the cost of living index for that geographic area. In determining an offer, base salary will also be based on experience, qualifications, skills and market conditions.

Please note pay ranges for major metropolitan areas may be different.

National Pay Range: $112,000 - $140,000 USD
