Data Engineer
Kentucky
Job Highlights Location: Remote, must be based in the United States Salary Range: $103,500-$143,500 per year, plus benefits. Individual salary offers will be based on experience and qualifications unique to each candidate. Office Hours: Availability for meetings between 8:00 am – 5:00 pm Eastern Time with flexibility dependent on employee location.Position Type: Grant funded, limited-term opportunity Position End Date: June 30, 2025.
Overview:The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements. The Data Engineer will work to build modern systems, tools and technology they need to address public health needs effectively.
This role is best for someone who is mission driven and is comfortable bringing a data driven solution to organizations who may be unfamiliar with this approach.
Working within Louisville Metro Department of Public Health and Wellness, the data engineer will collaborate closely with software development, a product manager, other data engineers, and program stakeholders to develop or support the deployment of an off-the-shelf mobile data entry/collection mechanism. This solution will be deployed by field staff who work daily to provide harm reduction and behavioral health equity-related interventions in populations at risk for drug misuse and/or overdose.
The Data Engineer will be hired by the CDC Foundation and assigned to the Louisville Metro Department of Public Health and Wellness. This position is eligible for a fully remote work arrangement for U.S. based candidates.
Responsibilities
- Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.
- Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.
- Optimize data pipelines, infrastructure, and workflows for performance and scalability.
- Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.
- Implement security measures to protect sensitive information.
- Collaborate with data engineers, analysts, and other partners to understand their data needs and requirements, and to ensure that the data infrastructure supports the organization's goals and objectives.
- Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.
- Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data.
- Design and manage data storage systems, including relational databases.
- Provide technical guidance to other staff.
- Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, Data Science, or a related field. Bachelor’s degree in business administration, Engineering Technology, Liberal Arts, Physical Science, Public Administration, Social Science, or related discipline will be considered.
- Minimum of 5 years of relevant professional experience. Six years of relational database management system (i.e. human resource management system, MS SQL) experience in a client/server environment preferred.
- Proficiency in programming languages commonly used in data engineering, such as Python, Java, SQL; including C#, C++.
- Candidate should be able to implement data automations within existing frameworks as opposed to writing one off scripts.
- Strong understanding of relational database systems (e.g., Microsoft SQL, MySQL, PostgreSQL)
- Experience with AWS tools for data engineering
- Experience regarding engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review.
- Knowledge of data warehousing concepts and tools.
- Knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporating the trends into the organization's data infrastructure.
- Experience with cloud computing platforms.
- Expertise in data modeling, ETL (Extract, Transform, Load) processes, and data integration techniques.
- Familiarity with agile development methodologies, software design patterns, and best practices.
- Strong analytical thinking and problem-solving abilities.
- Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively.
- Flexibility to adapt to evolving project requirements and priorities.
- Outstanding interpersonal and teamwork skills; and the ability to develop productive working relationships with colleagues and partners.
- Experience working in a virtual environment with remote partners and teams
- Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink a plus.
All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, religion, sex, national origin, age, mental or physical disabilities, veteran status, and all other characteristics protected by law.
We comply with all applicable laws including E.O. 11246 and the Vietnam Era Readjustment Assistance Act of 1974 governing employment practices and do not discriminate on the basis of any unlawful criteria in accordance with 41 C.F.R. §§ 60-300.5(a)(12) and 60-741.5(a)(7). As a federal government contractor, we take affirmative action on behalf of protected veterans.
The CDC Foundation is a smoke-free environment. Relocation expenses are not included. Apply
Job Profile
Fully remote Must be based in the United States
Benefits/PerksFlexible work hours Fully remote Grant funded opportunity Health protection mission Limited-term opportunity Remote work Remote work arrangement
Tasks- Collaborate with teams
- Collect and clean data
- Create and manage data systems
- Gather requirements
- Implement security measures
- Load data into storage
- Monitor data performance
- Optimize data pipelines
- Present findings
- Provide technical guidance
Agile Development Automated Testing AWS C Cloud Cloud Computing Collaboration Communication Continuous Integration Database Management Data Cleaning Data engineering Data Infrastructure Data Integration Data Modeling Data Monitoring Data Pipelines Data Science Data storage Data Transformation Data Warehousing Deployment Engineering ETL ETL Processes Flink Hadoop Harm reduction Information Technology Infrastructure Integration Java Kafka Modeling MySQL Peer review PostgreSQL Problem-solving Public health Python R Relational databases Security Software Development Source Control Spark SQL
EducationComputer Science Data Science Information Technology Public health Related Field
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9