Principal Data Engineer
Remote US Canada
Hey, this job isn't fresh anymore! 👉 Find fresh remote jobs here
What You'll Do:As a pivotal member of the team, you will lead the design and development of a robust data architecture that guides data modeling, integration, processing, and delivery standards enabling modern data product development at Scribd.
You will also serve as a data and analytics solution architect, leading architecture initiatives encompassing data warehousing, data pipeline development, data integrations, and data modeling. You will shape Scribd’s data strategy, guiding stakeholders in how they consume and act on data.
We’re looking for someone with proven proficiency in architecting, designing and development experience with batch and real time streaming infrastructure and workloads. Your expertise will help establish standards for data modeling, integration, processing, and delivery and also help translate business requirements into technical specifications.
At Scribd, we leverage deep data insights to inform every aspect of our business, from product development, experimentation, to understanding our subscriber engagement and tracking key performance indicators. You'll join a data engineering team tackling complex challenges within a rich domain encompassing three distinct brands – Scribd, Everand, and Slideshare – all serving a massive user base with over 200 million monthly visitors and 2 million paying subscribers. You'll have the opportunity to make a real impact as we are heavily investing in improving our core data layer and this exciting new role puts you right at the forefront of this initiative.
Based on the project, this might involve cross-functional work with the Data Science, Analytics, and other Engineering and Business teams to design cohesive data models, database schemas and data storage solutions, consumption strategies and patterns. Almost everything you will be working on will be to increase the "customer satisfaction" for internal customers of Scribd data.
Required Skills:• 7+ years of experience in data strategy, data architecture, modeling, solution design, data engineering, or a similar role• Hands-on experience and knowledge of data lake technologies (Databricks, Snowflake, etc),data storage formats (Parquet, Avro etc.) and query engines (Athena,Presto etc.), data schemas, optimization of queries and associated concepts for building optimized solutions at scale• Strong understanding of distributed systems, Restful APIs and data consumption patterns• Proficiency in data modeling, ETL processes, and real-time and batch analytics frameworks.• Proficient with at least one dialect of SQL.• Hands-on experience in Scala or Python.
Desired Skills:• Experience and working knowledge of streaming platforms, typically based around Kafka.• Strong grasp of AWS data platform services and their strengths/weaknesses.• Hands on experience in implementing data …
You will also serve as a data and analytics solution architect, leading architecture initiatives encompassing data warehousing, data pipeline development, data integrations, and data modeling. You will shape Scribd’s data strategy, guiding stakeholders in how they consume and act on data.
We’re looking for someone with proven proficiency in architecting, designing and development experience with batch and real time streaming infrastructure and workloads. Your expertise will help establish standards for data modeling, integration, processing, and delivery and also help translate business requirements into technical specifications.
At Scribd, we leverage deep data insights to inform every aspect of our business, from product development, experimentation, to understanding our subscriber engagement and tracking key performance indicators. You'll join a data engineering team tackling complex challenges within a rich domain encompassing three distinct brands – Scribd, Everand, and Slideshare – all serving a massive user base with over 200 million monthly visitors and 2 million paying subscribers. You'll have the opportunity to make a real impact as we are heavily investing in improving our core data layer and this exciting new role puts you right at the forefront of this initiative.
Based on the project, this might involve cross-functional work with the Data Science, Analytics, and other Engineering and Business teams to design cohesive data models, database schemas and data storage solutions, consumption strategies and patterns. Almost everything you will be working on will be to increase the "customer satisfaction" for internal customers of Scribd data.
Required Skills:• 7+ years of experience in data strategy, data architecture, modeling, solution design, data engineering, or a similar role• Hands-on experience and knowledge of data lake technologies (Databricks, Snowflake, etc),data storage formats (Parquet, Avro etc.) and query engines (Athena,Presto etc.), data schemas, optimization of queries and associated concepts for building optimized solutions at scale• Strong understanding of distributed systems, Restful APIs and data consumption patterns• Proficiency in data modeling, ETL processes, and real-time and batch analytics frameworks.• Proficient with at least one dialect of SQL.• Hands-on experience in Scala or Python.
Desired Skills:• Experience and working knowledge of streaming platforms, typically based around Kafka.• Strong grasp of AWS data platform services and their strengths/weaknesses.• Hands on experience in implementing data …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
Regions
Countries
Remote only in US and Canada
Benefits/PerksCompetitive equity ownership Local cost of labor benchmarks Total compensation package
Tasks- Implementation
APIs Athena Avro AWS Batch processing CI/CD Compensation Data Architecture Databricks Data Delivery Data engineering Data Governance Data Integration Data Modeling Data Pipeline Development Data processing Data Warehousing Distributed Systems Docker Engagement ETL Git GitHub Jenkins Kafka Looker Parquet Presto Python Qlik RESTful API's Scala Snowflake SQL Tableau Terraform
Experience7 years
Education TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9
Remote Jobs in North America
Remote Jobs in Europe
Remote Jobs in Asia/Pacific
Remote Jobs in South America
Remote Jobs in Africa
Remote Jobs in Middle East
Full Time Remote Jobs
Part Time Remote Jobs
Internship Remote Jobs
Contract Remote Jobs
Temporary Remote Jobs
Freelance Remote Jobs
Mid-Level Remote Jobs
Senior-Level Remote Jobs
Entry-Level Remote Jobs
Exec-Level Remote Jobs
Lead-Level Remote Jobs
Remote Sales Rep Jobs
Remote Marketing Manager Jobs
Remote Program Manager Jobs
Remote Sales Representative Jobs
Remote Spanish Jobs
Remote Writer Jobs
Remote Advisor Jobs
Remote Finance Jobs
Remote Mobile Jobs
Remote Customer Success Jobs
Remote Data Scientist Jobs
Remote Technician Jobs
Remote Sales Manager Jobs
Remote Machine Learning Jobs
Remote Therapist Jobs
Remote Strategic Account Jobs
Remote Counsel Jobs
Remote Administrator Jobs
Remote Data Engineer Jobs
Remote Pathologist Jobs
Remote Jobs with GBP > 100K in Salary
Remote Jobs with CAD > 140K in Salary
Remote Jobs with CAD > 160K in Salary
Remote Jobs with GBP > 120K in Salary
Remote Jobs with EUR > 120K in Salary
Remote Jobs with GBP > 140K in Salary
Remote Jobs with CAD > 180K in Salary
Remote Jobs with GBP > 160K in Salary
Remote Jobs with JPY > 240K in Salary
Remote Jobs with JPY > 260K in Salary
Remote Jobs with JPY > 200K in Salary
Remote Jobs with JPY > 280K in Salary
Remote Jobs with JPY > 180K in Salary
Remote Jobs with JPY > 220K in Salary
Remote Jobs with PLN > 80K in Salary
Remote Jobs with JPY > 40K in Salary
Remote Jobs with PLN > 60K in Salary
Remote Jobs with JPY > 100K in Salary
Remote Jobs with JPY > 120K in Salary
Remote Jobs with JPY > 140K in Salary