Lead Data Engineer Job at WorkHQ, Los Angeles, CA

b2FyNElSd0RjUExTZStpUkhQZnhKQWk2OEE9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote work, Shift work,

Similar Jobs

Aston Martin of Chicago

Automotive Parts Manager Job at Aston Martin of Chicago

The Ed Napleton Automotive Group is looking for our next Automotive Parts Manager. Located at Aston Martin of Downers Grove, the Automotive Parts Manager is responsible for managing the parts department including the hiring, training and development of parts advisors and... 

Mayo Clinic

Chaplain - Jacksonville Job at Mayo Clinic

 ...and FSAs for eligible expenses. Retirement: Competitive retirement package to secure your future. Responsibilities The chaplain provides emotional support, spiritual counsel and comfort in addressing the spiritual needs of patients as assigned. This... 

Vision Centric Inc.

Data Analyst Job at Vision Centric Inc.

At Vision Centric Inc., were looking for a detail-oriented Data Analyst who thrives on solving complex problems, ensuring accuracy, and delivering actionable insights. If youre passionate about data integrity and want to make an impact, this is your opportunity. You... 

Navarro Inc.

Hydrogeologist/Geologist (3737) Job at Navarro Inc.

 ...environmental monitoring, data evaluation, and field investigation. The ideal candidate will have a strong technical background in geology, hydrogeology, and environmental science, with experience designing and implementing field programs, interpreting environmental data... 

Peraton

Military Information Support Operations (MISO) Planner - TS/SCI w/CI POLY (Ft Meade MD) Job at Peraton

 ...service or joint level in support of operations at Combatant Commands or Joint Task Forces. Knowledge of service departments (Army, Navy, Marine Corps, and Air Force) functions and their relationships to CCMDs to coordinate PSYOP/MISO actions in appropriate channels, e...