Job Description:
A legal technology startup is seeking Senior Data Engineer to build and own the core data infrastructure behind an AI-driven analytics platform. This role is ideal for someone who enjoys solving complex data problems, working with messy unstructured datasets, and building systems from the ground up in a fast-moving startup environment.
Location: Remote (travel 1x/quarter)
Salary: Up to 220k base + equity
Responsibilities:
-
Design and maintain scalable production data pipelines
-
Ingest, clean, and structure large unstructured datasets
-
Build OCR, PDF parsing, and document extraction workflows
-
Develop semantic search and LLM-assisted retrieval systems
-
Improve pipeline reliability, performance, and scalability
-
Collaborate closely with engineering and product teams
Requirements:
-
4+ years of experience building production data pipelines
-
Experience with law firm data
-
Strong Python and SQL skills
-
Experience with ETL/ELT, OCR, and document extraction tooling
-
Familiarity with RAG, semantic search, vector databases, and AI-assisted extraction workflows
-
Experience handling large, messy, or external/regulatory datasets
-
Comfortable working autonomously in fast-paced environments
-
Bachelor’s degree in CS, Data Science, or related field preferred
Preferred:
-
Startup experience
-
Experience with AI-enabled products or retrieval systems
Keywords: Python, data pipelines, dbt, (Airflow or Dagster), startup or reputable company: Meta/Palantir
Qualified candidates, please send your resume to Hazem Kamal, Hazem@analyticrecruiting.com | For more opportunities, please visit www.analyticrecruiting.com.