Lead Data Engineer

Machine Learning & Infrastructure

Equity+, Hybrid, work at the beach!

Cardiff By the Sea, California

Random Walk has delivered best-in-class alternative data insights focused on consumer email behavior for over a decade.

We are seeking a Lead Data Engineer to serve as the technical owner for the infrastructure and systems powering an exciting new product. This product is focused on improving how Fortune 500 brands market new products and services to customers and leads. This is a hands-on role in which you will lead our technical processes and manage a lean development team, directly impacting the systems that help leading institutional asset managers uncover market inflections.


Role & Responsibilities

This role is focused on building and scaling the data and machine learning infrastructure.

  • Build Data Pipelines: Design, build, and maintain robust, scalable data pipelines to process high-volume email text data.
  • Implement LLM Systems: Engineer systems that leverage LLMs to perform semantic analysis and feature extraction on text data at scale. This includes API integration, fine-tuning, and optimizing for latency and cost.
  • MLOps & Deployment: Own the end-to-end lifecycle of machine learning models. You will build, deploy, and maintain our primary recommendation and personalization models (e.g., collaborative filtering, gradient boosting, neural networks) in a production environment.
  • Infrastructure Ownership: Design and manage the cloud-based infrastructure required for data processing, model training, and real-time inference, ensuring high availability and scalability.
  • System Integration: Collaborate closely with the backend engineering team to integrate data services and ML models into our core application. The models must execute with minimal latency to ensure a seamless user experience.
  • Privacy by Design: Implement and enforce data handling and modeling practices that are fully compliant with our strict privacy-first approach, ensuring all systems work exclusively with anonymized, non-PII data.

Qualifications

We are looking for a hands-on builder with a strong engineering background.

  • Experience: 5+ years of experience in data engineering or backend engineering, with a proven track record of building and deploying data-intensive applications and machine learning systems in production.
  • ML in Production: Demonstrated experience with the full MLOps lifecycle, including deploying, monitoring, and maintaining machine learning models. Experience with personalization and recommendation systems is a major plus.
  • LLM Application: Hands-on experience building applications that use Large Language Models (LLMs) via APIs or by deploying open-source models. Familiarity with frameworks like LangChain or Hugging Face is highly desirable.
  • Backend Proficiency: Strong proficiency in backend development with languages like Python or Go.
  • Infrastructure & Cloud: Deep experience with cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and data orchestration tools (e.g., Airflow, Prefect).
  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
  • Leadership: Strong communication and leadership skills with a passion for building consumer-centric experiences in an agile environment.
  • Location: Preference for candidates located in Southern California.

Compensation & Benefits

  • Competitive salary and equity incentives
  • Medical, dental & vision insurance
  • Hybrid remote work benefits
  • Commuter benefits
  • Free lunches

Please submit your CV with a brief note (4-5 sentences) describing specifically how your recent experience will help you lead in this role.

sales AT ranwalk.com