All roles

GCP Data Engineer with ML

Remote · USA Full-time New today

Role : GCP Data Engineer with ML knowledge Location : Remote (Preferably NY/NJ) Contract

Key Responsibilities

Pipeline Development & ETL: Design and deploy robust batch and streaming data pipelines using Cloud Dataflow (Apache Beam) and Cloud Pub/Sub. Data Modeling & Warehouse: Construct and optimize data models in BigQuery for high-performance analytics and ML model consumption. MLOps & Deployment: Operationalize ML models developed by data scientists, transitioning models from experimentation to production environments using Vertex AI. Feature Engineering: Collaborate with data scientists to implement feature engineering pipelines that automate the extraction of features from raw data for training. Data Security & Quality: Implement data governance, privacy, and security best practices (IAM, Data Loss Prevention) throughout the data lifecycle. Automation: Automate data workflows and orchestration using Cloud Composer (Apache Airflow). Monitoring & Optimization: Monitor pipeline performance using Cloud Monitoring and optimize for cost and speed.

Required Qualifications

Experience: 3-5+ years of experience in data engineering, with at least 2+ years focused on GCP. Programming Skills: Expert-level SQL and strong Python programming skills. GCP Expertise: Proven experience with Cloud function, Cloudrun, GCE, GKE, BigQuery, Dataflow, Dataproc, pub-sub, Google Cloud Storage, and Vertex AI. Programming Skills: Expert-level SQL and strong Python programming skills. ML Knowledge: Understanding of machine learning fundamentals (training, testing, evaluation, drift) and feature engineering techniques. Strong understanding of SQL and unstructured data management. Hand-on experience with Docker, Kubernetes (GKE), and CI/CD tools. Infrastructure as Code: Experience with Terraform to provision and manage infrastructure. Education: Bachelor's degree in Computer Science, Engineering, or a related field.

Preferred Qualifications

Certification: Google Cloud - Professional Data Engineer Certification. MLOps Specialization: Experience with Kubeflow or Vertex AI Pipelines. Data Modeling: Strong understanding of data warehouse modeling patterns (Kimball/Inmon). Key Technologies GCP Core: Cloud function, Cloudrun, BigQuery, Dataflow, Pub/Sub, Composer, Dataproc, Vertex AI. Languages: Python, SQL Frameworks: Apache Beam, Apache Spark. Tools: Terraform, Git, Docker, Kubernetes. Apply tot his job Apply To this Job

Related roles

[Remote] Senior GCP DevOps Engineer

Remote · USA Full-time

Sr. DevOps Engineer - Multiple roles - Remote

Remote · USA Full-time

Java with DevOps Engineer (Entry/Remote)

Remote · USA Full-time

Senior SRE/DevOps Engineer

Remote · USA Full-time

Senior SRE / DevOps Engineer – Kubernetes & VMware 8

Remote · USA Full-time

Google Cloud Engineer Internship Program

Remote · USA Full-time

AI Platform Engineer (Google Cloud Platform)

Remote · USA Full-time

Lead GCP Engineer: AI Platforms & Development

Remote · USA Full-time

DevOps Engineers job at Ingram Micro in Irvine, CA

Remote · USA Full-time

SRE/DevOps Engineer

Remote · USA Full-time

Software Developer - AI Trainer

Remote · USA Full-time

Sports Data Analyst

Remote · USA Full-time

Experienced Data Analyst – High-Level Investigation, Content Group at arenaflex

Remote · USA Full-time

Immediate Hiring: arenaflex Customer Service Representative - Work From Home Opportunity

Remote · USA Full-time

Pharmacy Technician IV - Patient Assistance Coordinator

Remote · USA Full-time

Senior Informatics Technical Specialist - Weekly Overnight Travel - Midwest Territory

Remote · USA Full-time

Experienced Customer Order Specialist – CAR-T Treatment Logistics

Remote · USA Full-time

Sr Manager Business Development, Data & Measurement

Remote · USA Full-time

Associate Systems Analyst - MyChart/Digital Experience

Remote · USA Full-time

Experienced Customer Support Representative – Delivering Magical Experiences for Disney Fans Worldwide at arenaflex

Remote · USA Full-time