Senior Data Architect (AI & AI-Assisted Development)

Remote · USA Full-time

Business Area: IT
Seniority Level: Mid-Senior level

Job Description:

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we’re the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

We are seeking an experienced Senior Data Architect (AI-First Data Architecture & AI-Assisted Development) with 5+ years of experience designing scalable enterprise data platforms and enabling modern AI-driven ecosystems. The ideal candidate will bring deep expertise in data warehousing and lakehouse architectures, combined with hands-on experience in AI governance, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), semantic data architectures, and AI-assisted development practices.

This role extends beyond traditional data architecture by partnering with Business Intelligence, Data Science, Engineering, and AI teams to build AI-ready data foundations. The architect will lead the design of data models, metadata frameworks, and governance practices that optimize enterprise data for AI consumption, intelligent search, agentic workflows, and RAG-based applications. A key focus will be establishing robust metadata, business definitions, lineage, data tagging, and semantic structures to improve the accuracy, discoverability, and scalability of AI-powered solutions.

The successful candidate will drive AI-first data acquisition, curation, and governance strategies that support business intelligence, advanced analytics, and AI-driven decision-making across Finance, Sales, and other strategic business domains. They will also champion AI-assisted architecture and documentation practices to accelerate delivery, improve productivity, and create reusable patterns that enable both users and AI systems to effectively discover, understand, and leverage enterprise data.

This role will lead the evolution of intelligent, governed, and scalable data platforms that seamlessly integrate traditional data engineering with next-generation AI-powered capabilities, ensuring the organization’s data ecosystem is optimized for the future of AI-enabled business operations.

As a Sr. Data Architect you will:

Design and implement scalable data warehouse and lakehouse architectures on the Cloudera platform.
Define enterprise data models, governance frameworks, data stewardship processes, security standards, and data quality practices.
Architect and optimize analytics solutions across SQL engines including Impala, Hive, and Iceberg.
Design AI-powered analytics solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), vector databases (such as PostgreSQL, Qdrant, Milvus), and NLQ capabilities.
Lead the integration of AI/ML capabilities into enterprise data platforms and data pipelines while establishing governance controls for AI models, data usage, and lifecycle management.
Leverage vibe coding / AI-assisted development tools to accelerate development and improve productivity.
Build and optimize batch and near real-time data pipelines.
Collaborate with business stakeholders to translate business requirements into scalable data products and analytics solutions.
Establish best practices for performance optimization, data architecture, and AI-assisted development.
Mentor teams on modern data architecture and AI-enabled development methodologies.
Ensure data security, governance, compliance, and responsible AI practices within enterprise data platforms and AI-enabled solutions.
Collaborate with business stakeholders across FP&A, Sales, and Revenue Operations to translate business requirements into scalable data solutions that support financial forecasting, revenue optimization, budgeting, pipeline analysis, and sales forecasting.

We are excited about you if you have:

Bachelor’s degree in Computer Science or equivalent and 5-6 years of related experience; OR Master’s degree and 3-5 years of related experience; OR PhD and 0-3 years of related experience.
Deep expertise in enterprise data warehousing, lakehouse architectures, and Cloudera-based data platforms.
Strong experience with CDP, including HDFS, Hive, Impala, Kudu, and Cloudera data ingestion and processing frameworks.
Strong understanding of distributed data systems and Hadoop-based architectures.
Advanced SQL skills, including performance tuning and query optimization.
Proficiency in Python and data engineering frameworks.
Experience with dimensional and normalized data modeling.
Strong understanding of data governance, lineage, metadata management, data cataloging, enterprise security, and compliance requirements.
Experience implementing AI governance practices including model governance, AI risk management, explainability, monitoring, and responsible AI controls.
Experience implementing AI/ML, LLM, vector database, and RAG-based solutions in production environments.
Familiarity with AI-assisted development tools (e.g., GitHub Copilot and LLM-powered workflows).
Strong communication, stakeholder management, and problem-solving skills.
Ability to align enterprise data architecture with business objectives in Finance, Sales, and Revenue Operations.
Ability to bridge traditional data platforms with modern AI capabilities.

You might also have:

Experience with CDP Public Cloud and Private Cloud deployments.
Knowledge of Cloudera Data Warehouse (CDW), Cloudera Data Engineering (CDE), Kafka, Spark, and streaming architectures.
Experience with generative AI, vector databases, modern AI data architectures, and AI governance frameworks.
Understanding of Data Mesh, Data Fabric, and enterprise governance operating models.
Experience working with Salesforce, NetSuite, and other enterprise business systems.
Experience supporting FP&A, Sales Analytics, and executive reporting environments.

What Makes This Role Unique

Lead architecture for a modern Cloudera-centric enterprise data platform.
Drive the convergence of data warehousing, lakehouse architectures, and AI innovation.
Influence enterprise-wide data, governance, and AI strategy.
Champion AI-assisted development practices to accelerate productivity.

This role is not eligible for immigration sponsorship.

What you can expect from us:

Generous PTO Policy
Support work-life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups

EEO/VEVRAA