Language Requirement: Spanish and English (required) About the Role This Data Engineer will own the data ingestion, transformation, and quality infrastructure that powers a high-performing consulting environment’s AI and data platform. Clean, structured data is the foundation for everything downstream, from analytics to AI-driven insights. You’ll build and maintain pipelines that take raw commercial datasets such as CRM exports, ERP data, sales data, compensation plans, and organizational structures, and turn them into reliable, structured inputs for analysis. This role requires strong engineering fundamentals paired with the ability to work across diverse, real-world datasets in both English and Spanish. What You’ll Do Build and manage data pipelines Design, develop, and maintain scalable pipelines to ingest, validate, and transform client data from multiple sources. Handle complex data wrangling Work through messy, inconsistent datasets across different formats and languages, ensuring clean and structured outputs. Design ETL processes Implement reliable and testable ETL pipelines that adapt to varying data structures across engagements. Own data quality and validation Develop validation frameworks to ensure completeness, consistency, and integrity across datasets. Generate actionable data quality insights. Design flexible data models that evolve with business needs while maintaining structure and usability. Incorporate third-party and internal datasets to enrich core data models and improve analytical outcomes. Collaborate cross-functionally Partner with AI and analytics teams to ensure data is structured and optimized for downstream use cases, including AI-driven workflows. What You Bring 3+ years of experience in data engineering, building ETL pipelines or data processing systems Strong Python skills for data processing (pandas, polars, or similar) Experience with data validation tools (Pydantic, Great Expectations, or equivalent) Solid experience with relational databases and data modeling Hands‑on experience working with messy, real‑world datasets from multiple sources Experience building end‑to‑end automated pipelines (ingestion through validation and monitoring) Familiarity with cloud‑based data storage and services Fluent in both Spanish and English , with the ability to work across bilingual datasets and teams Nice to Have Experience designing shared data models or ontologies Background in consulting, analytics, or enterprise data environments Familiarity with data contracts, APIs, or structured data delivery for AI/analytics use cases Python (pandas, polars), Pydantic, Great Expectations, PostgreSQL, SQL Server, cloud data platforms, ETL orchestration tools, APIs Why This Role This is a high‑impact contract role where your work directly enables analytics and AI outcomes. You’ll be building the data foundation that allows teams to move faster, make better decisions, and scale intelligently across engagements. #J-18808-Ljbffr