Data Engineer
Innodata India Private Limited
Job Description
Role- GCP Data Engineer
Experience- 6years+
Responsibilities:
• Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
• Build ETL scripts using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
• Develop and optimize data pipelines for ingestion, transformation, and loading into enterprise data lakes and warehouses.
• Build and extend end-to-end data and BI solutions, spanning extraction, storage, transformation, and visualization layers.
• Partner with supply chain, real estate, and AI/ML teams to provide pipelines for AI solutions (e.g., RAG ingestion, Copilot integration, multi-agent workflows).
• Ensure data governance, lineage, and compliance across supply chain datasets.
• Continuously optimize query performance, ETL processes, and pipeline reliability. Skills Required
• Strong hands-on expertise with GCP services: BigQuery, Dataflow, Pub/Sub, Cloud Storage, Looker/BI (or similar).
Skills Required
• Strong hands-on expertise with GCP services: BigQuery, Dataflow, Pub/Sub, Cloud Storage, Looker/BI (or similar).
• Advanced proficiency in SQL (complex queries, optimization) and Python (data engineering, scripting, APIs).
• Experience building ETL/ELT pipelines operating on structured and unstructured data sources.
• Knowledge of enterprise data warehouse and data lake architectures.
• Exposure to data pipelines for AI/ML (vector DB ingestion, embeddings, RAG pipelines, copilots, agents). • Familiarity with supply chain or data center operations data is a strong plus.