Data Engineer/II
Bachatt
Job Description
Job Title: Data Engineer
Location: Gurugram, Haryana, India

About Bachatt: Bachatt has transformed how India saves. Bachatt is a young Indian fintech startup founded by three friends from IIT/IIM with 10+ years of relevant experience (https://bachatt.app). Bachatt provides a savings and investment platform built specifically for the over 30 crore self-employed individuals in India, who often have irregular income streams and find regular mutual funds and investment platforms daunting.
Bachatt is on a mission to help over 30 crore Indians invest in mutual funds and generate wealth beyond the usual instruments of FDs and Post Office schemes. Further, the vision is to empower them to access the full suite of financial products, such as loans, insurance and credit cards, in a simplified and curated experience much like the one an offline insurance or mutual fund agent provides. The company was founded in November 2024 and has raised over $15 million in two rounds of funding from marquee investors including Lightspeed Venture Partners, Info Edge Ventures and others.
Founders: Anugrah Jain (CEO), Ankur Jhavery, and Mayank Agarwal

Experience: 4+ Years

Role Summary: Build and maintain the enterprise data lake, design ETL pipelines, develop ML models for forecasting, and create AI agents/MCP integrations using LLM APIs.

Required Skills
- Python ETL: Pandas, NumPy, data modelling, API integrations
- SQL: Complex queries, schema design, performance tuning
- GCP: BigQuery, Cloud Storage, Cloud Run, Secret Manager
- Data Lake Design: Ingestion from ERP/CRM systems (NetSuite, Salesforce), schema evolution, data quality
- REST API: Development and consumption (OAuth, webhooks)
- Git & CI/CD: Version control and deployment basics

Preferred Skills
- AI Agent Development: Tool-calling agents, MCP servers, LLM APIs (Claude, Gemini, OpenAI)
- Machine Learning: Time series forecasting, predictive modelling (Prophet, XGBoost, SARIMAX)
- Orchestration: Cloud Scheduler, Airflow, or cron-based job pipelines

Key Responsibilities
- Build and maintain the data lake on BigQuery: ingestion, transformation, scheduling
- Design and implement ETL pipelines (Python) across banking, ERP, and CRM sources
- Experiment with and deploy ML models for cash forecasting and business predictions
- Develop AI agents and MCP tool integrations using LLM APIs
- Ensure data quality, monitoring, and alerting across pipelines