Key Responsibilities:
Design and develop scalable ETL pipelines using Python, PySpark, and Azure Data Factory.
Build Azure Functions for automation, event-driven tasks, and integration workflows.
Optimize SQL and PySpark jobs for performance and cost efficiency.
Manage and orchestrate data pipelines across Synapse, Data Lake, and Blob Storage.
Implement RBAC, data encryption, and governance policies to ensure compliance.
Collaborate with DevOps teams to deploy CI/CD pipelines for data engineering solutions.
Work closely with cross-functional teams to translate business requirements into technical designs.
Maintain technical documentation, architecture diagrams, and testing frameworks.
Technical Skills Required:
Programming: Python (Advanced), PySpark, SQL
Azure Cloud: Azure Functions, Synapse Analytics, Data Factory, Data Lake, Blob Storage
ETL Tools: Azure Data Factory, Databricks (preferred), or custom Python ETL
Databases: Azure SQL, PostgreSQL, or equivalent RDBMS
Version Control & CI/CD: Git, GitHub, Azure DevOps, YAML pipelines
Data Architecture: Data Modeling, Data Warehousing, Governance
Monitoring: Azure Monitor, Application Insights, Log Analytics
Soft Skills:
Strong analytical and problem-solving skills.
Excellent communication and collaboration in remote teams.
Adaptable to fast-paced Agile environments.
Preferred Qualifications:
Microsoft Certified: Azure Data Engineer Associate (or equivalent).
Experience with Power BI / Tableau visualization tools.
Familiarity with REST APIs, JSON, and AI/ML data pipelines.
Why Join Us:
Remote-first culture with flexible work hours.
Exposure to global-scale AI and Data Automation projects.
Collaborative, innovation-driven environment.
Continuous upskilling through enterprise-level engagements.
© 2025 xTag Services Pvt Ltd. | All Rights Reserved.