Senior Data Engineer
Noida, Uttar Pradesh, India · पूर्णवेळ
अर्ज करणारे पहिले व्हा
- अनुभव
- 4+ yrs
- पगार
- INR 2,500,000 – INR 3,500,000 / year
- रिक्त जागा
- 1
- पोस्ट केले
- १ तास आधी
- Work mode
- कार्यालयात
- शिक्षण
- Computer Science Engineering
- Eligibility
- Candidates must have at least 4 years of experience and be Computer Science Engineering students/candidates who meet the stated requirements.
- Resume
- Required to apply
Where you'll work
नोकरीचे वर्णन
About the Role
This opportunity is with a technology company that focuses on building secure, sovereign, and scalable digital solutions across artificial intelligence, Web3, cloud, and health technology. The role is for a hands-on Senior Data Engineer who will architect, build, and support robust data pipelines, lakehouse systems, and analytics platforms. The environment uses modern open-source tools to power data insights and GenAI-enabled analytics.
Core Responsibilities
You will be responsible for designing and maintaining data ingestion systems, orchestrating workflows, supporting analytics layers, and contributing to lakehouse architecture and BI enablement.
- Develop and maintain scalable ETL and data pipelines using Airbyte across multiple source systems.
- Set up and manage connectors for PostgreSQL, MySQL, MongoDB, APIs, and SaaS platforms.
- Implement change data capture for near real-time replication, including Kafka-based streaming where needed.
- Define both incremental and full refresh synchronization approaches.
- Build validation mechanisms and data quality checks to ensure reliable data flow.
- Create and maintain Airflow DAGs for end-to-end ETL orchestration.
- Add robust retry handling, failure management, and pipeline monitoring.
- Schedule and coordinate complex workflows involving multiple processing steps and external systems.
- Design and tune ClickHouse schemas for OLAP and analytical workloads.
- Write high-performance SQL and improve query efficiency.
- Use materialized views and aggregation strategies to improve reporting speed.
- Manage partitioning and retention policies for warehouse data.
- Create analytics-focused data marts for downstream consumption.
- Build a data lakehouse on object storage such as Cloudian.
- Plan partitioning by dimensions such as date and region.
- Work with Parquet and ORC file formats.
- Apply access controls and governance practices to data assets.
- Support dashboard development in Apache Superset and Power BI.
Experience and Eligibility
Applicants should have at least 4 years of relevant experience. The role is intended for Computer Science Engineering students/candidates who meet the experience requirement.
Compensation
The annual salary range for this position is ₹25,00,000 to ₹35,00,000.
Additional Information
This is a full-time, on-site role based in Noida, Uttar Pradesh, India. One opening is available, and the expected start is immediate. The application deadline is 2026-09-30 23:59:59.