Job Description What you need to do :
- We’re building the most personalized and intelligent news experiences for India’s next 750 million digital users.
As Our Principal Data Engineer, You Will
- Design and maintain data infrastructure that powers personalization systems and analytics platforms with seamless data flow from source to consumption.
- Architect scalable data pipelines processing massive volumes of user interaction and content data across our news platforms.
- Develop robust ETL processes and implement distributed data processing workflows for large-scale transformations and analytical processing.
- Create and maintain data lakes/warehouses that consolidate data from multiple sources, optimized for ML model consumption and business intelligence.
- Implement data governance practices and partner with the ML team to ensure right data availability for recommendation systems.
Who You Need To Be
- Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or related field with 8-12 years of data engineering experience (3+ years in senior role).
- Expert-level SQL skills with strong experience in Apache Spark ecosystem (Spark SQL, Streaming, SparkML) and proficiency in Python/Scala.
- Experience with AWS data ecosystem (RedShift, S3, Glue, EMR, Kinesis, Lambda, Athena) and ETL frameworks (Glue, Airflow).
- Proven track record building large-scale data pipelines in production environments, preferably in high-traffic digital media.
- Excellent communication skills with ability to work effectively across teams in a fast-paced environment requiring engineering agility.
(ref:hirist.tech)