We are seeking a highly skilled Senior Data Engineer with strong expertise in PySpark and Python to join our team for a leading banking client in Dubai. The ideal candidate will be responsible for designing, building, and optimizing scalable data pipelines and data architectures to support advanced analytics and business intelligence initiatives.
Key Responsibilities:
- Design, develop, and maintain scalable data pipelines using PySpark and Python
- Work with large-scale distributed systems (Hadoop/Spark ecosystem)
- Build and optimize ETL/ELT workflows for structured and unstructured data
- Collaborate with data scientists, analysts, and stakeholders to deliver data solutions
- Ensure data quality, integrity, and governance across platforms
- Tune and optimize the performance of Spark jobs
- Work with cloud-based data platforms (AWS / Azure / GCP) where applicable
- Develop and maintain data models and schemas, and manage metadata
- Troubleshoot production issues and provide timely resolutions
Required Skills & Qualifications:
- 7 years of experience in Data Engineering
- Strong hands-on experience with PySpark and Python
- Solid understanding of Apache Spark and the Hadoop ecosystem
- Experience with SQL, data warehousing, and data modeling
- Familiarity with ETL tools and big data processing frameworks
- Experience with cloud platforms (AWS / Azure / GCP) is a plus
- Knowledge of data pipeline orchestration tools (e.g., Airflow)
- Strong problem-solving and analytical skills
Preferred Qualifications:
- Experience in the Banking / Financial Services domain
- Exposure to real-time data processing (Kafka, Spark Streaming)
- Understanding of data governance and security practices
Soft Skills:
- Strong communication and stakeholder management skills
- Ability to work in a fast-paced, collaborative environment
- Proactive and solution-oriented mindset