Implement methods to improve data reliability and quality
Combine raw data from different sources to create consistent and machine-readable formats
Structural Development:
Develop and test data structures that enable data extraction and transformation for predictive or prescriptive modeling
Process Definition:
Define and set development, test, release, update, and support processes for data engineering operations. Troubleshoot and fix code bugs
Big Data Models:
Develop big data models/use cases based on the data structure and prepare them for data operations end users
Query Execution:
Create and execute queries on structured and unstructured data sources to identify process issues or perform mass updates
Feature Layer:
Develop and implement the feature zone with the features/KPIs required by different stakeholders, in a form that supports building robust machine learning models
Batch Scheduling and Reporting:
Ensure that batch production scheduling and report distribution are accurate and timely
Ad Hoc Requests:
Fulfill ad hoc requests from users, such as data research, file manipulation/transfer, and investigation of process issues
Requirements
Core Competencies (Level 1, 2):
Performance Excellence
Collaboration & Creating Synergy
Agility & Resilience
People Centricity
Technical Competencies:
Basic to intermediate knowledge of Java, C#, and Python for developing robust applications
Basic to moderate experience with Cloudera or any big data platform and its complementary services, such as Apache Hive, Scala, BizSpark, Impala, Apache Spark, Data Security, Kafka, HBase, Sqoop, NiFi, Python for Programming, Python for Data Analysis & ML, etc.
Proficiency in handling and processing large datasets using distributed computing frameworks
Moderate knowledge of SQL for querying and managing relational databases
Understanding of data warehousing principles and best practices
Moderate experience with data analysis and visualization tools
Expertise in writing shell scripts using Bash, KornShell (ksh), or Bourne Shell (sh)
Domain Expertise:
Bachelor's degree in Computer Science/Engineering is preferred