Pyspark

  • Posted a day ago

Job Description

Location: PAN India

Experience: 8 to 10 Years


Key Skills: PySpark, Python

Must have Skills:

  • Implementing data ingestion pipelines from different types of data sources, i.e. databases, S3, files, etc.
  • Experience in building ETL/Data Warehouse transformation processes.
  • Experience working with structured and unstructured data.
  • Developing Big Data and non-Big Data cloud-based enterprise solutions in PySpark and SparkSQL and related frameworks/libraries.
  • Developing scalable and reusable self-service frameworks for data ingestion and processing.
  • Integrating end-to-end data pipelines to take data from data source to target data repositories, ensuring the quality and consistency of data.
  • Processing performance analysis and optimization.
  • Bringing best practices in the following areas: Design & Analysis, Automation (Pipelining, IaC), Testing, Monitoring, Documentation.

Good to have (Knowledge):

  1. Experience in cloud-based solutions.
  2. Knowledge of data management principles.

More Info


Job ID: 149212713
