Search by job, company or skills

L

Big Data Lead Engineer

5-10 Years
Save
  • Posted a month ago
  • Over 100 applicants
Quick Apply

Job Description

Key Responsibilities

  • Design, develop, and maintain scalable data pipelines using Apache Spark, Databricks, Python, and PySpark.
  • Build and optimize data processing workflows for structured, semi-structured, and unstructured data.
  • Develop data solutions using Azure services including Azure Databricks and Azure Data Factory.
  • Implement data lakehouse architecture using medallion design principles.
  • Ensure data quality, reliability, security, and compliance across all pipelines and systems.
  • Optimize Spark jobs using performance tuning techniques and debugging tools like Spark UI.
  • Work with storage formats such as Delta Lake, Parquet, and Avro for efficient data handling.
  • Develop automation solutions to reduce manual effort and improve delivery speed.
  • Implement CI/CD practices using GitHub or GitLab for code versioning and deployment.
  • Collaborate with stakeholders to understand requirements and translate them into technical solutions.
  • Monitor production systems, troubleshoot incidents, and ensure system stability.
  • Contribute to best practices, architecture standards, and knowledge sharing within teams.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Logicplanet IT Services (India) Pvt. Ltd., incorporated in 2007 and headquartered in Hyderabad, operates as a software publishing, consulting, and IT solutions provider. The company delivers enterprise technology services including software development, digital transformation, and IT staffing solutions. With expertise in areas such as embedded systems, QA automation, ERP, and cloud technologies, Logicplanet supports global clients by combining technical innovation with workforce solutions, positioning itself as both a technology partner and a recruitment facilitator.

Job ID: 147075249

Similar Jobs

Pune, India

Skills:

agile management CassandraKafkaNosqlRDBMSSolrOracleHadoopScrumRedisRuplambda architectureAirflowOO ModelingData lakes and related ecosystemsflink data streamingBig Data solutions using SparkPresto technologyrule enginesperformance benchmarking testing for Big data technologiesXpscripting and automationcaching components or servicesML models engineeringelastic