Python Spark Developer

Company:  Diverse Lynx
Location: Columbus
Closing Date: 19/10/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
Job Role : Python Spark Developer

Location : Columbus, OH

Experience : 9+ Years

Responsibilities:

Develop and maintain data platforms using Python, Spark, and PySpark.

Handle migration to PySpark on AWS.

Design and implement data pipelines.

Work with AWS and Big Data.

Produce unit tests for Spark transformations and helper methods.

Create Scala/Spark jobs for data transformation and aggregation.

Write Scaladoc-style documentation for code.

Optimize Spark queries for performance.

Integrate with SQL databases (e.g., Microsoft, Oracle, Postgres, MySQL).

Understand distributed systems concepts (CAP theorem, partitioning, replication, consistency, and consensus).

Skills:

Proficiency in Python, Scala (with a focus on functional programming), and Spark.

Familiarity with Spark APIs, including RDD, Data Frame, MLlib, GraphX, and Streaming.

Experience working with HDFS, S3, Cassandra, and/or DynamoDB.

Deep understanding of distributed systems.

Experience with building or maintaining cloud-native applications.

Familiarity with serverless approaches using AWS Lambda is a plus

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

Apply Now
Share this job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙