Randstad Technologies Recruitment

Lead PySpark Engineer

North London, London

Hours: Full Time
Posted: 3 hours ago
Salary: £281 - £292 per day
Recruiter: Randstad Technologies Recruitment
Closes: 09 May 2026
Course: No
Recruiter Type: Direct Employer

Description

PySpark Engineer Lead

As the Technical Lead, you will drive the high-stakes migration of legacy SAS analytics to a modern, cloud-native PySpark ecosystem on AWS. This isn't just a lift and shift: you will refactor complex procedural logic into scalable, production-ready distributed pipelines for a Tier-1 financial services environment.

Core Responsibilities

Engineering Leadership: Design and develop complex ETL/ELT pipelines and Data Marts using PySpark, EMR, and Glue.

Legacy Modernisation: Architect the conversion of SAS Base/Macros into modular, testable Python code using SAS2PY and manual refactoring.

Performance Tuning: Optimise Spark execution (partitioning, shuffling, caching) to ensure cost-efficient processing of massive financial datasets.

Quality & Governance: Implement rigorous CI/CD, unit testing, and data reconciliation frameworks to ensure "penny-perfect" accuracy.

Technical Stack

Engine: PySpark (Expert), Python (Clean Code/SOLID principles).

AWS: EMR, Glue, S3, Athena, IAM, Lambda.

Data Modelling: SCD Type 2, Fact/Dimension tables, Data Vault/Star Schema.

Legacy: Proficiency in reading/debugging SAS (Base, Macros, DI Studio).

DevOps: Git-based workflows, Jenkins/GitLab CI, Terraform.

Randstad Technologies is acting as an Employment Business in relation to this vacancy.

Ad ID: 5417993600
