PySpark Technical Lead

6 - 8 years

19.0 - 21.0 Lacs P.A.

Chennai

Posted:2 months ago| Platform: Naukri logo

Apply Now

Skills Required

TrainingArchitectureScalabilitysparkMachine learningData processingTechnical LeadDeploymentManagementAWS

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will collaborate closely with our Data Scientists to develop and deploy machine learning models. Proficiency in below listed skills will be crucial in building and maintaining pipelines for training and inference datasets. Responsibilities: Work in tandem with Data Scientists to design, develop, and implement machine learning pipelines. Utilize PySpark for data processing, transformation, and preparation for model training. Leverage AWS EMR and S3 for scalable and efficient data storage and processing. Implement and manage ETL workflows using Streamsets for data ingestion and transformation. Design and construct pipelines to deliver high-quality training and inference datasets. Collaborate with cross-functional teams to ensure smooth deployment and real-time/near real-time inferencing capabilities. Optimize and fine-tune pipelines for performance, scalability, and reliability. Ensure IAM policies and permissions are appropriately configured for secure data access and management. Implement Spark architecture and optimize Spark jobs for scalable data processing. Total Experience Expected: 06-08 years Professional degree

Information Technology & Services
Lyon

RecommendedJobs for You

Chennai, Pune, Mumbai, Bengaluru, Gurgaon

Chennai, Pune, Delhi, Mumbai, Bengaluru, Hyderabad, Kolkata

Pune, Bengaluru, Mumbai (All Areas)