As a Software Engineer specializing in Apache Spark, you will develop and optimize large-scale data processing solutions to handle complex batch and micro-batch workloads. You will play a key role in ensuring the performance and cost-efficiency of our data pipelines while enabling data-driven insights at scale. What You'll Do Design and develop high-performance data processing jobs using Apache Spark, PySpark, and Scala. Build and maintain scalable data pipelines on platforms like Databricks to support enterprise analytics. Optimize Spark applications for memory management, CPU utilization, and overall execution cost. Implement robust data transformations and ensure data quality across large-scale distributed datasets. Collaborate with data architects to refine data models and improve the reliability of the processing ecosystem. What We Are Looking For 4+ years of experience in Software Engineering, Data Engineering, or Big Data. Proven expertise in developing large-scale batch and micro-batch data processing using Apache Spark. Proficiency in programming with Scala or PySpark for complex data transformations. Hands‑on experience with Databricks for performance tuning and cost optimization. Deep understanding of optimizing Spark jobs for memory and processing efficiency. Advanced proficiency in English. Benefits 100% remote work (from anywhere). Excellent compensation in USD or your local currency if preferred. Hardware and software setup for you to work from home. Flexible hours: create your own schedule. Paid parental leaves, vacations, and national holidays. Innovative and multicultural work environment: collaborate and learn from the global Top 1% of talent. Supportive environment with mentorship, promotions, skill development, and diverse growth opportunities. #J-18808-Ljbffr
Software Engineer (Apache Spark) - Remote Work | Ref#288500
BAIRESDEV
puebla de zaragoza, puebla de zaragoza
Publicado hace 19 días
Denunciar empleo