Lead Python Developer

Pittsburgh, PA, USA Req #27265
Friday, November 15, 2024

Company Overview and Culture 

EXL (NASDAQ: EXLS) is a leading data analytics and digital operations and solutions company. We partner with clients using a data and AI-led approach to reinvent business models, drive better business outcomes and unlock growth with speed. EXL harnesses the power of data, analytics, AI, and deep industry knowledge to transform operations for the world's leading corporations in industries including insurance, healthcare, banking and financial services, media and retail, among others. EXL was founded in 1999 with the core values of innovation, collaboration, excellence, integrity and respect. We are headquartered in New York and have more than 55,000 employees spanning six continents. For more information, visit http://www.exlservice.com.



Job Title: Senior Python Data Engineer 

Job Description 

We are seekinga highly skilled Senior Python Data Engineerto join our dynamic team. The ideal candidate will possessa strong programming background in advanced Python, with a focus on data engineering frameworks and libraries. You will be responsible fordesigning, building, and maintainingrobust data ingestion pipelines, ensuring seamless integration of data from various sources. 

Key Responsibilities 

  • Data Pipeline Development: Design, implement, and optimizedata ingestion pipelines using advanced Python (NumPy, Pandas, Dask) to ensure efficient data flow and processing. 

  • Data Storage Management: Work extensively with Parquet files for efficient data storage and retrieval, including partitioned Parquet files, ensuring optimalcompression and schema evolution. 

  • Collaboration: Work closely with geographically distributed teams and clients to gather requirements, provide technical solutions, and ensure data quality. 

  • Team Leadership: Lead a team of data engineers by assigning tasks, reviewing code, and mentoring junior team members. 

  • Design Participation: Engage in architectural discussions and design sessions, contributing to the overall data pipeline architecture. 

  • REST API Development: Build and maintainREST APIs, ensuring API security through key validation, authorization, and authentication mechanisms. 

  • Data Manipulation: Set up and manipulate Python data structures such as lists, strings, dictionaries, and tuples. Use strong expertisein Pandas and NumPy for data manipulation. 

  • Data Exploration & Visualization: Conduct data exploration, visualization, and comparison of metrics for large CSV and Parquet files. 

  • Debugging and Optimization: Troubleshoot complex data pipeline issues, utilizinglogging and monitoring tools (like ELK Stack, Grafana) to optimizeperformance for scalability and efficiency. 

  • Data Storage Solutions: Design and implement data storage solutions using SQL (PostgreSQL, MySQL) and NoSQL databases (MongoDB, Cassandra). 

  • Data Transformation: Use advanced techniques such as joins, merges, pivot tables, grouping, and window functions in Python or SQL. 

  • Documentation: Maintainthorough documentation of data pipelines, architectures, and processes for future reference and onboarding. 

Required Qualifications (Must-Have) 

  • Programming Skills: Advanced proficiencyin Python, particularly with libraries such as NumPy and Pandas for data manipulation and analysis. 

  • Parquet Experience: Strong experience with Parquet files, including reading, writing, and optimizingfor performance and storage efficiency. 

  • Data Structure Manipulation: Ability to set up and manipulate Python data structures such as lists, strings, dictionaries, and tuples. 

  • Data Exploration: Familiarity with data exploration, visualization, and comparing metrics of large CSV and Parquet files, including partitioned Parquet files. 

  • Advanced Data Techniques: Strong skills in joins, merges, pivot tables, grouping, and window functions in Python or SQL. 

  • Version Control: Strong understanding of GIT, including git push and git clone for collaborative development. 

  • Linux Proficiency: Experience with Linux commands and shell scripting for data operations. 

  • Data Pipeline Experience: Proven experience in building and managing data ingestion pipeline scripts, including batch and real-time processing. 

  • REST API Knowledge: Familiarity with building REST APIs and securing them through API key validation and authentication mechanisms. 

  • Debugging Skills: Demonstratedability to handle complex data pipeline architecture with excellent debugging skills. 

  • Leadership Experience: Prior experience leading a technical team and mentoring junior engineers. 

Preferred Qualifications (Good-to-Have) 

  •  
  • Object-Oriented Programming: Good experiencewith object-oriented programming patterns, multithreading, and multiprocessing. 
 
  • Spark Applications: Experience developing Spark applications using Python, including familiarity with Apache Spark (Spark SQL, Spark Streaming, DataFrames, RDD, PySpark). 

  • Communication Skills: Excellent verbal and written communication skills, with the ability to convey technical concepts to non-technical stakeholders. 

 

EEO/Minorities/Females/Vets/Disabilities 

To view our total rewards offered click here —> https://www.exlservice.com/us-careers-and-benefits
 
Base Salary Range Disclaimer: The base salary range represents the low and high end of the EXL base salary range for this position. Actual salaries will vary depending on factors including but not limited to: location and experience. The base salary range listed is just one component of EXL's total compensation package for employees. Other rewards may include bonuses, as well as a Paid Time Off policy, and many region specific benefits. 
 
Please also note that the data shared through the job application will be stored and processed by EXL in accordance with the EXL Privacy Policy. 

Application & Interview Impersonation Warning – Purposely impersonating another individual when applying and / or participating in an interview in order to obtain employment with EXL Service Holdings, Inc. (the “Company”) for yourself or for the other individual is a crime.  We have implemented measures to deter and to uncover such unlawful conduct.  If the Company identifies such fraudulent conduct, it will result in, as applicable, the application being rejected, an offer (if made) being rescinded, or termination of employment as well as possible legal action against the impersonator(s).

EXL may use artificial intelligence to create insights on how your candidate information matches the requirements of the job for which you applied. While AI may be used in the recruiting process, all final decisions in the recruiting and hiring process will be taken by the recruiting and hiring teams after considering a candidate’s full profile. As a candidate, you can choose to opt out of this artificial intelligence screening process. Your decision to opt out will not negatively impact your opportunity for employment with EXL.

Other details

  • Pay Type Salary
Location on Google Maps
  • Pittsburgh, PA, USA