Job Details

ID #45934927
Estado Virginia
Ciudad Mclean
Tipo de trabajo Contract
Salario USD Depends on Experience Depends on Experience
Fuente Technology Ventures
Showed 2022-09-23
Fecha 2022-09-20
Fecha tope 2022-11-18
Categoría Etcétera
Crear un currículum vítae

Python Developer (for data processing and analysis)

Virginia, Mclean, 22102 Mclean USA

Vacancy caducado!

Location: On-Site - McLean, VA

Duration: 6 Months, possible extension

Required Skills: Strong with Python, Knowledgeable in Pyspark, Snowflake SQL

Responsibilities include:• Cleanse, manipulate and analyze large datasets (Semi-Structured and Unstructured data – XMLs, JSONs, CSVs, PDFs) using python and Snowflake database.• Develop Python scripts to filter/cleanse/map/aggregate data.• Manage and implement data processes (Data Quality reports)• Develop data profiling, deduping logic, matching logic for analysis• Programming Languages experience in Python, PySpark and SQL for data ingestion• Present ideas and recommendations on data handling and data parsing technologies to management

Qualifications:• 5+ years of experience in processing large volumes and variety of data (Structured and semi-structured data, writing code for parallel processing, shredding XMLS, JSONs and reading PDFs) - Mandatory• 3+ years of programming experience in Python for data processing and analysis – Mandatory• 2+ years of experience with Snowflake, preferable parsing JSON and XML files- Desirable• Strong SQL experience is a must - Mandatory• 3+ years of experience – using Hadoop platform and performing analysis. Familiarity with Hadoop cluster environment and configurations for resource management for analysis work - Optional• 2+ years of programming experience in PySpark for data processing and analysis - Optional• Detail oriented. Excellent communication skills (verbal and written)• Must be able to manage multiple priorities and meet deadlines• Degree in Computer Science , Statistics, Mathematics, or related field

Vacancy caducado!

Suscribir Reportar trabajo