Overview
The Data Engineer will lead the design and implementation of the Data360 data production and dissemination system, ensuring alignment with the WBG Development Data Quality Policy.
Key Responsibilities
- Develop and implement technical and business processes for data ingestion, production, management, and dissemination.
- Architect and implement the Data360 framework for "self-service" production and publishing of development indicators.
- Design and implement reusable functions, packages, and libraries for automated data pipelines.
- Develop CI/CD processes and policies for managing code repositories, issue tracking, and pipeline orchestration.
- Assess, recommend, and implement capabilities related to metadata, data lineage, data management, and data quality improvement.
- Provide guidance and mentoring to team members on data engineering practices.
- Collaborate with business data stewards and task teams to implement data projects and initiatives.
- Serve as a liaison between Business and Information Technology departments.
- Stay updated with and transfer knowledge on data management methods and best practices in data governance.
Required Experience
- At least, five years of relevant professional experience in data engineering.
- Demonstrated experience architecting and leading development of large-scale end-to-end data solutions.
- Hands-on experience building production-grade ETL pipelines using Databricks, Azure data lake, Unity Catalog and similar technologies.
- Extensive familiarity with the WBG Data360 framework, platform and solutions.
- Proven experience working within the World Bank Group’s ITS ecosystem.
- Solid experience working with data governance and data management concepts within a development data/statistical data context.
- Demonstrated familiarity with SDMX and international metadata standards such as DDI, ISO19139/19115 and WBG time series metadata schemas.
- Strong knowledge and experience in Agile Project Management methodologies.
- Practical experience developing production-grade products using CI/CD workflows.
- Demonstrated expertise in Python and R.
Qualifications
• Master’s degree in computer or data science, statistics, mathematics, or a related field.