Population Services International (PSI) is seeking an Associate, Data Engineering for the DISC Project. This role is focused on designing, building, and optimizing scalable data pipelines to deliver high-quality, analytics-ready datasets. You will support machine learning workflows, ensure data platform reliability, and contribute to data governance and security practices. The position involves working with cloud-based platforms such as Microsoft Fabric or Databricks and requires strong scripting skills in PySpark, Python, R, or SQL.
Responsibilities
Data Engineering
Support the design, build, and optimization of scalable data pipelines using cloud-based proprietary or open-source data platforms.
Deliver high-quality, analytics-ready datasets for enterprise reporting and advanced analytics.
Implement best practices for data transformation and cleansing to ensure data integrity and reliability.
Support the development and maintenance of technical documentation, including SOPs and user guides.
Machine Learning Operations (MLOps) Support
Enable and support machine learning workflows by developing feature pipelines.
Integrate models into production data environments and contribute to automated model monitoring processes.
Platform Operations & Reliability
Ensure the stability, performance, and reliability of data platforms through proactive monitoring and CI/CD practices.
Support the implementation of data analytics, visualization, and reporting tools.
Provide technical support in data quality management, data governance, security, and data privacy.
Organizational Values
Embody PSI’s values: Measurement, Pragmatism, Honesty, Trust, Collaboration, and Commitment.
Be prepared for 10–25% international travel, or be flexible enough to support global teams across different regions.
Qualifications and Experience
Bachelor’s degree (or international equivalent) in Computer Science, Information Technology, Statistics, or a related field.
At least 3 years of related experience; an equivalent combination of relevant education and experience may be considered.
Strong data engineering skills, specifically building and maintaining ETL/ELT pipelines with both structured and unstructured data.
Solid scripting skills in PySpark, Python, R, or SQL.
Knowledge of data modeling principles and database systems.
Experience with Microsoft Fabric, Databricks, or similar cloud-based/open-source data platforms.
Familiarity with data lake/lakehouse architectures and best practices for data storage and processing.
Knowledge of machine learning concepts and frameworks.
Experience using data analysis and visualization tools such as Power BI, Superset, Tableau, or D3.js.
Strong problem-solving skills and the ability to interpret data for non-technical audiences.
How to Apply
Interested and qualified candidates should apply online through the Population Services International (PSI) recruitment portal at careers-psi.icims.com. Ensure you provide all required documentation and information as part of your profile. You can access the application page directly via this link: https://www.myjobmag.co.ke/apply-now/1199111