Senior Evaluation Specialist | Research Fellow

Lucas Sempé

Brazilian-Argentine policy researcher with over two decades of experience spanning grassroots programme delivery, senior government leadership, and academic research. As Director at Peru's Ministry of Education, managed a £2 billion budget covering results-based reforms, teacher policy, and school governance. Current research focuses on evidence-informed policymaking, AI/digital policy governance, health financing, and the use of large language models for research automation. Published in The Lancet, BMJ, Social Science & Medicine, and Review of Income and Wealth. Over £1.5 million in research funding secured as PI/co-PI.

Evidence-Informed Policymaking & Knowledge MobilisationAI for Research & Policy: Tools, Ethics & GovernanceInequality Measurement & Social PolicyMental Health & Health SystemsCausal Inference Methods & Policy Applications

Recent Posts

View all →

AI SafetyMental Health

The Instruction Tuning Firewall

Mental health chatbots can drift toward dangerous validation while sounding perfectly appropriate. I built a monitoring system that detects persona drift in model activations—catching problems that even a fine-tuned DeBERTa misses, with a 2.6× advantage on crisis recognition. Validated by two clinical psychologists (ICC=0.716) and tested on naturalistic emotional support conversations.

Feb 15, 2026

AI EthicsImpact Evaluation

When Algorithms Meet Warzones

A drone image classifier that can't distinguish combatants from farmers. A beneficiary targeting model trained on data from before the displacement. A chatbot collecting trauma narratives in a language it barely understands. These aren't hypotheticals—they're the edge cases where AI meets impact evaluation in fragile contexts.

Feb 6, 2026

AI GovernancePolicy

The Capacity Gap

I scored 2,216 AI policy documents across 193 countries on implementation capacity. The headline isn't that rich countries do better—it's that the gap nearly vanishes once you account for documentation quality. The real story is what's happening within income groups.

Feb 6, 2026

RAGVoice AI

Talking to Your Evidence Base

What if you could ask your research library a question out loud and get a spoken answer grounded in actual studies? A retrieval-augmented system with voice interface makes research synthesis conversational.

Jan 20, 2026

Affiliations

International Initiative for Impact Evaluation (3ie) University of Oxford, Department of Psychiatry (Visiting Fellow) University of East Anglia (Honorary Research Fellow) Queen Margaret University (Honorary Research Fellow) Universidad Católica San Pablo (Visiting Fellow)

Education

PhD International Development

University of East Anglia, UK • 2022

"Measurement and effects of wealth inequality and schools. Evidence from PISA"

MSc Poverty Reduction, Policy and Practice

SOAS, University of London • 2014

"Appraisal of a learning coaching strategy in Peru in a results-based budgeting rationality"

Skills & Expertise

🔬 Research Domains

Education & Inequality Health Systems & Financing Ageing & Older People Mental Health & NTDs Mortality & Epidemiology Social Protection

📊 Methods

Impact Evaluation Systematic Reviews Mortality Modelling Cost-Benefit Analysis Econometrics Spatial Analysis

💻 Programming & Tools

R & Stata Python SQL JavaScript/React GIS (QGIS, R) Git & GitHub

🤖 AI & Data

LLM Engineering RAG Systems Vector Databases Prompt Engineering Data Pipelines Cloud Deployment

🏛️ Policy Experience

Results-Based Budgeting Public Sector Management Program Evaluation Capacity Building Stakeholder Engagement Policy Translation

))}

Languages

Spanish · Native

Portuguese · Native

English · Fluent

Country Expertise

Argentina Brazil Chile Colombia Ecuador El Salvador Liberia Peru UAE