Extreme heat events cause substantial excess mortality, yet their long-term demographic consequences extend far beyond immediate death counts. Each heatwave creates demographic memory—the cascading effects of lost individuals who would have reproduced, aged, and shaped future population structures. In this work, I will develop a microsimulation framework to quantify how a single extreme heat event reshapes population trajectories over subsequent decades, comparing outcomes with and without the heatwave to isolate its lasting demographic imprint.
I will implement microsimulations using the SOCSIM package in R, which simulates individual life events—births, deaths, migrations—through probabilistic rules applied at monthly time steps. This granular approach captures how mortality shocks propagate through populations in ways that aggregate models cannot.
I will construct synthetic populations for Western European countries affected by the July 2022 heatwave, initializing from the Human Mortality Database and Human Fertility Database. Two parallel projections will be analyzed: one incorporating observed mortality from July 2022 (including heat-attributable excess deaths), and a counterfactual where mortality follows forecasted baseline rates without the heatwave. I will use the Lee-Carter method to project baseline mortality, allowing the observed July 2022 mortality spike to capture the heat events demographic shock.
By the conference, I expect to present proof-of-concept results showing how the two scenarios diverge. Core outputs will include: excess mortality attributable to the July 2022 heatwave, measures of bereavement quantifying how many individuals lost family members due to heat-attributable mortality, and projections comparing population structures with and without this event over subsequent decades.
Focusing on a single extreme heat event establishes methodological foundations while demonstrating that even one event matters demographically. While climate change will bring repeated heat extremes that compound these effects, understanding one events impact provides the baseline for assessing cumulative effects. The difference between the two projected populations quantifies the full demographic burden: not only immediate deaths, but also potential altered age structures, depleted cohorts, and expanded bereavement.
Alignment is a critical calibration technique in microsimulation, ensuring individual-level transitions aggregate to known macro-targets. While indispensable for updating populations to match demographic projections or macroeconomic forecasts, the statistical properties of various alignment algorithms remain under-researched. This paper provides a systematic evaluation of alignment methods for discrete-time models to guide researchers in method selection.
We categorize existing methods based on a robust taxonomy: variable type (continuous vs. discrete), the nature of the outcome (exact match vs. stochastic), and time-step logic (continuous vs discrete). Building on the work of Stephenson (2018), we demonstrate that most contemporary alignment methods can be unified under a single constrained optimization framework, differing only in their choice of objective functions. This unification allows us to establish previously unrecognized relationships between seemingly disparate techniques.
Furthermore, we derive closed-form expressions and Taylor series approximations for computationally intensive iterative methods. These mathematical shortcuts allow modellers to reduce simulation run-times significantly while converting exact alignment processes into stochastic ones without losing desirable statistical properties. We then study the impact of these methods on the predicted distributions and covariance of outcomes between simulation runs.
Our analysis reveals that common techniques, such as Sorting by Predicted Probability (SBP) and Sorted By Difference between predicted Probability and a random number (SBD), are undesirable for microsimulation modelling as they can introduce biases and path dependence into the simulation outcomes. In contrast, parameter variation methods (i.e. recalibration of model coefficients such as intercept shifting) offer modellers control over the impact of alignment and better interpretability. Furthermore, methods like logit scaling, variance-weighted additive scaling for probabilities, and additive scaling for continuous variables offer greater robustness due to their foundations in probability theory. We conclude by providing a decision matrix for modellers, weighing the trade-offs between computational efficiency, distribution preservation, and variance reduction.
This paper analyzes the reduction in total labor costs induced by an increase in the minimum wage in France. Using the Ines microsimulation model developed by the French National Statistical Institute (Insee), I simulate a 2% increase in wages for all workers paid at the minimum wage. Due to the complex of exemptions of the French socio-fiscal system, an increase in the minimum wage leads to a reduction in employers’ social security contributions (SSCs) for workers earning between 1.01 and 3.5 times the minimum wage. I confirm existing results from L’horty (2000): overall, a 1% increase in the minimum wage reduces employers’ SSCs by approximately €1.67 billion.
PASSAGES is an open-source dynamic microsimulation model aimed at supporting policy analysis and research related to the Canadian retirement income system at the individual and family levels. The model simulates demographic processes, labour market histories, pension entitlements, and public pension income. The publicly available version includes a synthetic starting database, the model, and documentation. This paper describes the methodology of the newly added “Residual Net Income” module in PASSAGES for individuals aged 60 and over. In the Canadian tax context, net income represents total income reported on the tax return after allowable deductions, such as registered pension contributions, union dues, and other tax-deductible expenses. The concept of net income is used to calculate the Old Age Security (OAS) recovery tax and the income-tested reduction of the Guaranteed Income Supplement (GIS). PASSAGES already models certain income sources which are included in net income such as employment earnings, self-employment income, Canada and Quebec Pension Plan (CPP/QPP) benefits, and OAS. A residual net income concept was created where these known income sources were subtracted from net income. For the elderly, the income sources included in residual net income primarily comprise pension income from private sources and investment income.
Modelling residual net income presents several challenges, notably the large proportion of individuals with zero residual net income and the presence of sharp, non-linear transitions in many individuals’ residual income across deciles over time. While a majority of individuals remain within their income decile or move by only one decile annually, the model captures the non-linear transitions observed in the tax data. We employ a novel approach to embed these longitudinal dynamics into forward-looking simulations. Using linked tax and census data, we estimate a two-step model combining logistic and quantile regressions to initialize residual net income in 2016. Longitudinal dynamics are simulated using multinomial logit and quantile regression models, with validation based on observed decile transitions and age-specific distributions. Incorporating residual net income enables PASSAGES to compute total family income and supports analyses of income-tested retirement benefits and family-level outcomes.
The BELMOD microsimulation model of the Belgian Federal Public Service Social Security is based on an administrative input dataset. While the model’s policy simulations are updated twice a year, the most recent input dataset currently available refers to 2019. As a result, the discrepancy between the input data and the present situation has grown to several years. This gap can be reduced through nowcasting methods, by updating the outdated input data with more recent information to bring it more in line with the present situation, thereby making the data more suitable for simulations relating to the most recent years. We applied nowcasting to the BELMOD input dataset by incorporating both demographic changes and changes in individuals’ labour market status. To assess which nowcasting approach is most suitable for BELMOD, we developed and compared three different methods. In the first two methods, labour market status transitions are modelled using, respectively, a parametric and a non-parametric approach. For individuals experiencing a labour market status transition, the relevant variables in the dataset are subsequently adjusted. In addition, demographic changes are incorporated in these two methods through reweighting. In the third method, both labour market status and demographic changes are implemented through reweighting. We validated these three methods using external statistics on the number of income and benefit recipients, as well as aggregate income and benefit amounts.
We introduce the use of microsimulation modelling in the context of survey experiments, coined “simu-survey experiments.” They are a specific type of information experiment (i.e. a research approach for studying how specific pieces of information affect attitudes or intentions) in which the informational treatments are based on empirically grounded microsimulation estimates.
We argue that simu-survey experiments are particularly well suited to research contexts that involve some form of change, either a prospective policy reform (e.g. pension reform) or a life course event (e.g. taking up work). In both cases, the core principle is the same: respondents are asked in a survey to evaluate a situation that departs from the status quo, and the experiment identifies how their expressed attitudes or behavioural intentions shift when they are provided with microsimulation based information about the likely consequences of that change. Depending on the research question, this information can be situated at the household level, providing respondents with personalized estimates tailored to their socio demographic profile (e.g. the predicted change in disposable income), and/or at the societal level, offering general information that applies uniformly to all respondents (e.g. the expected change in poverty).
Simu-survey experiments therefore rest on a dual promise of causal inference and empirical realism. Respondents are not reacting to fabricated, unrealistic or vague information, but to concrete numbers that could actually apply to them or to their society, meaning that their responses will more closely approximate to how people think and behave in the real world. As such, simu-survey experiments represent a versatile methodological innovation for studying how individuals evaluate certain hypothetical changes in either policy context or personal circumstances (and their consequences) across a wide range of policy domains. They also allow researchers to more accurately assess how informed individuals form opinions, revise their beliefs or update their behavioural intentions.
To illustrate how simu-survey experiments can be applied in practice and to demonstrate their effectiveness, we present the second wave of the Basic Income in Belgium (BABEL) survey, our own simu-survey experiment aimed at uncovering how support for a universal basic income (UBI) is causally impacted by microsimulation-based information regarding its specific household-level and societal-level policy outcomes.
Synthetic populations are a key component of many transportation and urban analysis frameworks, as they provide disaggregated representations of individuals or households used to feed traffic simulators and exposure models. Beyond mobility studies, they are increasingly mobilized to assess territorial sensitivity to external factors such as environmental nuisances or construction noise. Traditionally, synthetic populations are generated using calibration-based methods such as Iterative Proportional Fitting (IPF), which adjust a micro-sample to match aggregated census constraints. While robust and interpretable, these approaches are limited in high-dimensional settings and can only reproduce individuals that are already present in the initial sample.
Recent advances in machine learning offer new perspectives for synthetic population generation. In particular, Variational Autoencoders (VAEs) have demonstrated strong capabilities in learning complex joint distributions and generating realistic synthetic data in domains such as image and text generation. Applied to population synthesis, VAEs allow the modeling of rich, multidimensional dependency structures between socio-demographic attributes and enable the generation of more diverse populations. However, VAEs alone do not naturally enforce consistency with known marginal distributions derived from official statistics, which remains a key requirement in applied territorial studies.
This contribution presents a hybrid methodology that combines the generative power of VAEs with the statistical guarantees of IPF. First, a VAE is trained on a microdata sample to learn a low-dimensional latent representation of individuals and to generate synthetic agents capturing complex correlations between attributes. In a second step, IPF is used as a post-processing procedure to adjust the generated population so that selected marginal distributions strictly match external constraints. This decoupled strategy leverages the strengths of both approaches: the flexibility and scalability of VAEs in high-dimensional spaces, and the ability of IPF to enforce consistency with official statistics.
The presentation will detail the architecture of the VAE and the integration of IPF as a calibration layer. Results will illustrate how this approach improves diversity, realism, and statistical coherence of synthetic populations compared to traditional methods.
Spain records one of the highest rates of in-work poverty in the European Union (Eurostat, 2024). Despite this, the development of policies specifically targeted at supporting low-income workers has been limited, especially when compared to other European countries (Laun, 2019). This policy gap, combined with the expansion of minimum income guarantees schemes, may weaken work incentives by narrowing the income gap between employment and nonemployment, thereby increasing the risk of poverty traps among low-income households (Domínguez-Olabide & Zalakain, 2023).
This paper proposes the introduction of an in-work benefit in Spain, inspired by the American Earned Income Tax Credit (EITC), with the aimed of improving the living conditions of the low-wage workers while strengthening labour market incentives. As in the EITC, the benefit is implemented through the Spanish personal income tax system as a refundable tax credit, varying according to household type. In particular, we propose a more generous credit for families with dependent children, as they face higher poverty rates. The reform is simulated using the EUROMOD microsimulation model, based on 2022 EU-SILC data. Labor supply responses are estimated using a structural labour supply model.
The main results suggests that the proposed in-work benefit would not generate negative labour supply effects. On the contrary, the reform is associated with an overall increase in labour supply, particularly in full-time employment. Positive effects are found for both single and couples, with stronger impacts among households with dependent children.
We introduce MIDAS DE, a LIAM2 based microsimulation model for analysing German pension incomes under current law and counterfactual policy scenarios. The model reproduces the statutory formula for earnings point accrual, access and type factors, and the current pension value, and is designed to evaluate distributional, gender, and adequacy effects of reforms such as pension splitting and survivor benefit adjustments within a unified framework. MIDAS DE is implemented in LIAM2 using discrete time processes over entities (individuals, households), typed fields (e.g., insured status, pension points), and explicit links (spouse/partner, parent–child) necessary for survivor pensions and splitting eligibility. The model combines SOEP RV administrative insurance records with SOEP survey microdata. Linkage relies on rv_id (SOEP RV↔SOEP) and pid to reconstruct households and partnerships from ppath/ppathl, household matrices from pbrutto/pl, and family histories from biofam/biomars. This enables (i) identification of spouses; (ii) retrieval of pension relevant histories for groups under represented in DRV (e.g., civil servants, self employed) via biowork/biojob; and (iii) construction of household attributes needed for survivor benefit means tests. To harmonise labour income for accrual, we estimate gender and occupation specific Heckman selection models for three groups—salaried employees, self employed, and civil servants—ensuring segment specific participation mechanisms and wage processes. Predictions are selection corrected (inverse Mills ratio) and back transformed with lognormal adjustments; observed wages replace predictions when available. This captures institutional heterogeneity (e.g., civil service pay scales, self employment volatility) and mitigates bias from missing or misreported earnings, feeding consistent contributory bases into earnings point calculations. Robustness checks consider exclusion restrictions (household composition and partner status), outlier trimming, and alternative retransformation (smearing). Technically, MIDAS DE shows how LIAM2 can host a law consistent German pension engine calibrated on linked RV–SOEP microdata with explicit household links, enabling faithful simulation of survivor pensions, pension splitting, and income offsets. Substantively, the model structures policy scenarios along current law splitting, VersAusglG style variants (with and without 25 year conditions and cross pillar coverage), and a universal splitting regime, providing outcomes on the gender pension gap, poverty at retirement, and fiscal effects.
Unemployment benefits and contributions are often uniform across partnered and single individuals. In this paper, we study whether this constitutes an optimal policy, given that partners within couples benefit from risk sharing. Our contribution is threefold. First, we document the channels through which unemployment benefits have a first-order impact on aggregate welfare. Second, we express the welfare impact in terms of a few sufficient statistics, which can be derived from observational data. Third, we quantify the optimal policy in an empirical application using US data.