Admin Data

BEAMM project : How do we deal with data ? Statistical matching and WGAN generation.

BEAMM project : How do we deal with data ? Statistical matching and WGAN generation.

In the framework of the BEAMM project (BElgian Arithmetic Micro-simulation Model), we propose several methods to address data issues. The core of this project is to develop a tax-benefit microsimulation model for Belgium accessible online, requiring intensive data handling. Our challenges consist in creating a unified data set containing variables from different surveys and developing a completely synthetic database for the online development of the BEAMM platform.

Read More
Estimation and Simulation of RURO Labor Supply Models with Administrative Data: Re-assessing the Evidence from Belgium

Estimation and Simulation of RURO Labor Supply Models with Administrative Data: Re-assessing the Evidence from Belgium

This paper estimates a Random Utility Random Opportunity model of labor supply using linked Belgian administrative data . The framework allows individuals to choose among stochastic wage and hours offers, capturing both participation decisions and hours adjustments within a unified structure. By combining tax records, social security data, and demographic registers, we construct precise measures of earnings, hours, and household characteristics.

Read More
Evaluating the results of a social benefit simulation using individual administrative data on benefit receipt

Evaluating the results of a social benefit simulation using individual administrative data on benefit receipt

The simulation of social benefits is an important application of tax-benefit microsimulation models in social policy research. Simulation outcomes inform about the (potential) effects of social policies and policy reforms. Furthermore, tax-benefit simulations allow for an evaluation of the interactions of different benefits in complex benefit systems. However, the quality of the simulation outcomes has consequences for the assessment of the effectiveness of the policy. An increasing number of recent studies on benefit non-take-up as one measure for the ineffectiveness of social policies explicitly address the difficulties in determining benefit entitlements using tax-benefit microsimulation models (Tasseva 2016, Bruckmeier et al. 2021, Doorley and Kakoulidou 2024, Bargain et al. 2012, Harnisch 2019). Consequently, the validation of the simulation outcomes is an important step in the application of tax-benefit simulations. Our study contributes to the literature on validating the results of tax-benefit simulation models. We examine how well the results of an open-source tax-benefit microsimulation model for Germany (GETTSIM) on means-tested minimum income (UBII) entitlements match the benefits contained in administrative data on UBII. Our analysis has two objectives: First, the results should provide an assessment of the validity of the UBII simulation results using GETTSIM. Second, generalized conclusions for policy and non-take-up analyses based on tax-benefit microsimulation models will be drawn. The results show that UBII entitlements are in most cases correctly simulated. In an adapted version of GETTSIM we have used, only for 3 to 4 percent of all observations no UBII entitlement was simulated (beta error). The simulation also allowed a precise distinction between UBII and existing similar benefits (housing and means-tested child benefit) for most observations. A closer look at individual deviations between recorded and simulated entitlements reveals significant deviations for migrants, especially from crisis countries, which was particularly relevant in 2017 and 2018. Furthermore, for households with many family members, with children or employed persons, the simulation at the individual becomes less precise. The results also provide some insights for the analysis of eligibility based on tax-benefit simulations in general. In social policy systems with overlapping benefits, even with very good data quality, misspecification of benefit entitlements cannot be avoided to a relevant extent, especially when the benefits pursue similar objectives and discretionary decisions occur at the administrative level. Since the mean values of various large sociodemographic groups are relatively accurately determined in the simulation, calibrating the simulated recipient numbers can compensate for these inaccuracies. The analysis at the individual level has shown that simulation quality decreases particularly for subgroups with more complex life circumstances, such as households with children. This applies in particular to comprehensive last-resort minimum income systems that provide benefits in the household context and take all types of household income into account. Temporary special circumstances, like a national or global crisis, can also lead to simulation results that do not reflect actual payments. In crisis years, consideration should be given to excluding certain particularly affected subgroups from the analysis or to choose other simulation years, if possible.

Read More
MIDAS DE – A LIAM2 based dynamic microsimulation of German pension incomes using linked RV–SOEP data

MIDAS DE – A LIAM2 based dynamic microsimulation of German pension incomes using linked RV–SOEP data

We introduce MIDAS DE, a LIAM2 based microsimulation model for analysing German pension incomes under current law and counterfactual policy scenarios. The model reproduces the statutory formula for earnings point accrual, access and type factors, and the current pension value, and is designed to evaluate distributional, gender, and adequacy effects of reforms such as pension splitting and survivor benefit adjustments within a unified framework. MIDAS DE is implemented in LIAM2 using discrete time processes over entities (individuals, households), typed fields (e.g., insured status, pension points), and explicit links (spouse/partner, parent–child) necessary for survivor pensions and splitting eligibility. The model combines SOEP RV administrative insurance records with SOEP survey microdata. Linkage relies on rv_id (SOEP RV↔SOEP) and pid to reconstruct households and partnerships from ppath/ppathl, household matrices from pbrutto/pl, and family histories from biofam/biomars. This enables (i) identification of spouses; (ii) retrieval of pension relevant histories for groups under represented in DRV (e.g., civil servants, self employed) via biowork/biojob; and (iii) construction of household attributes needed for survivor benefit means tests. To harmonise labour income for accrual, we estimate gender and occupation specific Heckman selection models for three groups—salaried employees, self employed, and civil servants—ensuring segment specific participation mechanisms and wage processes. Predictions are selection corrected (inverse Mills ratio) and back transformed with lognormal adjustments; observed wages replace predictions when available. This captures institutional heterogeneity (e.g., civil service pay scales, self employment volatility) and mitigates bias from missing or misreported earnings, feeding consistent contributory bases into earnings point calculations. Robustness checks consider exclusion restrictions (household composition and partner status), outlier trimming, and alternative retransformation (smearing). Technically, MIDAS DE shows how LIAM2 can host a law consistent German pension engine calibrated on linked RV–SOEP microdata with explicit household links, enabling faithful simulation of survivor pensions, pension splitting, and income offsets. Substantively, the model structures policy scenarios along current law splitting, VersAusglG style variants (with and without 25 year conditions and cross pillar coverage), and a universal splitting regime, providing outcomes on the gender pension gap, poverty at retirement, and fiscal effects.

Read More