Latent Classes in HILDA's 7-item Module C2 and What They Predict for Subsequent Recovery
The 7-item HILDA Module C2 financial-hardship battery has been treated as an aggregate score (count of items endorsed) or a binary any-hardship indicator in essentially every published Australian household-finance study that uses the panel. This paper begins from the observation that the seven items measure substantively different states — asking friends for help is qualitatively different from being unable to pay for heat — and the aggregate score conflates these states in ways that may matter for predicting subsequent recovery.
The behavioural-economics hypothesis under test is that the FIRST hardship item endorsed at hardship onset determines the subsequent trajectory, distinct from the aggregate hardship score. Three substantive states are proposed: strategic help-seeking (asking friends/family or welfare organisations for help), passive bill failure (couldn't pay bills, rent), and severe deprivation (couldn't heat home, went without meals, pawned/sold items). The central claim is that people whose first hardship is 'asked for help' do better than people whose first hardship is 'couldn't pay bills' — even though the standard aggregate index treats them as equivalent.
The empirical setting is HILDA Module C2 across waves 1–22, 454,861 person-waves, of which 286,402 are observable on the 7 hardship items, 62,984 endorse at least one item, and 60,286 form the LCA sample (count ≥ 1, all 7 items non-missing). First-landing events — transitions from hardship_count = 0 to hardship_count ≥ 1 — number 11,595 across the panel.
The analysis pipeline applies Bernoulli-mixture latent class analysis (LCA) via EM with K = 2, 3, 4, 5 and BIC-based model selection; first-landing detection at the 0 → ≥1 transition; trajectory recovery analysis at t+1 to t+4 by first-landing class; and heterogeneity stratification by income, age, education, household, and era.
The feasibility exploration completed 2026-05-10 returns a clear REVISE verdict. The latent-class structure is strongly supported: K = 5 wins on BIC and the five recovered classes recover exactly the three substantive types the hypothesis predicts (strategic help-seekers C3, bills-only C2, severe multi-domain C1) plus two more (deprivation-mixed, housing-stress). One class (C3, 30.7% of LCA sample, 38.5% of first-landings) is unambiguously 'strategic help-seekers' with P(help_fam) = 1.00 and P(bills) = 0.34. However, the central trajectory hypothesis is NOT supported as stated: at t+1, C3 recovery rate is 56.9% versus C2 bills-only 62.4% (z = −3.67, p = 0.0002) — the direction is the opposite of the prior. The gap closes by t+4 (C3 = 67.6% vs C2 = 67.6%) but never inverts. Severe multi-domain (C1, 4.5%) shows a clear durable disadvantage (36.0% recovery at t+1; 49% at t+4). The paper now pivots on the directional finding: help-seeking is a distinct latent state but it predicts slower, not faster, recovery — a substantive contribution that the standard aggregate-index literature has missed.