Question 1

What is the N-1 temporal holdout methodology?

Accepted Answer

N-1 temporal holdout is a validation method in which the most recent completed role in a candidate's career history is withheld as the ground truth. The scoring system runs on the prior career history only — what was visible before that last role started. The system then predicts 12-month retention behavior, and the prediction is compared to the actual outcome of the withheld role. This approach tests whether the system can generate useful predictions from only the data available at the pre-hire stage.

Question 2

What does 84.3% accuracy at 12 months mean?

Accepted Answer

In the holdout validation study, Stability Engine correctly identified early-departure risk at the 12-month threshold for 84.3% of candidates in the validation cohort. Candidates who scored in the higher risk bands and departed within 12 months were classified correctly; candidates who scored in the lower risk bands and remained employed past 12 months were also classified correctly. The 84.3% figure reflects that binary classification accuracy across the full n=51 holdout cohort.

Question 3

What is the Cox model Brier score?

Accepted Answer

The Brier score measures the accuracy of probabilistic predictions on a 0-to-1 scale, where 0 is perfect and 1 is the worst possible. A score of 0.169 indicates well-calibrated probabilistic forecasts — the model's stated probabilities of early departure are close to the observed rates. This is the mean Cox Brier score across the n=51 holdout cohort.

Question 4

Where can I read the full validation study?

Accepted Answer

The Ros Holdout Validation Study 2026 is available as a PDF download at stabilityengine.ai/audit. The study covers methodology, cohort composition, results, failure mode transparency, and interpretation guidance.

Metric	Result	What it measures
12-month accuracy	84.3%	Correct binary classification at the 12-month retention threshold
Score accuracy	72.5%	Stability Scores within ±15 points of the reference label
Mean Cox Brier score	0.169	Probabilistic calibration quality (0 = perfect, lower is better)
Cohort size	n=51	Total candidates in the holdout validation cohort
Methodology	N-1 temporal holdout	Prior career history only; most recent role withheld as ground truth

Validation Methodology: 84.3% Accuracy at 12 Months

Validation methodology: N-1 temporal holdout

Validation results

What the Stability Score measures

Honest limits of the validation

Frequently asked questions

What is the N-1 temporal holdout methodology?

What does 84.3% accuracy at 12 months mean?

What is a Brier score and what does 0.169 indicate?

Where can I read the full validation study?

Read the full validation study