The science behind Vital

Why this page exists

Health apps make plenty of claims. Vital shows its working. Recovery, Exertion, Stress, Health and Readiness each draw on peer-reviewed research, and this page sets out the weights, the reasoning and every linked paper. The full reference library is below.

Some weights are engineering judgement rather than something a paper can validate. We call those out plainly.

What Vital reads

Vital reads 25 Apple Health signals on your iPhone at refresh time, alongside anything you log in the app. Raw HealthKit samples stay on your iPhone; only daily scores and Vital entries sync.

HRV (SDNN)
Resting heart rate
Heart rate
Walking heart rate
Heart-rate recovery
VO₂ max
Respiratory rate
Blood oxygen
Sleep stages
Workouts
Active energy
Basal energy
Exercise minutes
Stand hours
Steps
Distance
Flights climbed
Body mass
Body fat %
Lean mass
Caffeine
Mindful minutes
Dietary energy
Macros
Water

The three principles

Your normal isn't anyone else's normal. A healthy adult's HRV can sit anywhere from 20 ms to over 100 ms, so comparing user A against user B mostly measures age, fitness history and genetics. Vital scores almost everything against your own rolling distribution. A one-size table would spend most of its points scoring who you are rather than how recovered you are.

State signals and trait signals get different weights. Resting heart rate and HRV change day-to-day (state). VO₂ max changes month-to-month (trait). The Recovery score weights state signals more heavily because the question it answers (how recovered am I today?) is a state question.

The lack of a paper isn't a free pass. Several weights in Vital are engineering decisions with no citation behind them. Where that's true, this page calls it out, and the honest caveats section collects them in one place.

Recovery

Recovery formula35% HRV + 25% sleep + 15% RHR + 15% (inverse) accumulated fatigue + 10% fitness composite

HRV (35%)

HRV is parasympathetic tone, and parasympathetic tone is the physiological substrate of "recovered." The 1996 European Society of Cardiology / NASPE Task Force standards (Circulation) established HRV as an independent mortality predictor and the canonical autonomic-balance marker. Shaffer & Ginsberg 2017 (Frontiers in Public Health) documents the wide healthy range (RMSSD 20–100+ ms) that mandates personal-baseline scoring: there's no useful absolute cut-off.

For daily scoring, Aubert 2003 (Sports Medicine) and Plews 2013 (Sports Medicine) are load-bearing: in trained populations, rolling averages outperform single-day reads, and trend deviations matter more than absolute values. This is why Vital uses a personal percentile of the recent window.

One honest caveat on the metric itself. The research above is built largely on RMSSD, but Apple Health exposes HRV as SDNN from roughly one-minute samples. SDNN and RMSSD are correlated but not interchangeable, and a one-minute window sits below the Task Force's short-term recording standard. Vital works in SDNN against your own SDNN baseline, where the trend is what matters, but the cross-metric mismatch is real and we'd rather name it than paper over it.

Sleep quality (25%)

Duration, efficiency, and deep+REM each carry a sub-citation. Hirshkowitz 2015 (NSF, Sleep Health) and Watson 2015 (AASM/SRS consensus, Sleep) set the 7–9 h adult target. Ohayon 2004 (Sleep, meta-analysis of 65 studies) is the canonical lifespan reference for deep% and REM%. Xie 2013 (Science) showed glymphatic clearance roughly doubles during slow-wave sleep, and Van Cauter 2000 (JAMA) showed roughly 70% of nocturnal growth-hormone pulses coincide with SWS episodes. The case that "deep sleep" specifically is the recovery-relevant stage isn't ornamental.

Caveat: consumer sleep-stage classification from a wrist or phone is approximate against laboratory polysomnography, so the deep% and REM% figures Vital reads carry more noise than the studies they lean on. We lean on duration and your own night-to-night trend more than any single night's stage split, which softens the error without pretending it's gone.

Van Dongen 2003 and Belenky 2003 establish that sleep debt is asymmetric: three recovery nights don't undo a week of restriction. That's why Recovery applies a chronic-bad-night penalty on top of last night's score, and why Readiness includes a sleep-trend factor.

Resting heart rate (15%)

Zhang 2016 (CMAJ), a meta-analysis of 46 studies (n > 1.2 million), established that each 10 bpm increase in RHR raises all-cause mortality risk by 9%. We don't use that as an absolute threshold; we use it inversely against your own baseline, because absolute RHR is dominated by fitness and genetics. Karvonen 1957's heart-rate-reserve framework is the basis for every personalised HR system since.

Accumulated fatigue (15%, inverse)

A single-day read misses multi-day stacking. Vital's fatigue accumulator runs an exponentially-weighted moving average over recent stress and sleep signals; Recovery subtracts the inverse. Borg 1982 (MSSE) validated the psychophysical foundations of subjective exertion scales; Krupp 1989 (Archives of Neurology) validated short Likert fatigue instruments. Vital's single 0–10 slider is a deliberate simplification of these (daily friction matters for adherence), but we acknowledge it carries more measurement noise than a 9-item scale.

Fitness composite (10%)

VO₂ max and 1-minute heart-rate recovery split this weight. Kodama 2009 (JAMA, n = 102,980) and Mandsager 2018 (JAMA Network Open, n = 122,007) establish that low cardiorespiratory fitness is among the largest single mortality predictors, larger than smoking or diabetes, with no plateau at the elite end. Cole 1999 (NEJM) and Jouven 2005 (NEJM) establish 1-minute HRR as an autonomic-function marker in its own right, beyond a fitness proxy. Lambe 2025 (PLOS ONE) is an honest caveat: Apple Watch VO₂ max underestimates lab values with a mean absolute percentage error of ~13%, which is why we weight this composite at 10% rather than higher.

Exertion

Exertion formulaEdwards TRIMP across continuous HR zones + sport-impact bonus + ring-based floor, optionally blended with logged RPE

Banister 1975 introduced the fitness-fatigue impulse-response model that underlies all modern training-load monitoring. Edwards 1993 proposed the 5-zone simplification Vital ships. Borresen & Lambert 2009 (114-article review in Sports Medicine) showed Banister and Edwards correlate r = 0.80–0.99, so the simplification trades little physiological precision for huge data-input feasibility on consumer wearables.

Karvonen 1957 is the basis for the heart-rate-reserve concept. Norton 2010 (J Sci Med Sport, ESSA position) standardises moderate/vigorous intensity descriptors. Murray 2017 (British Journal of Sports Medicine) provides peer-reviewed validation that exponentially-weighted moving averages of acute-to-chronic workload predict injury risk more sensitively than rolling averages. This is why Vital's CTL/ATL/TSB pipeline uses EWMAs.

Stress

Stress formulaHRV depression up to 40 + RHR elevation up to 20 + sleep deficit up to 15 + accumulated fatigue up to 15 + SpO₂ up to 5 + respiratory rate up to 5 + user-reported up to 20

Kim 2018 (Psychiatry Investigation, meta-analysis of 37 studies) confirms that acute stress reliably reduces HF-HRV and RMSSD across cognitive, social-evaluative, and pain stressors. Thayer 2012 (Neuroscience & Biobehavioral Reviews) is the neurovisceral-integration model: HRV indexes vmPFC–amygdala–brainstem coupling. Low HRV is paired with elevated cortisol and disinhibited HPA-axis reactivity. This is the strongest single piece of evidence in the stress algorithm.

For user-reported stress, Cohen 1983 (J Health Soc Behav) is the canonical PSS reference. Vital's 5-level daily check-in is a simplified version validated against the full PSS-10 by Elo 2003 (Scand J Work Environ Health, r ≈ 0.6–0.7).

The menstrual-cycle modifier

Schmalenberger 2019 (J Clin Med), a meta-analysis of 37 studies in 1,004 women, established the magnitude: vagal HRV decreases from follicular to luteal with d = −0.39, roughly a 15–20% reduction in HF-HRV power. Progesterone is the within-person driver (Schmalenberger 2020). Tenan 2014 (Psychophysiology) confirmed parallel RHR elevation; Baker & Driver 2020 (Temperature) documented the luteal +0.3–0.7 °C core-temperature rise plus altered sleep architecture.

In practice, Vital scales Stress down by 15% in the luteal phase because the underlying shift is hormonal. Phase-aware baselines are the more principled long-term approach, and that work is in progress.

Health

Health formula25% recovery + 25% activity + 10% sleep + 18% nutrition + 12% hydration + 10% (inverse) stress, with adaptive weight-shift toward weak domains

The Health score is a long-horizon trait-level composite. Each input is covered above (recovery, sleep, stress) or here:

Activity: Ekelund 2019 (BMJ, harmonised meta-analysis of 8 cohorts) shows the activity–mortality dose-response is non-linear with steepest gains in the first ~25 min/day of moderate-vigorous activity. The WHO 2020 guidelines (Bull et al., BJSM) set the 150–300 min/week aerobic target.
Nutrition: Drewnowski 2009 (J Am Coll Nutr) defines the Nutrient Rich Foods Index: 9 nutrients-to-encourage minus 3 to limit, validated against the Healthy Eating Index. It's the canonical nutrient-density framework.
Hydration: EFSA 2010 sets 2.5 L/day (males) and 2.0 L/day (females) total water Adequate Intakes; IOM 2004 sets parallel figures. Cheuvront & Sawka 2005 (and the ACSM Position Stand on fluid replacement, Sawka 2007) provides the clinically validated dehydration markers.

The 18%/12% nutrition-and-hydration split is an editorial weight. No paper validates that exact ratio, so we publish it as a product decision.

Readiness

Readiness formula35% recovery + 15% (inverse) stress + 20% Training Stress Balance + 15% (inverse) fatigue + 8% sleep trend + 7% HRV trend

The TSB term comes from the Banister 1991 / Coggan generalisation of the 1975 model. Murray 2017 (BJSM) validates exponentially-weighted moving averages for acute-to-chronic workload as injury-risk predictors. The mapping of TSB to a 0–100 contribution via hyperbolic tangent is a Vital engineering choice, since the literature treats TSB as banded (peaked >+15, productive overload −10 to −30) rather than continuously mapped, and we document it that way.

The sleep-trend and HRV-trend factors are deliberately small (8% and 7%) because they may have fewer than three snapshots in a new user's history, in which case they renormalise out. Plews 2014 establishes that ≥3 valid HRV readings per week is the minimum-validity threshold for meaningful trend interpretation.

TDEE

TDEE formulaMifflin–St Jeor BMR × activity multiplier (1.2–1.9)

Mifflin 1990 (Am J Clin Nutr) is the BMR equation derived from n = 498 healthy adults. Frankenfield 2005 (J Am Diet Assoc) is the systematic cross-equation comparison the American Dietetic Association cites: Mifflin produces RMR within ~10% of measured values in normal-weight and obese populations, with the narrowest error band versus Harris-Benedict and Owen. The activity multipliers (PAL) come from the FAO/WHO/UNU 2004 Human Energy Requirements consultation, which defines three lifestyle bands (sedentary 1.40–1.69, moderate 1.70–1.99, vigorous 2.00–2.40) from doubly-labelled-water studies.

For users with reliable body-composition data, the Katch–McArdle / Cunningham 1991 lean-mass equation (BMR = 370 + 21.6 × lean body mass in kg) is more accurate than Mifflin, particularly at the lean and obese tails of the distribution. We branch where the data supports it.

What the literature does not validate

Some Vital choices are not paper-derived. They are listed here:

The exact 35/25/15/15/10 Recovery weight ratio. Each component is supported individually; the specific ratio is an engineering judgement built around the state-vs-trait framing.
The HRV metric mismatch: the recovery research is RMSSD-based, but Apple Health exposes SDNN from ~1-minute samples, below the Task Force 1996 short-term standard. Vital scores SDNN against your own SDNN baseline; trend absorbs some of the gap but not all of it.
Consumer sleep-stage accuracy. Wrist and phone deep%/REM% estimates are approximate versus polysomnography, so the stage inputs are weighted behind duration and personal trend.
The 65/35 blend between logged RPE and HealthKit-derived exertion. Foster 1998 (MSSE) supports the combination principle but typically at closer to 50/50. The 65 favouring logged is a user-trust call.
The hyperbolic-tangent mapping of Training Stress Balance to a 0–100 Readiness contribution. Defensible (symmetric saturation matches the empirical observation that very-high and very-low TSB both saturate) but it's not a paper citation.
The 18%/12% nutrition-to-hydration ratio in the Health score.
The adaptive Health-score weight-shift toward weak domains. The principle has support in clinical wellness frameworks (PROMIS, SF-36); the exact magnitudes (5 pts per weak domain, capped at 50% of strong-side pool) are engineering.
Apple Watch as a measurement instrument. The HRV recordings (~1-minute samples) sit below the Task Force 1996 short-term threshold; VO₂ max underestimates lab values (Lambe 2025). Vital absorbs some of this slack through personal baselines and trend analysis; it does not eliminate it.

The full reference library

Every source named on this page is collected here with its link. The library covers HRV, sleep, training load, autonomic stress, menstrual-cycle physiology, BMR equations, hydration and nutrient density. It is maintained alongside the code whenever an algorithm weight changes.

Aubert 2003, Sports Medicine. Rolling HRV averages outperform single-day reads in trained populations.
Baker & Driver 2020, Temperature. Luteal +0.3–0.7°C core-temperature rise plus altered sleep architecture.
Banister 1975. Fitness-fatigue impulse-response model underlying modern training-load monitoring. (foundational text; no stable DOI)
Banister / Coggan 1991. TSB term; generalisation of the 1975 model for Training Stress Balance. (foundational text; no stable DOI)
Belenky 2003. Sleep debt asymmetric; recovery nights don't undo a week of restriction.
Borg 1982, MSSE. Validated psychophysical foundations of subjective exertion (RPE) scales.
Borresen & Lambert 2009, Sports Medicine. Banister and Edwards TRIMP correlate r=0.80–0.99 (114-article review).
Cheuvront & Sawka 2005. Clinically validated dehydration markers.
Cohen 1983, J Health Soc Behav. Canonical Perceived Stress Scale (PSS) reference for user-reported stress.
Cole 1999, NEJM. 1-minute heart-rate recovery as an autonomic-function marker.
Drewnowski 2009, J Am Coll Nutr. Nutrient Rich Foods Index; canonical nutrient-density framework.
Edwards 1993. 5-zone TRIMP simplification Vital ships for Exertion.
EFSA 2010. Total-water Adequate Intakes: 2.5 L/day males, 2.0 L/day females.
Ekelund 2019, BMJ. Non-linear activity–mortality dose-response, steepest first ~25 min/day (8-cohort meta-analysis).
Elo 2003, Scand J Work Environ Health. Single-item daily stress check validated against PSS-10 (r≈0.6–0.7).
European Society of Cardiology / NASPE Task Force 1996, Circulation. HRV as independent mortality predictor and canonical autonomic-balance marker; short-term sampling threshold.
FAO/WHO/UNU 2004, Human Energy Requirements consultation. PAL activity multipliers / three lifestyle bands for TDEE.
Foster 1998, MSSE. Supports combining logged RPE with HealthKit exertion (typically ~50/50).
Frankenfield 2005, J Am Diet Assoc. Cross-equation comparison; Mifflin within ~10% of measured RMR (ADA-cited).
Hirshkowitz (NSF) 2015, Sleep Health. 7–9 h adult sleep-duration target.
IOM 2004. Parallel total-water intake figures for hydration.
Jouven 2005, NEJM. 1-minute HRR as an autonomic-function marker in its own right, beyond a fitness proxy.
Karvonen 1957. Heart-rate-reserve framework behind personalised HR systems and zones.
Katch & McArdle. Lean-mass BMR equation (370 + 21.6 × LBM); more accurate at lean/obese tails.
Kim 2018, Psychiatry Investigation. Acute stress reliably reduces HF-HRV and RMSSD (37-study meta-analysis).
Kodama 2009, JAMA. Low cardiorespiratory fitness among largest mortality predictors (n=102,980).
Krupp 1989, Archives of Neurology. Validated short Likert fatigue instruments behind the fatigue slider.
Lambe 2025, PLOS ONE. Apple Watch VO₂ max underestimates lab values (~13% MAPE); caps composite at 10%.
Mandsager 2018, JAMA Network Open. Low fitness a leading mortality predictor with no elite plateau (n=122,007).
Mifflin 1990, Am J Clin Nutr. Mifflin–St Jeor BMR equation (n=498 healthy adults) for TDEE.
Murray 2017, British Journal of Sports Medicine. EWMA acute-to-chronic workload predicts injury risk; CTL/ATL/TSB and Readiness.
Norton (ESSA position) 2010, J Sci Med Sport. Standardises moderate/vigorous intensity descriptors.
Ohayon 2004, Sleep. Canonical lifespan reference for deep% and REM% (meta-analysis of 65 studies).
Plews 2013, Sports Medicine. Trend deviations matter more than absolute HRV; personal percentile windowing.
Plews 2014. ≥3 valid HRV readings/week as minimum-validity threshold for trend interpretation.
PROMIS (Cella 2007), Medical Care. Support for adaptive Health-score weight-shift toward weak domains.
Sawka (ACSM Position Stand on fluid replacement) 2007. ACSM fluid-replacement position stand; validated dehydration markers.
Schmalenberger 2019, J Clin Med. Vagal HRV drops follicular→luteal d=−0.39 (meta-analysis, 1,004 women).
Schmalenberger 2020. Progesterone is the within-person driver of luteal HRV reduction.
SF-36 (Ware 1992), Medical Care. Support for adaptive Health-score weight-shift toward weak domains.
Shaffer & Ginsberg 2017, Frontiers in Public Health. Wide healthy HRV range (RMSSD 20–100+ ms) mandating personal-baseline scoring.
Tenan 2014, Psychophysiology. Confirmed parallel luteal RHR elevation.
Thayer 2012, Neuroscience & Biobehavioral Reviews. Neurovisceral-integration model; HRV indexes vmPFC–amygdala–brainstem coupling.
Van Cauter 2000, JAMA. ~70% of nocturnal growth-hormone pulses coincide with SWS episodes.
Van Dongen 2003. Sleep debt is asymmetric; supports chronic-bad-night penalty.
Watson (AASM/SRS consensus) 2015, Sleep. 7–9 h adult sleep-duration target.
WHO (Bull et al.) 2020, BJSM. 150–300 min/week aerobic activity target.
Xie 2013, Science. Glymphatic clearance roughly doubles during slow-wave sleep.
Zhang 2016, CMAJ. Each 10 bpm RHR rise raises all-cause mortality 9% (46-study meta-analysis).

Found a missing citation or evidence that challenges a Vital weight? Send it to business@lmslabsltd.com. We read it.

This page is informational. Vital is a wellness app, not a medical device. Nothing here is a substitute for advice from a qualified clinician.

See the method in the app.

The scores are free on the App Store.

Download on the App Store