Wrist optical HR is good enough for your morning commute. It’s not reliable during hard intervals, and it’s useless for daily HRV decisions. That’s the short version. The longer version involves three accuracy tiers, a cadence-lock failure mode that can add 20–30 bpm to your reading, and a skin-tone disparity study showing 4.7x higher error for darker skin at race-pace effort.
The Three-Tier Framework: ECG, Arm-Band Optical, and Wrist Optical
Not all optical sensors are equal. The research distinguishes three tiers with meaningfully different accuracy profiles.
Tier 1: Chest strap (ECG-based) measures cardiac electrical signals directly. No processing delay. No motion artifact. Mateo-March et al. (2022, n=25) found Polar H10 bias of just –0.1 to –0.2 bpm at rest and 0.4 to –0.7 bpm at high intensity vs 12-lead ECG. ICC above 0.93 at all intensities.
Tier 2: Arm-band optical (Polar OH1, Verity Sense) sits on the upper arm, away from wrist tendons. Gilgen-Ammann et al. (2025, n=16) found the Verity Sense achieves MAE 1.43 bpm, MAPE 1.35%, CCC 1.00 vs Polar H10. For exercise HR, that’s functionally indistinguishable from chest strap. 10–20 second response lag remains. Can’t replace chest strap for HRV.
Tier 3: Wrist optical (Apple Watch, Garmin Fenix, Fitbit) sits over the radial artery. The site is prone to motion artifact, cadence interference, and vasoconstriction. MAE averages above 12 bpm at higher intensity. Pasadyn et al. (2019, n=50): at 8–9 mph treadmill running, NO wrist device maintained rc ≥ 0.70. The Garmin Forerunner 235 recorded rc = 0.52 (Etiwy 2019, n=80).
Where Chest Strap Accuracy Matters: Threshold Intervals and Cadence Lock
For Zone 2 steady aerobic work, wrist optical is often adequate. The intensity is low, motion artifact is manageable, and a 4–5 bpm error doesn’t cross a zone boundary. Chow & Yang (2020) found Garmin Vivosmart HR MAPE 3.77% in young adults doing moderate steady-state exercise, which is within practical tolerance.
The problem starts at threshold pace. A 6–10 bpm overread at LT pushes a Zone 2 session into Zone 3. That invalidates the aerobic stimulus. See Zone 2 vs LT1.
Then there’s cadence lock. Wrist optical uses photoplethysmography (PPG): green LED through skin, photodetector reading reflected blood-volume pulses. At 165–180 spm running cadence, repetitive arm swing creates oscillations that overwhelm the cardiac pulse. The sensor locks onto limb motion instead of your heartbeat. Readings jump 20–30 bpm above actual. The number looks plausible at 175 bpm during a hard interval, so you don’t catch it.
Bent et al. (2020) documented PPG response lag of 0–43 seconds during pace changes. For a 30-second interval rep, that means the sensor may never register your true peak HR before the rep ends.
Cold weather compounds both problems. Vasoconstriction at the wrist reduces peripheral blood volume, degrading PPG signal amplitude. Infrared signal RMS drops roughly 54% under cold conditions. Chest straps are unaffected by temperature.
Wrist Optical HR and Skin Tone: A 4.7x Error Gap
This is the data most gear comparison articles skip.
Duking et al. (2025) tested Fitbit Charge 5 against Polar H10 across 25 participants spanning light, medium, and dark skin tones (Fitzpatrick scale), recording 495 simultaneous HR samples during exercise. At high intensity (above 60% heart rate reserve), mean error was 3.5 bpm for light skin and 16.5 bpm for dark skin. At moderate intensity (40–60% HRR), dark skin error reached 14.6 ± 19.5 bpm vs 4.8 ± 8.5 bpm for light skin (p = 0.011). Melanin absorbs green LED light, reducing the signal-to-noise ratio enough to produce clinically meaningful errors.
Tattoos over the sensor site produce a similar effect. The PPG signal can’t reliably detect pulse frequency through heavily pigmented ink.
If you have dark skin or tattoos on your wrist, Tier 3 wrist optical isn’t a slightly worse option. It’s an unreliable one during hard efforts.
HRV Monitoring: Where Wrist Optical Completely Fails
Your watch’s morning HRV score and your actual autonomic state may have very little to do with each other.
Lind et al. (2024) ran 39 participants through 316 paired HRV readings over 14 days. Apple Watch Series 9/Ultra 2 vs Polar H10. SDNN MAPE was 28.88% (95% CI: 26.18–31.57%). MAE: 20.46 ms. The Apple Watch systematically underestimated SDNN by 8.31 ms. Equivalence testing failed the ±10 ms margin.
The 7-day HRV readiness trend methodology used by Plews and Vesterinen requires repeatable, low-noise RMSSD measurements. A 28.88% MAPE means a 70 ms true SDNN could read anywhere from 50 to 90 ms. That range crosses every practical training-decision threshold.
Resting HR from the same Apple Watch is fine: MAPE 5.91%, MAE 3.73 bpm. So the watch works for HR trend tracking. Don’t extend that trust to HRV.
For valid HRV monitoring, use a chest strap (Polar H10, H9) with a dedicated HRV app (HRV4Training, EliteHRV) or a finger sensor (Oura Ring). Nocturnal Oura HRV correlates with ECG at r = 0.962, though it carries a –15.88 ms systematic bias. That’s fine for trend monitoring, not for comparing against published norms.
The Use-Case Matrix: Which Sensor for Which Session
| Training Session | Minimum Adequate | Recommended | Why |
|---|---|---|---|
| Zone 2 steady run or ride | Wrist optical | Arm-band optical | Error stays below zone boundary at low intensity |
| Threshold intervals (Z4) | Arm-band optical | Chest strap | 6–10 bpm wrist error can misclassify the entire interval |
| HIIT or track reps | Chest strap | Chest strap | Cadence lock + lag invalidates peak HR |
| Long aerobic run/ride | Wrist optical | Arm-band optical | Cumulative drift matters; see aerobic decoupling |
| HRV morning measurement | Chest strap or finger sensor | Polar H10 + HRV4Training | Wrist SDNN MAPE 28.88% fails equivalence testing |
| Race (swim-bike-run) | Chest strap | Chest strap | High variability, cadence changes, cold water all increase wrist error |
| Cold weather outdoor run | Chest strap | Chest strap | Vasoconstriction drops wrist PPG signal amplitude ~54% |
For dark-skin athletes doing any high-intensity work, move the “minimum adequate” column one tier up across the board. At >60% HRR, 16.5 bpm mean error makes wrist optical unreliable regardless of session type.
AthleteOS detects the HR source from synced Garmin, Apple Watch, or Polar devices. When wrist-optical data feeds a threshold or VO2max interval session, the session analysis surfaces an inline accuracy warning — because a 6–15 bpm error at those intensities can misclassify every rep. Zone 2 sessions don’t trigger the flag. HRV readiness scores require a validated chest strap input; the platform won’t generate a readiness recommendation from Apple Watch optical HRV data alone.
When Each Tier Is Good Enough
The chest strap isn’t the answer to every situation. It’s uncomfortable in open-water swimming, can slip during transitions, and most athletes won’t wear one for easy recovery jogs. That’s fine.
Here’s the practical read. If your training is predominantly Zone 2 base work with occasional tempo, a wrist optical watch covers about 80% of sessions adequately. Add an arm-band optical (Polar Verity Sense costs around $90) for threshold and interval days, and you’ve covered the gap without the discomfort of a chest strap every session.
If HRV readiness is part of your daily routine, a chest strap for five morning minutes is the only validated option. The Polar H10 takes 90 seconds to set up with HRV4Training. That’s a small friction cost against making the right training decision each day.
Wrist optical is your iPhone weather app: useful for most days, wrong enough on the days that matter. The chest strap is the National Weather Service — less convenient, but correct when accuracy has consequences.
Key numbers:
- Wrist optical at high intensity: MAE
12.29 bpm(Polar Vantage V2),16.5 bpmfor dark skin (Fitbit Charge 5) - Arm-band optical: MAE
1.43 bpm, CCC1.00(Polar Verity Sense, Gilgen-Ammann 2025) - Wrist HRV SDNN MAPE:
28.88%(Apple Watch Series 9/Ultra 2, Lind 2024) - Cadence lock range: 165–180 spm
- PPG response lag: 0–43 seconds (Bent et al. 2020)
- Chest strap ECG bias: <0.7 bpm across all intensities