16min read

Aerobic Fitness Tests in Football: Comparing the Beep Test, Yo-Yo IR, Time Trial, and 30-15 IFT

Yo-Yo test 30-15 IFT MAS HIIT prescription individualisation

Prerequisites: This article assumes familiarity with VO₂max as a measure of aerobic capacity and the concept of lactate and ventilatory thresholds. If any of these topics are new to you, start with:

Learning Objectives

  • Explain why aerobic fitness testing is needed in football — for profiling, training prescription, and longitudinal monitoring.
  • Distinguish the protocols and measured variables of the Multi-Stage Fitness Test (Beep Test), Yo-Yo IR1/IR2, Time Trial, and 30-15 IFT.
  • Compare the VO₂max estimation validity and limitations of each test.
  • Explain how test results link to training prescription, particularly HIIT intensity individualisation.
  • Evaluate practical factors to consider when selecting tests in a team setting, including timing, the specificity–sensitivity trade-off, and player buy-in.

Why Test Aerobic Fitness?

At least 90% of the energy demand during a football match is met by aerobic metabolism (Walker et al., 2023). This single fact explains why aerobic capacity sits at the centre of any physical preparation programme. Higher VO₂max values have been positively associated with league standing, total distance covered, a 25% increase in ball involvements, and a 100% increase in the number of sprints performed during matches (Walker et al., 2023).

These relationships make aerobic fitness testing essential, but not for the reason many assume. Aerobic fitness tests are not direct predictors of match performance. A high VO₂max does not guarantee a high-quality performance on the pitch. Two players with identical test results may display entirely different match profiles depending on tactical role, decision-making, and technical ability.

The value of testing lies in three purposes. First, profiling: identifying each player’s current aerobic capacity relative to the squad and to positional benchmarks. Second, training prescription: deriving individual reference speeds that allow practitioners to individualise conditioning sessions. Third, longitudinal monitoring: tracking changes in aerobic fitness over time to evaluate whether the training programme is producing the intended adaptations.

Testing quality should be evaluated against five criteria: the test must be relevant to the physical demands being assessed, reliable in producing consistent results, representative of the movement context, realistic given available resources, and relatable to coaching staff and players (Marsh et al., 2023). Without meeting these criteria, testing becomes data collection for its own sake — a practice that adds no value to preparation.


Four Tests: Protocols and What They Measure

Football practitioners commonly use four categories of aerobic field tests. Each has a distinct work–rest structure, and this structural difference determines which physiological attributes the test primarily assesses.

Multi-Stage Fitness Test (Beep Test)

The Multi-Stage Fitness Test (MSFT), widely known as the Beep Test, is a continuous incremental shuttle run. Players run 20-metre shuttles at progressively increasing speeds dictated by audio signals until voluntary exhaustion. There is no rest period between shuttles. The final stage reached is used to estimate VO₂max through published regression equations.

The Beep Test was originally designed for general population assessment. Its continuous nature means it predominantly taxes the aerobic energy system without the intermittent recovery periods that characterise football. A notable limitation is sensitivity to non-physical factors: research has shown that as little as 30 minutes of cognitive task performance prior to the test can impair Beep Test results, raising questions about its construct validity as a pure measure of aerobic capacity (Marsh et al., 2023).

Yo-Yo Intermittent Recovery Tests (IR1 and IR2)

The Yo-Yo Intermittent Recovery Test Level 1 (YYIR1) requires players to perform 2 × 20-metre shuttles at increasing speeds, with a 10-second active recovery period (2 × 5-metre jog) between each shuttle bout. The test continues until the player fails to reach the line on time twice. The total distance covered is the primary outcome variable.

The Yo-Yo Intermittent Recovery Test Level 2 (YYIR2) follows the same structure but starts at a higher speed and progresses more rapidly. The shorter time-to-exhaustion shifts the physiological demand toward anaerobic intermittent capacity rather than aerobic intermittent capacity.

The 10-second recovery interval is the critical design feature that separates Yo-Yo tests from the Beep Test. This intermittent structure better reflects the stop–start nature of football, where players alternate between high-intensity efforts and brief recovery periods throughout a match.

Time Trial

A Time Trial (TT) is the simplest approach: players run a fixed distance (typically 1,200–2,200 metres, corresponding to approximately 5–7 minutes of running) at maximum effort. Maximal Aerobic Speed (MAS) is directly calculated by dividing the distance by the time taken.

The Time Trial’s primary advantage is that it produces a directly usable training variable — MAS — without the need for regression equations or conversion tables. Normative MAS data are readily available across sports and competition levels (Marsh et al., 2023). The protocol is simple, highly reproducible, and can be administered to an entire squad simultaneously.

30-15 Intermittent Fitness Test (30-15 IFT)

The 30-15 Intermittent Fitness Test (30-15 IFT) consists of 30-second shuttle runs at progressively increasing speeds, interspersed with 15-second passive recovery periods. Players run back and forth over a 40-metre distance (with 3-metre deceleration zones at each end), and the speed increases by 0.5 km·h⁻¹ each stage. The final speed reached is termed V_IFT.

V_IFT is distinct from MAS. Because the protocol includes direction changes and operates within the Anaerobic Speed Reserve (ASR) — the range between MAS and maximal sprint speed — V_IFT reflects both aerobic capacity and the ability to perform above MAS (Marsh et al., 2023). This dual sensitivity is what makes the 30-15 IFT unique among the four tests.

Protocol Comparison

FeatureBeep TestYo-Yo IR1Yo-Yo IR2Time Trial30-15 IFT
StructureContinuousIntermittentIntermittentContinuousIntermittent
RecoveryNone10 s active10 s activeNone15 s passive
Direction changesYes (shuttles)Yes (shuttles)Yes (shuttles)No (linear)Yes (shuttles)
Primary outcomeEstimated VO₂maxDistance (m)Distance (m)MAS (km·h⁻¹)V_IFT (km·h⁻¹)
Primary attributeAerobic capacityAerobic intermittent capacityAnaerobic intermittent capacityMaximal aerobic speedMAS + ASR combined

The protocol differences are not trivial details. They determine what each test actually measures, which in turn determines how the result can be applied.


VO₂max Estimation Validity

All four Yo-Yo test variants show large correlations with laboratory-measured VO₂max, but the strength of those correlations varies (Tan et al., 2025). A systematic review and meta-analysis covering 33 studies and 1,362 team-sport athletes reported the following pooled correlations:

TestCorrelation with VO₂max (r)Number of Studies
YYIR10.6524
YYIR20.5511
YYIE10.675
YYIE20.676

Several points require attention. First, YYIR2 consistently shows the lowest correlation with VO₂max. This does not mean the test is flawed — it means YYIR2 measures something different. The higher starting speed and faster progression of YYIR2 place a greater demand on anaerobic energy systems, making it better suited for assessing anaerobic intermittent exercise capacity than aerobic capacity (Tan et al., 2025).

Second, the correlations between Yo-Yo tests and other field-based endurance tests were large (r = 0.66), while correlations with jump performance were moderate (r = 0.40). This confirms that Yo-Yo tests share variance with other endurance measures but are not redundant — they capture the intermittent component that continuous tests miss.

Third, sub-group analysis revealed that the correlation between Yo-Yo tests and VO₂max was not significantly affected by training level. This means the relationship holds across recreational, trained, and highly trained athletes.

The practical implication is clear. Practitioners who need to estimate aerobic intermittent capacity should use the YYIR1 or YYIE2. Practitioners who need to assess anaerobic intermittent capacity should use the YYIR2. Using YYIR2 to estimate VO₂max would underestimate the true relationship between the player’s aerobic system and the test outcome.

A broader limitation applies to all field-based VO₂max estimates. Laboratory measurement using gas analysis remains the gold standard, but it is impractical in elite football environments where 25 or more players need to be assessed within tight pre-season schedules (Marsh et al., 2023). Field tests trade precision for scalability — a trade-off that is acceptable when the purpose is profiling and prescription rather than clinical diagnosis.


From Test Results to Training Prescription

The value of a fitness test extends beyond the number it produces. The critical question is whether that number can be directly translated into a training prescription.

MAS as a Prescription Anchor

MAS is the most widely used reference speed for prescribing aerobic and high-intensity interval training. A Time Trial directly yields MAS, making it the most straightforward path from assessment to prescription. When a practitioner knows a player’s MAS is 16.5 km·h⁻¹, interval sessions can be prescribed at specific percentages of that value — for example, 4 × 4-minute runs at 90% MAS for aerobic development, or shorter intervals at 100–120% MAS for work near and above the anaerobic threshold.

Research on professional football players has demonstrated why individualising speed zones around MAS matters. When comparing arbitrary fixed speed thresholds with MAS-based individualised thresholds, measurement bias reached up to 70% for certain speed zones (Rago et al., 2020). Two players running at the same absolute speed may be working at entirely different percentages of their aerobic capacity. The individualised approach captures this difference; the arbitrary approach does not.

The Limitation of Yo-Yo End Speed

Yo-Yo tests produce a total distance covered, and while the final shuttle speed can be identified, this speed does not correspond proportionally to MAS. The progressive nature of the Yo-Yo protocol — where speed increases are paired with cumulative fatigue — means the final speed reflects a complex interaction of aerobic capacity, anaerobic contribution, and fatigue resistance rather than a clean MAS value (Marsh et al., 2023).

This makes Yo-Yo results valuable for profiling — ranking players, tracking changes over time, comparing to normative data — but not directly usable for prescribing training speeds. A practitioner cannot take a player’s YYIR1 final speed and use it as the basis for HIIT prescription without introducing systematic error.

The 30-15 IFT Advantage

The 30-15 IFT is the only test among the four that simultaneously provides both profiling and training prescription (Marsh et al., 2023). V_IFT directly serves as the reference running speed for prescribing short-duration, high-intensity intervals within the ASR. Because V_IFT accounts for both MAS and the player’s capacity above MAS, it is particularly well suited for prescribing supramaximal HIIT with direction changes — a training format that closely resembles the intermittent demands of football.

However, MAS alone is insufficient for accurate HIIT programming. A recent methodological framework proposed the formula:

Estimated Maximal MP1min=MAS+0.695×ASR\text{Estimated Maximal MP}_{1\text{min}} = \text{MAS} + 0.695 \times \text{ASR}

where ASR is the difference between maximal sprint speed and MAS (Buchheit et al., 2026). This formula highlights that two players with the same MAS but different maximal sprint speeds will have different capacities for sustaining high-intensity efforts — a difference that MAS-only prescription would miss entirely.

Combining Tests for Richer Profiles

Adding a separate MAS assessment (such as a Time Trial or Vam-Eval) alongside the 30-15 IFT allows practitioners to isolate the contribution of high-intensity and directional components. The difference between V_IFT and MAS reflects the player’s supramaximal and change-of-direction capacity: a large gap indicates strong performance above MAS and during direction changes, while a small gap suggests the player’s profile is predominantly aerobic (Marsh et al., 2023).

This combined approach transforms testing from a single-number output into a multi-dimensional profile. Experimental work has confirmed the practical relevance of MAS-based intensity prescription: when between-sprint running intensity was set at MAS during repeated-sprint protocols, sprint performance declined from the second repetition onward, with approximately twice the decrement compared to sub-threshold running (Bizas et al., 2026). This illustrates how precisely calibrated intensity — anchored to individually measured reference speeds — determines the training stimulus a player actually receives.


Choosing the Right Test for Your Context

No single test is universally optimal. The choice depends on the specific question the practitioner is trying to answer and the practical constraints of the environment.

The Specificity–Sensitivity Trade-Off

A fundamental principle governs test selection: specificity and sensitivity are inversely related (Marsh et al., 2023). A highly sport-specific test (such as a football-specific fitness test involving the ball) may appear to have greater face validity, but its multifactorial nature — combining aerobic function, anaerobic capacity, direction changes, neuromuscular ability, and technical skill — makes it impossible to isolate which component needs improvement. Two players with entirely different physical profiles could produce the same test score (Marsh et al., 2023).

Removing the technical element is the first step toward a clearer fitness profile. Tests like the Yo-Yo IR and the 30-15 IFT achieve this by using standardised running protocols without ball involvement, allowing practitioners to attribute changes in performance to changes in physical capacity rather than technical variability.

Timing Within the Season

Test selection also depends on when in the season the assessment occurs. On the first day of pre-season, maximal tests carry injury risk because players return in a de-trained state. The recommended approach is to begin with submaximal tests and screening, then introduce maximal fitness testing after several weeks of progressive loading (Marsh et al., 2023).

The 30-15 IFT, for instance, requires players to reach maximal effort including direction changes at high speed — an exposure that demands adequate preparation. Scheduling it too early increases the risk of soft-tissue injury; scheduling it after an appropriate build-up period yields safer and more valid results.

Sensitivity to Change

A test must be sensitive enough to detect meaningful changes in fitness. If a player improves their aerobic capacity by 5% over a training block but the test’s typical error is 6%, the improvement is invisible. Practitioners should establish reliability data — including the coefficient of variation — within their own testing environment rather than relying solely on published norms (McGuigan, 2022). Equipment, surface, motivation, and testing personnel all introduce variability that published studies cannot account for.

Field-based intermittent tests have demonstrated sensitivity to training interventions in applied settings. Following a 14-day altitude training camp, elite team-sport athletes showed approximately 45% improvement in YYIR2 performance at the three-week follow-up (Brocherie et al., 2015). This magnitude of change far exceeds typical test error, confirming that YYIR2 can detect large training effects. However, sensitivity to smaller, incremental in-season changes requires tighter control of testing conditions.

A Decision Framework

QuestionRecommended Test
What is the player’s aerobic intermittent capacity?YYIR1 or YYIE2
What is the player’s anaerobic intermittent capacity?YYIR2
What is the player’s MAS for direct training prescription?Time Trial (5–7 min)
Profiling and HIIT prescription simultaneously?30-15 IFT
Isolating supramaximal and direction-change components?30-15 IFT + Time Trial combined

When resources allow only one test, the 30-15 IFT offers the broadest utility because it serves both profiling and prescription purposes. When the primary goal is simple, reproducible MAS measurement for a large squad, the Time Trial is the most efficient option. The choice is never about which test is “best” in absolute terms — it is about which test answers the specific question being asked, within the constraints of the specific environment.


Key Takeaways

  • Aerobic fitness tests are tools for profiling players and prescribing individualised training — not direct predictors of match performance.
  • The Beep Test, Yo-Yo IR1/IR2, Time Trial, and 30-15 IFT each have distinct work–rest structures, and these structural differences determine which physiological attributes they assess.
  • All Yo-Yo tests show large correlations with VO₂max, but YYIR2 is better suited for assessing anaerobic intermittent capacity, while YYIR1 or YYIE2 are recommended for aerobic intermittent assessment.
  • The 30-15 IFT is the only test that simultaneously provides profiling and HIIT prescription through V_IFT; MAS alone is insufficient for accurate high-intensity programming because it does not account for the anaerobic speed reserve.
  • When selecting tests, rather than over-emphasising sport-specificity, prioritise tests that remove technical elements to clarify the fitness profile and that can be directly linked to training prescription.

References

  1. Bizas, G., Smilios, I., Thomakos, P., & Bogdanis, G. C. (2026). Effects of between-sprint running intensity on repeated-sprint performance in professional soccer players. Sports, 14(3), 97. https://doi.org/10.3390/sports14030097
  2. BROCHERIE, F., MILLET, G. P., HAUSER, A., STEINER, T., RYSMAN, J., WEHRLIN, J. P., & GIRARD, O. (2015). “Live High–Train Low and High” Hypoxic Training Improves Team-Sport Performance. Medicine & Science in Sports & Exercise, 47(10), 2140-2149. https://doi.org/10.1249/mss.0000000000000630
  3. Buchheit, M., Sagarra, A. L., Boskovic, A., Komino, P., Norman, D., & Hader, K. (2026). GPS 3.0: from distance into zones toward better proxies of internal neuromuscular load in elite football. Sport Performance & Science Reports, 280, v1.
  4. Marsh, J., Calder, A., Stewart-Mackie, J., & Buchheit, M. (2023). Needs analysis and testing. In A. Calder & A. Centofanti (Eds.), Peak performance for soccer: The elite coaching and training manual. Routledge.
  5. McGuigan, M. (2022). Profiling and Benchmarking. In D. N. French & L. Torres Ronda (Eds.), NSCA’s Essentials of Sport Science. Human Kinetics.
  6. Rago, V., Brito, J., Figueiredo, P., Krustrup, P., & Rebelo, A. (2020). Application of Individualized Speed Zones to Quantify External Training Load in Professional Soccer. Journal of Human Kinetics, 72(1), 279-289. https://doi.org/10.2478/hukin-2019-0113
  7. Tan, Z., Castagna, C., Krustrup, P., Wong, D. P., Póvoas, S., Boullosa, D., Xu, K., & Cuk, I. (2025). Exploring the Use of 5 Different Yo‐Yo Tests in Evaluating V̇O2max and Fitness Profile in Team Sports: A Systematic Review and Meta‐Analysis. Scandinavian Journal of Medicine & Science in Sports, 35(5), e70054. https://doi.org/10.1111/sms.70054
  8. Walker, G., Read, M., Burgess, D., Leng, E., & Centofanti, A. (2023). Conditioning. In A. Calder & A. Centofanti (Eds.), Peak performance for soccer: The elite coaching and training manual. Routledge.