Skip to main content

Measurement properties of cervical joint position error in people with and without neck pain: a systematic review and narrative synthesis

Abstract

Introduction

Proprioception can be impaired in people with neck pain. The cervical joint position sense test, which measures joint position error (JPE), is the most common test used to assess neck proprioception. The aim of this systematic review was to assess the measurement properties of this test for the assessment of people with and without neck pain.

Methods

This systematic review was registered prospectively on Prospero (CRD42020188715). It was designed using the COSMIN guidelines and reported in line with the PRISMA checklist. Two reviewers independently searched Medline, Embase, SportDiscus, and CINAHL Plus databases from inception to the 24th July 2022 with an update of the search conducted until 14th of October 2023. The COSMIN risk of bias checklist was used to assess the risk of bias in each study. The updated criteria for good measurement properties were used to rate individual studies and then the overall pooled results. The level of evidence was rated by two reviewers independently using a modified GRADE approach.

Results

Fifteen studies were included in this review, 13 reporting absolute JPE and 2 reporting constant JPE. The measurement properties assessed were reliability, measurement error, and validity. The measurement of JPE showed sufficient reliability and validity, however, the level of evidence was low/very low for both measurement properties, apart from convergent validity of the constant JPE, which was high.

Conclusion

The measure of cervical JPE showed sufficient reliability and validity but with low/very low levels of evidence. Further studies are required to investigate the reliability and validity of this test as well as the responsiveness of the measure.

Peer Review reports

Background

Cervical sensorimotor control relies on the integration of visual, vestibular, and somatosensory information [1]. Afferent information from muscle spindles is known to contribute significantly to cervical proprioception; in particular muscles in the upper cervical region contain a high density of muscle spindles, which implies that they have an important role in neck proprioception [2].

Several outcome measures have been used to assess cervical proprioception with the joint position sense test being the most common test to evaluate joint position error (JPE) [2]. The joint position sense test determines a person’s ability to relocate their head back to a target position following active neck movement whilst their vision is occluded. Two commonly used joint position sense tests that measure JPE include the neutral head position (NHP) test, having the participant return to a neutral head position following active movement, or the target head position (THP) test, where target head position is determined by the participant or assessor [3].

Several studies have evaluated cervical proprioception by quantifying JPE in people with neck pain and have shown that cervical proprioception can be impaired in people with neck pain regardless of the aetiology [4,5,6]. For example, Revel et al. reported a higher repositioning error in people with chronic neck pain (CNP) after returning to neutral from flexion, extension, and right and left rotation when compared to asymptomatic participants [4]. Feipel et al. showed that people with chronic whiplash-associated disorders (WAD) had higher repositioning errors compared to asymptomatic participants [7]. Nevertheless, evidence indicates that cervical proprioception is more likely to be affected and to a greater extent in people that have trauma-induced neck pain [8]. Additionally, people with WAD and dizziness complaints usually have higher repositioning errors when compared to people with WAD but without dizziness [9, 10]. Impaired neck proprioception is thought to be at least partially attributed to a disturbance in cervical afferent activity [2]. Several mechanisms can contribute to this disturbance such as direct trauma to cervical structures, the influence of nociception, the presence of inflammatory mediators [9], and psychological distress [8]. A disturbance in cervical afferent input is also thought to contribute symptoms of dizziness for some patients [8].

Knowledge of the psychometric properties of outcome measures, which includes their reliability, validity, and responsiveness, are important as they reflect data accuracy and precision [11]. Michiels et al. carried out a systematic review investigating the measurement properties of cervical sensorimotor control tests [12]. In their 2012 review, they investigated the reliability and discriminative validity of tests. Although this systematic review did not use the now recommended Consensus-based Standards for the Selection of health Measurement Instrument (COSMIN) reporting guidelines [13], they reported that the NHP test showed fair to excellent reliability (ICC range: 0.35–0.87) while the THP showed poor to excellent reliability (ICC range: 0.01–0.9). Additionally, the JPE test was able to discriminate between people with and without chronic neck pain.

Given the number of publications since this last review, in this current systematic review, we aimed to build upon this research to synthesise the available evidence in relation to a range of measurement properties (reliability, measurement error, validity, and responsiveness) of the measure of cervical JPE for the assessment of people with and without neck pain.

Design and methods

This systematic review was designed using the COSMIN risk of bias (RoB) guidelines for reliability and measurement error of outcome measurement instruments as well as the COSMIN methodology for systematic reviews of Patient-Reported Outcome Measures (PROMs) [13, 14] and is reported in accordance with the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) [15]. The protocol was registered with PROSPERO on the 10th of July 2020 (CRD42020188715).

Deviations from the study protocol

The initial protocol described a systematic review of the measurement properties of proprioception tests for all regions of the spine. However, following an initial review of the literature and appreciation of the number of studies conducted in different spinal regions, the decision was made to focus on the measurement properties of cervical JPE only. Additionally, the original plan was to use the COSMIN RoB checklist for PROMs, however since publishing the protocol, the authors were made aware of the new COSMIN RoB checklist for reliability and measurement error of outcome measurement instruments. Thus, this new tool was used to assess RoB of reliability, measurement error, and criterion validity [14]. The COSMIN RoB checklist for PROMs was used to assess construct and discriminative validity [16] as suggested in the manual for the COSMIN RoB checklist for reliability and measurement error of outcome measurement instruments.

Eligibility criteria

The following inclusion criteria are based on the Sample, Phenomena of Interest, Design, Evaluation, and Research type (SPIDER) guidelines [17].

  • Sample: people with and without neck pain aged ≥ 18 years. Those with neck pain included regardless of the stage of their neck pain (e.g., acute, or chronic) or aetiology (e.g., non-specific or attributed to pathology).

  • Phenomena of interest: cervical proprioception.

  • Design: any study which investigated at least one of the domains (reliability, validity, responsiveness, and their sub-domains) of the COSMIN checklist and reported absolute error (AE) or constant error (CE) in degrees.

  • Evaluation: any study that evaluated measurement properties of the measure of cervical JPE.

  • Research type: quantitative research.

Exclusion criteria

Studies that included patients that had undergone cervical spine surgery and studies not written in English were excluded.

Information sources

The following databases were searched as recommended by the COSMIN guidelines for systematic reviews [13], from inception to the 24th July 2022 with an update of the search conducted until 14th of October 2023: MEDLINE, Embase, SportDiscus, and CINAHL plus. Manual searches were carried out for: The Spine Journal, European Spine Journal, Journal of Musculoskeletal Science and Practice, and the Journal of Orthopaedic and Sport Physical Therapy. Grey literature (Open Grey, ProQuest, and EThOS) was hand searched.

Search strategy

Following scoping searches and discussions with co-authors, the search strategy was developed, and a librarian was consulted. Search terms are provided in Table 1. Search syntax was translated to meet the requirements of each database.

Table 1 MEDLINE syntax used in MEDLINE database

Data management

Endnote software version X9 (Clarivate Analytics) was used to manage citations and bibliographies and store articles found and eliminate duplicates.

Study selection

AA carried out the initial search of the databases, after that, two researchers (AA, SA) independently carried out the screening of potentially eligible studies. The screening and selection were carried out in two steps. Step 1: Abstracts and titles using the eligibility criteria. Step 2: Retrieve full text of potentially relevant studies to be screened. Studies were included if both reviewers had agreed on inclusion after screening the full text. In case of any disagreement, a third reviewer (DF) was consulted.

Data extraction and data items

Two researchers (AA, SA) independently carried out the data extraction from the included studies. Extracted data items were characteristics of the studies (study design and sample size), characteristics of the participants (age, gender, population), testing instrument, testing protocols, measurement properties (reliability, measurement error, validity, and responsiveness), and results. In case of any disagreement, a third reviewer (DF) was consulted.

Risk of bias assessment

Included studies were independently assessed by two reviewers (AA, SA) using the COSMIN RoB checklist for reliability and measurement error of outcome measurement instruments to assess RoB of reliability, measurement error, and criterion validity [14]. The COSMIN RoB checklist for PROMs was used to assess construct and discriminative validity [16]. Both checklists have four scores (very good, adequate, doubtful, and inadequate) [16] that assess measurement properties with regard to design and statistical methods. In case of any disagreement, a third reviewer (DF) was consulted.

Data synthesis

Data synthesis of the results was undertaken in accordance with COSMIN guidelines [13]. After assessing the risk of bias, each study was rated using the updated criteria for good measurement properties as sufficient ( +), insufficient (-), or indeterminate (?) [13], then, the overall results of each measurement property per outcome measure per population were rated against the criteria of a good measurement property as sufficient ( +), insufficient (-), inconsistent ( ±), or indeterminate (?) [13]. Table 2 presents the updated criteria for good measurement properties.

Table 2 The updated criteria for good measurement properties [13, 18]

The overall level of evidence for each outcome measure and its respective measurement property was then determined independently by two reviewers (AA, SA) using a modified Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach [19]. Table 3 presents the modified GRADE approach used to rate the overall quality of the evidence. More information on how to downgrade the level of evidence can be found in the COSMIN user manual [19].

Table 3 Modified GRADE approach used to rate the overall level of evidence [13]

Results

Fifteen studies were included four with CNP, three that did not specify the type of neck pain, one with cervicogenic disc disease, and seven studies that included participants without neck pain. There was a 100% agreement between raters (AA, SA) for the included studies. Search results are summarised in Fig. 1 and Table 4 summarises the extracted data from the included studies.

Fig. 1
figure 1

Prisma flow diagram of the study selection process [20]. CNP = chronic neck pain. NP = neck pain. CDD = cervicogenic disc disease. n = number of studies

Table 4 Data extracted from the studies included in this review

Absolute joint position error for people with neck pain

Intra-rater reliability

For the NHP test, six studies investigated intra-rater reliability of absolute JPE. One study included participants with CDD [5] testing right and left rotation using a CROM device and 3 trials for their assessment in sitting position, however only the NHP test was reported. This study was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Three studies mentioned neck pain participants but failed to report type or duration of neck pain [3, 30, 33]. Alahmari et al. [3] carried out their intra-rater reliability assessment for the NHP test, it was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Burke et al. [30] carried out their intra-rater reliability using two devices, the CROM and laser. Both were rated as inadequate in the RoB checklist and insufficient in the updated criteria for good measurement properties. Cid et al. [33] investigated the intra-rater reliability of the NHP, it was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Moreover, two studies included CNP participants [27, 31], and tested both right and left rotation in sitting position. Roren et al. [27] included 5 trials in their assessment, and used a laser pointer and US device. Both parts were rated as inadequate in the RoB checklist and insufficient in the updated criteria for good measurement properties. Goncalves and Silva [31] carried out within-day and between-day intra-rater reliability investigations of different types of NHP tests: NHP, figure of 8 (F8T) relocation test, and torsion test (TT). All investigations for were rated as doubtful in RoB checklist and sufficient in the updated criteria for good measurement properties. Nine studies showed sufficient results and four studies showed insufficient results. Therefore, the overall rating was taken. The overall rating of the intra-rater reliability was rated as sufficient, but the quality of evidence was downgraded to very low due to inconsistency of results and risk of bias (multiple studies with doubtful/inadequate ratings and inconsistency of results) (Table 5).

Table 5 Summary of measurement properties of the measure of absolute JPE

For the THP test, two studies tested the intra-rater reliability of the THP test [3, 31]. Alahmari et al. [3] was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Goncalves and Silva [31] carried out a within-day and between-day testing. Both investigations were rated as doubtful in the RoB checklist and sufficient in the updated criteria for good measurement properties. The overall rating of the intra-rater reliability of the THP test was rated as sufficient, but the quality of evidence was downgraded to low due to risk of bias (multiple studies with doubtful/inadequate rating) (Table 5).

Inter-rater reliability

Only two studies investigated inter-rater reliability of the NHP test in this population, and both did not report type of neck pain. Alahmari et al. [3] was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Burke et al. [30] carried out their investigation using two devices the laser pointer and the CROM. Both were rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. A total of three investigation showing sufficient results. The overall rating was rated as sufficient, but the quality of evidence was downgraded to low due to risk of bias (multiple studies with inadequate ratings) (Table 5).

Measurement error

For the THP test, five studies investigated measurement error [3, 5, 27, 30, 31]. GRADE was not possible to apply due to minimal important change (MIC) not provided (Table 5). For the THP test, two studies investigated measurement error [3, 31]. GRADE was not possible to apply as the minimal important change was not provided (Table 5).

Convergent validity

Two studies investigated the convergent validity in this population and were on CNP people. Chen and Treleaven [28] correlated three JPE tests (conventional, TT, Enbloc) with the neck disability index (NDI) and the visual analogue scale (VAS). All parts were rated as adequate in the RoB checklist and insufficient in the updated criteria for good measurement properties, apart from the correlation of JPE conventional with VAS, which showed sufficient results. Goncalves and Silva [31] correlated four JPE tests (NHP, THP, TT, and F8T) against each other and against disability, pain catastrophising, and fear of movement questionnaires. All parts were rated as adequate in the RoB checklist. Correlation of the tests against the questionnaires were rated as insufficient in the updated criteria for good measurement properties, while correlation of tests against each other were rated as sufficient. Seventeen investigations showed insufficient results and thirteen studies showed sufficient results. The overall rating was taken and rated as insufficient, and the quality of evidence was downgraded to low due to inconsistency of results (Table 5).

Discriminative validity

Three studies investigated the discriminative validity in people with CNP. Chen and Treleaven [28] used three tests (JPE conventional, TT, Enbloc), Goncalves and Silva [31] used four tests (NHP, THP, TT, F8T), and Roren et al. [27] used the NHP test. All investigation were rated as inadequate in the RoB checklist. All studies were rate as indeterminate in the updated criteria for good measurement properties due to improper statistical tests used for analysis, apart from the study by Roren et al. [27], which was rated as sufficient. Seven studies showed indeterminate results and one study showed sufficient results. The overall rating of the discriminative validity was rated as indeterminate, and the quality of evidence was downgraded to very low due to inconsistency of results and risk of bias (multiple studies with inadequate rating) (Table 5).

Criterion validity

The criterion validity was reported only in CNP population testing for only right and left rotation. Roren et al. [27] correlated the laser pointer against an US device in sitting position for the NHP test only. This study was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Chen and Treleavan [28] correlated the laser pointer against the 3-Space Fastrak for both the NHP and TT in sitting position. Both parts were rated as adequate in the RoB checklist. The conventional JPE was rated as sufficient, and the TT was rated as insufficient in the updated criteria for good measurement properties. Two investigations showed sufficient results and one showed insufficient results. The overall rating was rated as sufficient, and the quality of evidence was downgraded to low due to inconsistency of results (Table 5).

Absolute joint position error for asymptomatic people

Intra-rater reliability

A total of six studies investigated intra-rater reliability of the NHP test in this population. Kristjansson et al. [22] carried their investigation on four JPE tests: NHP, Preset trunk rotation, and F8T relocation test. All parts were rated as inadequate in the RoB checklist. The NHP and F8T investigations were rated as insufficient, and Present trunk rotation investigation was rated as sufficient in the updated criteria for good measurement properties. Strimpakos et al. [24] carried out their intra-rater investigation in sitting and standing. Both were rated as inadequate in the RoB checklist and insufficient in the updated criteria for good measurement properties. Pinsault et al. [25] was rated as doubtful in the RoB checklist and sufficient in the updated criteria for good measurement properties. Goncalves and Silva [31] carried out within-day and between day investigations for three NHP tests (NHP, TT, and F8T). All investigations were rater as doubtful in the RoB checklist. The between-day investigation of the TT was rated as insufficient, while the remaining investigations were rated as sufficient in the updated criteria for good measurement properties. Nikkhoo et al. [34] carried out within-day and between-day investigations using US MOCAP and IMU devices. All investigations were rated as doubtful in the RoB checklist and sufficient in the updated criteria for good measurement properties. Cid et al. [33] was rated as doubtful in the RoB checklist and insufficient in the updated criteria for good measurement properties. Eleven studies showed sufficient results and six studies showed insufficient results. The overall rating was sufficient, and the quality of evidence was downgraded to very low due to inconsistency of results and risk of bias (multiple studies with doubtful/inadequate rating) (Table 5).

Regarding the THP test, three studies investigated the intra-rater reliability of this test in this population [21, 22, 31] Artz et al. [21] carried out within-day and between-day intra-rater reliability of THP test only in sitting and standing. All parts were rated as inadequate in the RoB checklist and insufficient in the updated criteria for good measurement properties, apart from the between-day assessment in sitting, which was rated as sufficient. Kristjansson et al. [22] was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Goncalves and Silva [31] carried out a within-day and between-day investigations, both investigation were rated as doubtful in the RoB checklist and sufficient in the updated criteria for good measurement properties. Four studies showed sufficient results and three studies showed insufficient results. The overall rating was rated as sufficient, but the quality of evidence was downgraded to very low due to risk of bias and inconsistency of results (Table 5).

Inter-rater reliability

Only one study investigated inter-rater reliability of the NHP test [24] in this population. This study was rated as inadequate in the RoB checklist and insufficient in the updated criteria for good measurement properties. The overall rating was insufficient, and the quality of evidence was downgraded to very low due to risk of bias and low imprecision (sample size < 100) (Table 5).

Intra-session reliability

Only one study [26] investigated in intra-session reliability of the NHP test in this population. This study was rated as doubtful in the RoB checklist and insufficient in the updated criteria for good measurement properties. The overall rating was insufficient, and the quality of evidence was very low due to risk of bias and imprecision (sample size < 100) (Table 5).

Inter-session reliability

Only one study [26] investigated in inter-session reliability of the NHP test in this population. This study was rated as doubtful in the RoB checklist and insufficient in the updated criteria for good measurement properties. The overall rating was insufficient, and the quality of evidence was very low due to risk of bias and imprecision (sample size < 100) (Table 5).

Measurement error

For the NHP test, six studies investigated measurement error [21, 22, 24, 25, 31, 34]. GRADE was not possible to apply due to MIC no provided. For the THP test, three studies investigated measurement error [21, 22, 31]. GRADE was not possible to apply as the minimal important change was not provided.

Criterion validity

Two studies investigated criterion validity in this population. Wibault et al. [5] was rated as doubtful in the RoB checklist and indeterminate in the updated criteria for good measurement properties. Nikkhoo et al. [32] was rated as adequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. We were not able to take an overall rating as one study showed sufficient results and the other one showed indeterminate results. Therefore, the overall rating was indeterminate, and no GRADE was applied due to inconsistency of results (Table 5).

Constant joint position error for asymptomatic people

Intra-rater reliability

Two studies investigated the intra-rater reliability of the NHP test. Lee et al. [23] was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. Dugailly et al. [29] carried out four intra-rater reliability investigation of the NHP test; low and fast speeds at 90cm and 180cm from a target. All four parts were rated as inadequate in the RoB checklist. Only the low speed at 90cm was rated as insufficient, while the remaining three were rated as sufficient in the updated criteria for good measurement properties. Four studies showed sufficient result, one study showed insufficient results. The overall rating was sufficient, and the quality of evidence was downgraded to very low due to inconsistency of results, risk of bias (multiple studies with inadequate ratings), and imprecision (sample size < 100) (Table 6).

Table 6 Summary of measurement properties of the measure of constant JPE

For the THP test, only one study investigated the intra-rater reliability of this test [23]. This study was rated as inadequate in the RoB checklist and sufficient in the updated criteria for good measurement properties. The overall rating was sufficient, but the quality of evidence was downgraded to very low due to risk of bias and imprecision (sample size < 100) (Table 6).

Measurement error

For the NHP test, two studies investigated the measurement error in this population [23, 29]. GRADE was not possible to apply due to MIC not provided (Table 6). For the THP, only one study investigated measurement error [23]. GRADE was not possible to apply as the minimal important change was not provided (Table 6).

Convergent validity

One study by Dugailly et al. [29] correlated the JPE test against disability questionnaire, pain duration, and pain intensity. All parts were rated as adequate in the RoB checklist and insufficient in the updated criteria for good measurement properties. The overall rating was insufficient, and the quality of evidence was high due to multiple studies with adequate ratings (no risk of bias) (Table 6).

Criterion validity

The criterion validity was reported only once by Dugailly et al. [29]. This study was rated as doubtful in the RoB checklist and indeterminate in the updated criteria for good measurement properties. The overall rating was indeterminate, and the quality of evidence was downgraded to very low due to risk of bias and imprecision (sample size < 50) (Table 6).

Discussion

This is the first systematic review to synthesise and appraise the measurement properties of cervical JPE in people with and without neck pain using the COSMIN checklist. Our search yielded 8 studies that included neck pain participants and 7 in which asymptomatic participants were included. Absolute and constant errors were reported in this review since they are recommended when assessing JPE [35]. The large range of testing procedures used in the studies reviewed highlight the lack of any consensus in the literature on how best to assess JPE. A key factor contributing to this may be the heterogeneity of neck pain participants recruited for the reviewed studies, each with different clinical features. Given these differences in testing procedures and the vast range in types of neck pain, it is difficult to draw any general conclusions on the gold standard for testing the measurement properties of cervical JPE.

Similar to other systematic reviews, the current systematic review highlighted several issues with the quality of the included studies [36, 37]. Most of the included studies in this review were rated as inadequate or doubtful in the RoB checklist with an overall quality of the evidence being low to very low, apart from the convergent validity of the constant JPE, which was high. This was due to a failure in adhering to COSMIN guidelines when carrying out investigations of measurement properties of outcome measures. For example, according to COSMIN, the time-interval should be long enough to prevent recall bias, and short enough to ensure that the patients have not been changed on the construct to be measured [13]. When assessing the RoB for reliability and measurement error, there are no guidelines for the time-interval between sessions, therefore, this section was rated as doubtful. Other issues highlighted were statistical tests used for validity investigations. COSMIN recommends Pearson’s or Spearman’s correlation for validity assessment, which the criterion validity in the constant JPE did not use. Therefore, some of the included studies were rated as indeterminate in the updated criteria for good measurement properties. A further limitation in the included studies was when the model of the ICC used for reliability assessment was not stated. When using the RoB checklist for reliability and measurement error [14], if a study used ICC and reported the model used, it should be rated as very good; if the study used ICC but failed to report the model, then it should be rated as inadequate. Three studies failed to report the ICC model used [21, 27, 29], thus, they were rated as inadequate in the RoB checklist. Reporting the ICC model is important because the model used and the type of coefficient will impact on the magnitude of the ICC [38]. Failure to report the ICC model will affect the study’s generalisability and interpretation of the results. Inclusion of a replicable measure of response stability will aid the interpretation of results and comparison between studies.

Another issue in the current review was the inconsistency of results for the criterion validity of absolute JPE in the asymptomatic population. This inconsistency was probably due to differences in statistical tests used for validity assessment and variations in testing protocols. For example, Wibault et al. [5] correlated the CROM device against a laser pointer after returning from right and left rotation using three trials per movement in their assessment. They used the ICC for their validity assessment, which is not recommended by COSMIN, and thus were rated as indeterminate in the updated criteria for good measurement properties. Nikkhoo et al. [32] correlated the US MOCAP against IMUs after returning from flexion, extension, and bilateral rotation using five trials per movement. This study was rated as sufficient in the updated criteria for good measurement properties. Therefore, it was not possible to draw an overall rating for this measurement property due to inconsistency of the results and it was rated as indeterminate. The convergent validity on the other hand was rated as high. This was due to no risk of bias in the included studies; however, it did not show sufficient results. Sample size was another issue that affected the overall rating of an outcome measure. When applying the modified GRADE approach, sample size should be ≥ 100. However, the total sample size of the inter-rater reliability of absolute JPE in the asymptomatic population was 62 participants; this led to downgrading the overall evidence to one level. Similarly, the criterion validity of constant JPE was downgraded to two levels due to sample size < 50. In addition, the wording around reliability studies was challenging as several studies did not report the word ‘’reliability’’ in the title of the study, affecting the quality of the study.

Furthermore, the current systematic review highlighted gaps in the literature when testing the measurement properties of the measure of cervical JPE. First, the testing position. Most of the included studies carried out their investigations in sitting. Only two studies carried out their investigation in sitting as well as standing [21, 24]. However, these two studies did not include any neck pain patients, and only asymptomatic participants were recruited. In addition, they reported only constant JPE, failing to report absolute JPE. A second gap was the lack of investigation of inter-rater reliability of constant JPE in people with neck pain. The third gap we uncovered was regarding the criterion validity of absolute JPE. Although this property was investigated twice, it was limited to right and left rotation. Lastly, the domain of responsiveness was not reported in our systematic review.

Methodological considerations

This is the first systematic review to summarise and appraise the evidence of measurement properties of the cervical JPE measure using COSMIN guidelines. Two raters carried out the study selection, data extraction, the risk of bias checklist, and the GRADE approach minimising bias, which is considered a strength of this systematic review. Additionally, we included studies that have reported absolute and constant errors, which is recommended when testing cervical proprioception [35]. Prospective registration with PROSPERO is another strength of this review. A potential limitation is that the principle of lowest rating counts when using the COSMIN risk of bias checklist, thus underestimating the overall quality of the study, and potentially downgrading the overall quality of the evidence.

Recommendations for future research

Additional research is clearly warranted to assess the measurement properties of the measure of JPE in people with and without neck pain. Another recommendation is to report both absolute and constant errors in future research. Also, assessing the measurement properties of the measure of JPE in standing in addition to sitting is recommended, as well as reporting absolute and constant error for both. Responsiveness of the measure of JPE was not investigated, which we recommend investigating in future research.

Conclusion

Conclusions about the measurement properties of the measure of cervical JPE were difficult to draw due to lack of consensus on testing procedures and tools used. Further high-quality research to overcome the risk of bias in the included studies is required. Studies are also required to investigate the responsiveness of this measure.

Availability of data and materials

All data generated or analysed during this study are included in this published article and its supplementary information files.

References

  1. Peng B, Yang L, Li Y, Liu T, Liu Y. Cervical proprioception impairment in neck pain-pathophysiology, clinical evaluation, and management: a narrative review. Pain Ther. 2021;10:143–64.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Qu N, Tian HC, De Martino E, Zhang B. Neck pain: do we know enough about the sensorimotor control system? Front Comput Neurosci. 2022;16(July):1–10.

    Google Scholar 

  3. Alahmari K, Reddy RS, Silvian P, Ahmad I, Nagaraj V, Mahtab M. Intra- and inter-rater reliability of neutral head position and target head position tests in patients with and without neck pain. Brazilian J Phys Ther. 2017;21(4):259–67.

    Article  Google Scholar 

  4. Revel M, Andre-Deshays C, Minguet M. Cervicocephalic kinesthetic sensibility in patients with cervical pain. Arch Phys Med Rehabil. 1991;72(5):288–91.

    CAS  PubMed  Google Scholar 

  5. Wibault J, Vaillant J, Vuillerme N, Dedering Å, Peolsson A. Using the cervical range of motion (CROM) device to assess head repositioning accuracy in individuals with cervical radiculopathy in comparison to neck- healthy individuals. Man Ther. 2013;18(5):403–9.

    Article  PubMed  Google Scholar 

  6. Reddy RS, Tedla JS, Dixit S, Abohashrh M. Cervical proprioception and its relationship with neck pain intensity in subjects with cervical spondylosis. BMC Musculoskelet Disord. 2019;20(1):1–7.

    Article  Google Scholar 

  7. Feipel V, Salvia P, Klein H, Rooze M. Head repositioning accuracy in patients with whiplash-associated disorders. Spine (Phila Pa 1976). 2006;31(2):51–8.

    Article  Google Scholar 

  8. Treleaven J. Dizziness, unsteadiness, visual disturbances, and sensorimotor control in traumatic neck pain. J Orthop Sports Phys Ther. 2017;47(7):492–502.

    Article  PubMed  Google Scholar 

  9. Treleaven J, Jull G, Sterling M. Dizziness and unsteadiness following whiplash injury: characteristic features and relationship with cervical joint position error. J Rehabil Med. 2003;35(1):36–43.

    Article  PubMed  Google Scholar 

  10. Mazaheri M, Abichandani D, Kingma I, Treleaven J, Falla D. A meta-analysis and systematic review of changes in joint position sense and static standing balance in patients with whiplashassociated disorder. PLoS One. 2021;16(4):e0249659.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Roach KE. Measurement of health outcomes: reliability, validity and responsiveness. J Prosthetics Orthot. 2006;18(6):P8-12.

    Article  Google Scholar 

  12. Michiels S, De Hertogh W, Truijen S, November D, Wuyts F, Van de Heyning P. The assessment of cervical sensory motor control: a systematic review focusing on measuring methods and their clinimetric characteristics. Gait Posture. 2013;38(1):1–7. https://0-doi-org.brum.beds.ac.uk/10.1016/j.gaitpost.2012.10.007.

    Article  PubMed  Google Scholar 

  13. Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27:1147–57.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Mokkink LB, Boers M, van der Vleuten CPM, Bouter LM, Alonso J, Patrick DL, et al. COSMIN risk of bias tool to assess the quality of studies on reliability or measurement error of outcome measurement instruments: a delphi study. BMC Med Res Methodol. 2020;20(1):1–18.

    Article  Google Scholar 

  15. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Rev Esp Nutr Humana y Diet. 2016;20(2):148–60.

    Google Scholar 

  16. Lidwine B Mokkink, Cecilia AC Prinsen, Donald L Patrick, Jordi Alonso, Lex M Bouter, Henrica CW de Vet, et al. COSMIN Study Design checklist for Patient-reported outcome measurement instruments. Dep Epidemiol Biostat Amsterdam Public Heal Res Inst Amsterdam Univ Med Centers, Locat VUmc. 2019;

  17. Cooke A, Smith D, Booth A. Beyond PICO: the SPIDER tool for qualitative evidence synthesis. Qual Health Res. 2012;

  18. Terwee CB, Mokkink LB, Knol DL, Ostelo RWJG, Bouter LM, De Vet HCW. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21(4):651–7.

    Article  PubMed  Google Scholar 

  19. Mokkink Cecilia AC Prinsen Donald L Patrick Jordi Alonso Lex M Bouter Henrica CW de Vet Caroline B Terwee Contact LB Mokkink LB. COSMIN manual for systematic reviews of PROMs COSMIN methodology for systematic reviews of Patient-Reported Outcome Measures (PROMs) user manual. 2018. Available from: www.cosmin.nl. Cited 2020 Feb 11.

  20. Moher D, Liberati A, Tetzlaff J, Altman DG, Altman D, Antes G, et al. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Medicine. 2009;8(12):e83138.

    Google Scholar 

  21. Artz NJ, Adams MA, Dolan P. Sensorimotor function of the cervical spine in healthy volunteers. Clin Biomech. 2015;30(3):260–8.

    Article  Google Scholar 

  22. Kristjansson E, Dall’alba P, Jull G. Cervicocephalic kinaesthesia: Reliability of a new test approach. Physiother Res Int. 2001;6(4):224–35.

    Article  CAS  PubMed  Google Scholar 

  23. Lee HY, Teng CC, Chai HM, Wang SF. Test-retest reliability of cervicocephalic kinesthetic sensibility in three cardinal planes. Man Ther. 2006;11(1):61–8.

    Article  PubMed  Google Scholar 

  24. Strimpakos N, Sakellari V, Gioftsos G, Kapreli E, Oldham J. Cervical joint position sense: an intra- and inter-examiner reliability study. Gait Posture. 2006;23(1):22–31.

    Article  PubMed  Google Scholar 

  25. Pinsault N, Fleury A, Virone G, Bouvier B, Vaillant J, Vuillerme N. Test-retest reliability of cervicocephalic relocation test to neutral head position. Physiother Theory Pract. 2008;24(5):380–91.

    Article  PubMed  Google Scholar 

  26. Kramer M, Honold M, Hohl K, Bockholt U, Rettig A, Elbel M, et al. Reliability of a new virtual reality test to measure cervicocephalic kinaesthesia. J Electromyogr Kinesiol. 2009;19(5):e353-61. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jelekin.2008.05.005.

    Article  PubMed  Google Scholar 

  27. Roren A, Mayoux-Benhamou MA, Fayad F, Poiraudeau S, Lantz D, Revel M. Comparison of visual and ultrasound based techniques to measure head repositioning in healthy and neck-pain subjects. Man Ther. 2009;14(3):270–7.

    Article  PubMed  Google Scholar 

  28. Chen X, Treleaven J. The effect of neck torsion on joint position error in subjects with chronic neck pain. Man Ther. 2013;18(6):562–7.

    Article  PubMed  Google Scholar 

  29. Dugailly PM, De Santis R, Tits M, Sobczak S, Vigne A, Feipel V. Head repositioning accuracy in patients with neck pain and asymptomatic subjects: concurrent validity, influence of motion speed, motion direction and target distance. Eur Spine J. 2015;24:2885–91.

    Article  PubMed  Google Scholar 

  30. Burke S, Lynch K, Moghul Z, Young C, Saviola K, Schenk R. The reliability of the cervical relocation test on people with and without a history of neck pain. J Man Manip Ther. 2016;24(4):210–4.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Gonçalves C, Silva AG. Reliability, measurement error and construct validity of four proprioceptive tests in patients with chronic idiopathic neck pain. Musculoskelet Sci Pract. 2019;43:103–9.

    Article  PubMed  Google Scholar 

  32. Nikkhoo M, Niu CC, Fu CJ, Lu ML, Chen WC, Lin YH, et al. Reliability and validity of a mobile device for assessing head control ability. J Med Biol Eng. 2021;41(1):45–52. https://0-doi-org.brum.beds.ac.uk/10.1007/s40846-020-00577-w.

    Article  Google Scholar 

  33. Cid MM, Calixtre LB, da Silva Grüninger BL, Sousa FS, Oliveira AB. Reliability of the joint position sense error test for women with neck pain and asymptomatic men and women. J Manipulative Physiol Ther. 2022;45(5):329–36.

    Article  PubMed  Google Scholar 

  34. Nikkhoo M, Niu CC, Fu CJ, Lu ML, Chen WC, Lin YH, et al. Reliability and validity of a mobile device for assessing head control ability. J Med Biol Eng. 2021;41(1):45–52.

    Article  Google Scholar 

  35. Hill R, Jensen P, Baardsen T, Kulvik K, Jull G, Treleaven J. Head repositioning accuracy to neutral: A comparative study of error calculation. Man Ther. 2009;14(1):110–4.

    Article  PubMed  Google Scholar 

  36. Althobaiti S, Rushton A, Aldahas A, Falla D, Heneghan NR. Practicable performance-based outcome measures of trunk muscle strength and their measurement properties: a systematic review and narrative synthesis. PLoS One. 2022;17(6 June):1–30. https://0-doi-org.brum.beds.ac.uk/10.1371/journal.pone.0270101.

    Article  CAS  Google Scholar 

  37. Middlebrook N, Rushton AB, Abichandani D, Kuithan P, Heneghan NR, Falla D. Measures of central sensitization and their measurement properties in musculoskeletal trauma: a systematic review. Eur J Pain (United Kingdom). 2021;25(1):71–87.

    Google Scholar 

  38. Harvey ND. A Simple Guide to Inter-rater, Intra-rater and Test-retest Reliability for Animal Behaviour Studies. OSF Prepr. 2021;1–13. Available from: https://osf.io/8stpy.

Download references

Acknowledgements

Not applicable

Funding

The authors declare that this research received no funding for this research.

Author information

Authors and Affiliations

Authors

Contributions

AA is a PhD student and DF is lead supervisor and JD, is co-supervisor. AA drafted the initial version of the manuscript with the supervisory team providing guidance on topic, methodology and analyses. SA was the second reviewer for stage 2 data extraction, risk of bias checklists, updated criteria for good measurement property and the GRADE approach. AA completed the analysis. All authors reviewed and commented on each draft. All authors have approved the final draft for submission.

Corresponding author

Correspondence to Deborah Falla.

Ethics declarations

Ethics approval and consent to participate

No research ethics were required for this systematic review due to no patient data being collected.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

AlDahas, A., Heneghan, N.R., Althobaiti, S. et al. Measurement properties of cervical joint position error in people with and without neck pain: a systematic review and narrative synthesis. BMC Musculoskelet Disord 25, 44 (2024). https://0-doi-org.brum.beds.ac.uk/10.1186/s12891-023-07111-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12891-023-07111-4

Keywords