Monday, March 12, 2012

Is there regional variation in the SF-36 scores of Canadian adults?

ABSTRACT

Background: Canadian normative data for the Medical Outcomes Study 36-item short form (SF-36) have recently been published. However, there is evidence from other countries to suggest that regional variation in health-related quality of life (HRQOL) may exist. We therefore examined the SF-36 data from nine Canadian centres for evidence of systematic differences.

Methods: Bayesian hierarchical modelling was used to compare the differences in the eight SF-36 domains and the two summary component scores within each of the age and gender strata across the nine sites.

Results: Five domains and the two summary component scores showed little clinically important variation. Other than a small number of exceptions, there was little overall evidence of HRQOL differences across most domains and across most sites.

Interpretation: Our finding of only a few small differences suggests that there is no need to develop region-specific Canadian normative data for the SF-36 health survey.

The Medical Outcomes Study 36-item short form (SF-36)1,2 health survey has proven to be useful for comparing the relative burden of different diseases, as well as the efficacy of treatment interventions on quality of life.3 Moreover, it is increasingly being used in clinical trials research, reflecting a shift away from research that had a more narrow focus on clinical indicators such as morbidity and mortality, to a broader assessment of patient functioning and well-being.4

The recent publication of Canadian normative data for the SF-36 health surveys has allowed researchers and health care professionals to compare SF-36 data they have collected to age- and/or gender-- appropriate norms. While this marks a substantial improvement over comparisons with US normative data,1 there is also evidence to suggest that there may be regional variation in SF-36 scores in some countries.6 Within Canada, where both an English and a French version of the SF-36(7) have been used, there may also be differences according to language of form completion.8

The initiation of data collection for the Canadian Multicentre Osteoporosis Study (Cantos) in 1996 provided the opportunity to develop the age- and sex-adjusted norms for Canadians.5 Since these data were collected at nine centres across Canada, they can also be used to assess differences between the nine cities and surrounding regions. This will provide an opportunity to validate and confirm the usefulness of the normative data within all regions of Canada, as well as address the question of whether those living in various parts of Canada have similar health-related quality of life.

METHODS

The Canadian Multicentre Osteoporosis Study (Cantos) is a prospective cohort study of 9,423 non-institutionalized randomly selected males and females aged 25 years and older. The sample is drawn from a 50-km radius of nine Canadian cities: Vancouver, Calgary, Saskatoon, Hamilton, Toronto, Kingston, Quebec City, Halifax and St. John's. Details of the study's purpose, methodology and sampling framework are presented elsewhere.5,9 Ethical approval for the study was obtained through the Review Boards of each participating centre, as well as at the coordinating centre in Montreal.

The SF-36 contains 36 items which, when scored, yield 8 domains, including physical functioning, role physical, role emotional, bodily pain, vitality, social functioning, mental health, and general health perceptions. A detailed description of these domains is available elsewhere.1,5 Summary scores for a Physical Component and a Mental Component can also be derived.2 All domains are scored on a scale from 0 to 100, with 100 representing the best possible health state.

The data were scored by means of the Medical Outcomes Trust scoring methodology.1,2 The data were age- and sex-- standardized to the Canadian population by weighting the total means based on the underlying population for each of the nine centres, using Statistics Canada data.10,11 The normative data5 (means, standard deviations, 95% confidence intervals, and percent at floor and ceiling) were generated for the entire sample, by gender, by age groups (10-year increments) and for each of the nine centres.

A preliminary and descriptive analysis of these data, without taking into account the age and sex stratifications, indicated that there were some differences between the centres in some, though not all, domains. However, given that the developers of the SF-36 consider a clinically and socially meaningful difference to be a minimum of five points,1 few of the differences were meaningful. Moreover, it is necessary to examine these differences within each of the two gender and six age strata, for each of the eight domains and two summary scores, for each of the nine centres.

We used a Bayesian hierarchical model to evaluate regional differences. Results are reported as posterior mean differences with 95% credible intervals (Bayesian analogues to frequentist confidence intervals). In contrast with other methods, these models also allow for the direct calculation of the probability of a clinically important difference, which we also report. A separate model was created for each combination of sex, age group and SF-36 domain, resulting in a total of 2 x 6 x 10 = 120 models. Each of these hierarchical models consists of three stages, described in detail in Appendix A.

RESULTS

Data were collected between January of 1996 and September of 1997. The entire sample consisted of 9,423 participants, with a mean age of 62.1 years and a standard deviation (SD) of 13.4 years. There were 2,884 men (mean age 59.9, SD 14.5 years, range 25-97 years), and 6,539 women (mean age 63.0, SD 12.8 years, range 25-103 years). Table I indicates that each centre was well represented, and the age and gender distributions were similar across the centres.

Table II contains the relative ranking attained by each of the centres on each of the domains of the SF-36, with age and sex combined within each region. This descriptive analysis was undertaken prior to completing the Bayesian hierarchical modelling. The highest (best) score is assigned a ranking of one, while the lowest (poorest) score is assigned a ranking of nine. For example, on the physical functioning domain, Toronto attained the highest score, while Hamilton had the poorest. Wide variation is apparent across all of the centres on all domains and summary scores of the SF-36.

The magnitude of the maximum difference between centres (e.g., the difference between the centre scoring the highest and the centre scoring the lowest) is close to or below the five points identified as clinically and socially relevant by the developers.1,2 The exceptions are the role physical and role emotional domains, where there was a difference of 11.1 points between the highest (St. John's in both cases) and the lowest (Hamilton and Vancouver, respectively) scoring centres prior to assessing the age and gender stratifications. While these data are interesting, they must be treated with caution because they do not take into account the age and gender stratifications. As a result, effects in one direction in one age and gender group may cancel opposite effects in other strata. However, even when examining the maximum between-site differences within all age and gender categories within each domain, there were still only three domains (role physical at 38.4, role emotional at 19.0 and vitality at 14.7) in which the largest between-centre comparison exceeded five points (Table II, last row).

The hierarchical modelling indicates that there are very few clinically meaningful between-centre differences, when comparing those within the same age and gender stratification, in the domains of physical functioning, bodily pain, general health perception, social functioning, mental health, and the physical and mental component summary scores. Table III presents data (by domain, age group and centre) in which meaningful differences were found. Two domains, vitality and role emotional, showed some potentially meaningful between-centre differences in females aged 75+. For vitality, the 75+ females from Quebec City scored higher than their counterparts, with mean differences ranging from 8.5 (as compared to Vancouver) to 14.7 (Kingston). For role emotional, the 75+ females from St. John's scored somewhat higher than their counterparts in other centres, with mean differences ranging from 7.4 (Saskatoon) to 19.0 (Calgary). Females 75+ from Calgary scored somewhat lower than their peers, with mean differences ranging from -2.7 (Hamilton) to -19.0 (St. John's).

For the final domain, role physical, there were some between-centre differences for males aged 75+. Those from St. John's and to a lesser extent those from Quebec City scored higher than their counterparts at other sites. For St. John's, the differences ranged from 11.7 (Quebec City) to 29.7 (Calgary), and for Quebec City, the differences ranged from 5.0 (Kingston) to 18.0 (Calgary). St. John's women in the age groups 55-64 years and 65-74 years scored higher on the role physical domain than the other sites. For the age group 55-64 years, differences ranged from 9.5 (Saskatoon) to 15.0 (Hamilton), while for the age group 65-74 years, they ranged from 7.8 (Toronto) to 17.4 (Hamilton). For the age group of 75+, women from St. John's and from Quebec City scored somewhat higher than the other centres. For St. John's, differences ranged from 12.0 (Quebec City) to 38.4 (Calgary), while for Quebec City, the differences ranged from 11.3 (Kingston) to 26.3 (Calgary). It should be noted, however, that where sizeable between-site differences are noted, the credible regions are quite wide.

DISCUSSION

The question of quality-of-life variation between Canadian cities and the surrounding regions needs to be addressed to identify whether regional normative data need to be developed, or whether the Canadian normative data5 are valid for use across Canada. While the centres can be ranked in terms of their scores, Table II indicates that there is wide variation across the domains and the summary scores in terms of the relative ranking of each centre. Looking across the many comparisons made, there were very few differences that reached clinical importance, so that overall there appears to be no strong need for region-specific norms over most domains.

Given that the hierarchical modelling examined between-site differences on the basis of nine sites, six age groups and two gender groups for eight SF-36 domains and two summary scores, the absence of between-centre variation other than that identified in Table III is noteworthy. The data therefore suggest that the Canadian SF-36 normative data already published5 can be used for most comparative purposes. Since many comparisons were done, some differences may have arisen due to chance variations alone. In general, hierarchical modelling reduces the probability of such chance findings, by borrowing strength from all regions to estimate the domain means from each individual region. Nevertheless, chance remains a possible explanation for some of the differences reported in Table III.

One limitation of our study is that although the Cantos participants were randomly selected, not everyone invited decided to participate. Therefore, the results apply only to those who participated (or would have participated had they been invited), who may differ from the Canadians who did not (or would not) participate. It is therefore possible that the regions do vary but we did not find this because only certain people were interested in participating. Our data also do not allow us to fully investigate rural regions, since each of our study centres was based in an urban area. Although the 50-kin region around each urban centre included surrounding rural areas, it remains possible that differences between rural regions in Canada exist that are not captured by our data.

In conclusion, our finding of only a few small differences within a few age groups within three centres suggests that there is no need to develop region-specific Canadian normative data for the SF-36 health survey. However, the few differences we did find should be kept in mind when comparing role physical scores of women aged 55 years and older in St. John's to Canadian normative data, and for three domains when assessing women aged 75 years and over in St. John's and Quebec City. For men, caution needs to be used when assessing the role physical scores of men aged 75 years and over in Quebec City and St. John's. Other than this minor variation, it appears that those living in various Canadian cities and their surrounding areas have similar health-related quality of life.

[Reference]

REFERENCES

[Reference]

1. Ware JE. SF-36 Health Survey Manual and Interpretation Guide. Boston, Massachusetts: The Health Institute, New England Medical Centre, 1993.

[Reference]

2. Ware JE, Kosinski M, Keller SD. SF-36 Physical and Mental Health Summary Scales: A User's Manual. Boston, Massachusetts: The Health Institute, New England Medical Centre, 1994.

3. Ware JE. The SF-36 Health Survey. In: Spilker B (Ed.), Quality of Life and Pharmaco-economics in Clinical Trials, Second Edition. Philadelphia: Lippincott-Raven Publishers, 1996;337-45.

4. Berzon RA. Understanding and using healthrelated quality of life instruments within clinical research studies. In: Staquet MJ, Hays RD, Fayers FM (Eds.), Quality of Life Assessment in Clinical Trials: Methods and Practice. Oxford: Oxford University Press, 1998;3-15.

5. Hopman WM, Towheed T, Anastassiades T, Tenenhouse A, Poliquin S, Berger C, et al. Canadian normative data for the SF-36 Health Survey. CMAJ2000;163:265-71.

6. Lyons RA, Fielder H, Littlepage NC. Measuring health status with the SF-36: The need for regional norms. J Public Health Med 1995;17:4650.

[Reference]

7. Wood-Dauphinee SW, Gauthier L, Gandek B, Magnan L, Pierre U. Readying a US measure of

[Reference]

health status, the SF-36, for use in Canada. Clinical and Investigative Medicine 1997;20:22438.

[Reference]

8. Wood-Dauphinee SW. The Canadian SF-36 health survey: Normative data add to its value. CA]J2000;163:283-84.

9. Kreiger N, Tenenhouse A, Joseph L, MacKenzie T, Poliquin S, Brown JP, et al. The Canadian Multicentre Osteoporosis Study (Cantos): Background, rationale, methods. Can J Aging 1999;18:376-87.

10. Statistics Canada provincial census data, 1991. Age, Sex and Marital Status: The Nation. Ottawa: Statistics Canada, 1993, Cat. No. 93310.

[Reference]

It. Statistics Canada provincial census data, 1991. Profile of Census Metropolitan Areas and Census Agglomeration, Part A. Ottawa: Statistics Canada, 1993, Cat. No. 93-337.

12. Raftery A, Lewis S. How many iterations in the Gibbs sampler? In: Bernardo JM, Berger JO, Dawid JO, Smith AFM (Eds.), Bayesian Statistics 4. Oxford: University Press, 1992;763-73.

13. Spiegelhalter DJ, Thomas A, Best NG, Gilks WR. BUGS: Bayesian Inference Using Gibbs Sampling, Version 0.5, (version ii), MRC Biostatistics Unit, Cambridge, UK, 1996.

Received: June 1, 2001

Accepted: December 6, 2001

[Reference]

RESUME

[Reference]

Contexte: Les donnees normatives canadiennes s'appliquant a la version abregee du questionnaire sur l'evolution medicale comportant 36 questions (SF-36) ont ete publiees recemment. Toutefois, les donnees recueillies dans d'autres pays suggerent qu'il peut exister des variations regionales au niveau de la qualite de vie reliee a l'etat de sante (HRQOL). Nous avons donc etudie les donnees du SF-36 provenant de neuf centres canadiens pour demontrer les differences systematiques.

Methodes : Un modele hierarchique bayesien a ete utilise pour comparer les differences entre les resultats des huit domaines du SF-36 et des deux composantes sommaires pour chaque strate d'age et de sexe, et ce, pour les neuf centres.

[Reference]

Resultats : Les resultats de cinq domaines et des deux composantes sommaires demontraient des differences peu significatives cliniquement. Outre de rares exceptions, il y avait peu d'evidence de variations du HRQOL entre la plupart des domaines et des centres.

Interpretation : Les resultats obtenus, ne demontrant que de legeres differences, suggerent qu'il n'est pas necessaire d'etablir des donnees normatives specifiques aux regions du Canada pour le questionnaire de sante SF-36.

[Author Affiliation]

Wilma M. Hopman, M1

Claudie Berger, MSc2

Lawrence Joseph, PhD3

Tanveer Towheed, MD4

Tassos Anastassiades, MD4

Alan Tenenhouse, MD5

Suzette Poliquin, BSc5

[Author Affiliation]

Jacques P Brown, MD6

Timothy M. Murray, MD

Jonathan D. Adachi, MD8

David A. Hanley, MD9

Emmanuel A. Papadimitropoulos, PhD10

CaMos Research Group11

[Author Affiliation]

1. MacKenzie Health Services Research Group, Queen's University, Kingston, ON

2. Cantos Methods Centre, McGill University, Montreal, QC

3. Department of Epidemiology and Biostatistics, McGill University

4. Division of Rheumatology, Queen's University

5. CaMos National Coordinating Centre, McGill University

6. Laval University, Ste-Foy, QC

7. University of Toronto, Toronto, ON

8. McMaster University, Hamilton, ON

9. University of Calgary, Calgary, AB

10. Eli Lilly, Toronto

11. See Appendix B for Complete List

[Author Affiliation]

Correspondence and reprint requests: Wilma Hopman, Director, MacKenzie Health Services Research Group, Department of Community Health and Epidemiology, 3rd Floor, Abramsky Hall, Queen's University, Kingston, ON K7L 3N6, E-mail: hopmanw@post.queensu.ca

Funding: The Canadian Multicentre Osteoporosis Study was funded by the Senior's Independence Research Program, through the National Health Research and Development Program of Health Canada (Project No. 6605-4003-OS), The Medical Research Council of Canada, MRC-PMAC Health Program, Merck Frosst Canada Inc., Eli Lilly Canada Inc., Procter and Gamble Pharmaceuticals Canada Inc., Dairy Farmers of Canada.

Acknowledgements: The authors would like to thank all of the participants in the Canadian Multicentre Osteoporosis Study. The research team and investigators in the Cantos Research Group are listed in Appendix 2.

No comments:

Post a Comment