The efficacy of arthroscopic surgery for the treatment of osteoarthritis of the knee is unknown.
Methods
We conducted a single-center, randomized, controlled trial of arthroscopic surgery in patients with moderate-to-severe osteoarthritis of the knee. Patients were randomly assigned to surgical lavage and arthroscopic débridement together with optimized physical and medical therapy or to treatment with physical and medical therapy alone. The primary outcome was the total Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) score (range, 0 to 2400; higher scores indicate more severe symptoms) at 2 years of follow-up. Secondary outcomes included the Short Form-36 (SF-36) Physical Component Summary score (range, 0 to 100; higher scores indicate better quality of life).
Results
Of the 92 patients assigned to surgery, 6 did not undergo surgery. Of the 86 patients assigned to control treatment, all received only physical and medical therapy. After 2 years, the mean (±SD) WOMAC score for the surgery group was 874±624, as compared with 897±583 for the control group (absolute difference [surgery-group score minus control-group score], −23±605; 95% confidence interval [CI], −208 to 161; P=0.22 after adjustment for baseline score and grade of severity). The SF-36 Physical Component Summary scores were 37.0±11.4 and 37.2±10.6, respectively (absolute difference, −0.2±11.1; 95% CI, −3.6 to 3.2; P=0.93). Analyses of WOMAC scores at interim visits and other secondary outcomes also failed to show superiority of surgery.
Conclusions
Arthroscopic surgery for osteoarthritis of the knee provides no additional benefit to optimized physical and medical therapy. (ClinicalTrials.gov number, NCT00158431.)
Introduction
Osteoarthritis of the knee is a degenerative disease that causes joint pain, stiffness, and decreased function.1-3 Treatment is multidisciplinary and involves physical therapy, medication, and surgery. Arthroscopic surgery, in which an arthroscope is inserted into the knee joint, allows for lavage, a procedure that removes particulate material such as cartilage fragments and calcium crystals. It also allows for débridement, whereby articular surfaces and osteophytes can be surgically smoothed. The goal of this procedure is to reduce synovitis and eliminate mechanical interference with joint motion.
Although arthroscopic surgery has been widely used for osteoarthritis of the knee, scientific evidence to support its efficacy is lacking.4 No benefit of surgery was shown in a large-scale, randomized, controlled trial reported in the literature.5 However, the methods used in that study have been questioned,6-11 and the authors' conclusion that arthroscopic surgery is ineffective for the treatment of moderate-to-severe osteoarthritis of the knee has not been generally accepted.12-14 Accordingly, the procedure remains widely used.15 We conducted a randomized, controlled trial to compare optimized physical and medical therapy alone with arthroscopic treatment in addition to optimized physical and medical therapy.
Methods
Patients
We conducted the trial between January 1999 and August 2007 at the Fowler Kennedy Sport Medicine Clinic, University of Western Ontario, London, Ontario, Canada. The investigators who assessed outcomes were unaware of treatment assignments. The protocol was approved by the institutional review board of the University of Western Ontario. All patients gave written informed consent.
Eligible patients were 18 years of age or older with idiopathic or secondary osteoarthritis of the knee16,17 with grade 2, 3, or 4 radiographic severity, as defined by the modified Kellgren–Lawrence classification.18-20 Patients were excluded if they had large meniscal tears (“bucket handle” tears), as detected by clinical examination21,22 or, in a minority of cases, by magnetic resonance imaging. Other exclusion criteria were inflammatory or postinfectious arthritis, previous arthroscopic treatment for knee osteoarthritis, more than 5 degrees of varus or valgus deformity, previous major knee trauma, Kellgren–Lawrence grade 4 osteoarthritis in two compartments (the medial or lateral compartments of the tibiofemoral joint or the patellofemoral compartment) in persons over 60 years of age, intraarticular corticosteroid injection within the previous 3 months, a major neurologic deficit, serious medical illness (life expectancy of less than 2 years or high intraoperative risk), and pregnancy. Patients who were unable to provide informed consent or who were deemed unlikely to comply with follow-up were also excluded.
Baseline Studies
Patients referred to any of seven orthopedic surgeons were assessed for eligibility. The trial coordinator and one of two surgeons independently reviewed the diagnosis of osteoarthritis. Disagreements regarding eligibility, degree of malalignment (i.e., degree of varus or valgus deformity), and the Kellgren–Lawrence grade were resolved by consensus. Baseline scores on the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC),23,24 the Short Form-36 (SF-36) Physical Component Summary,25 the McMaster–Toronto Arthritis Patient Preference Disability Questionnaire (MACTAR),26,27 and the Arthritis Self-Efficacy Scale (ASES)28 and standard-gamble29 utility scores were obtained. An orthopedic surgeon performed a detailed examination of the knee and documented the range of motion, the presence of an effusion, and the results of meniscal and stability tests.
Study Treatment
The patients were randomly assigned, with the use of a computer-generated schedule, to receive optimized physical and medical therapy alone (control group) or to receive both optimized physical and medical therapy and arthroscopic treatment. The randomization was stratified according to surgeon and disease severity (defined according to the Kellgren–Lawrence grade). To minimize the risk of predicting the treatment assignment of the next eligible patient, randomization was performed in permuted blocks of two or four with random variation of the blocking number. Both for patients assigned to surgery and for those assigned to control treatment, the date of treatment initiation was defined as the next available date of surgery.
Arthroscopic treatment was performed within 6 weeks after randomization with the patient under general anesthesia and with the use of a tourniquet and a thigh holder. The orthopedic surgeon evaluated the medial, lateral, and patellofemoral joint compartments, graded articular lesions according to the Outerbridge classification,30 irrigated the compartment with at least 1 liter of saline, and performed one or more of the following treatments: synovectomy; débridement; or excision of degenerative tears of the menisci, fragments of articular cartilage, or chondral flaps and osteophytes that prevented full extension. Abrasion or microfracture of chondral defects was not performed.
Optimized physical and medical therapy was initiated within 7 days after surgery and followed an identical program in both groups. Physical therapy was provided for 1 hour once a week for 12 consecutive weeks. The intervention was standardized and based on a review of the literature and a formal survey of university physical therapists.31 Information regarding a home exercise program that emphasized range-of-motion and strengthening exercises was provided to all patients. Individualized exercises were recommended on the basis of the severity of osteoarthritis, the patient's age, and the patient's specific needs. Instruction was also provided regarding activities of daily living, walking, use of stairs, and methods of treatment involving cold and heat. The patients were asked to perform the exercises twice daily and once on the day of a scheduled physical-therapy session. After the patients had completed 12 weeks of supervised activity, they continued an unsupervised exercise program at home for the duration of the study. The patients received additional education from attendance at local Arthritis Society workshops, from a copy of The Arthritis Helpbook32 that was provided to them, and from an educational videotape.
After undergoing randomization, the patients reviewed their medical treatment plans with an orthopedic surgeon, and the plans were optimized according to an evidence-based treatment algorithm based on published guidelines2 that recommended stepwise use of acetaminophen and nonsteroidal antiinflammatory drugs and intraarticular injection of hyaluronic acid. Hyaluronic acid and oral glucosamine were offered to the patients.
The patients were seen in the clinic 3, 6, 12, 18, and 24 months after the initiation of treatment. At each visit, the patients were evaluated by a nurse who was unaware of the treatment assignment. To preserve blinding, each patient wore a neoprene sleeve over the knee so that the study nurse could not identify a surgical scar. Scores on the WOMAC, MACTAR, SF-36, and ASES questionnaires and standard-gamble utility scores were obtained at each visit. Medical treatment was reviewed at each visit, and treatment options were modified according to the algorithm. Records were kept of medical therapies used.
Outcome Measures
The primary outcome was the WOMAC score at 2 years after the initiation of treatment. The WOMAC is a validated, self-administered instrument specifically designed to evaluate knee and hip osteoarthritis. The WOMAC has subscales for pain, stiffness, and physical function. Total scores can range from 0 to 2400; higher scores indicate increased pain, increased stiffness, and decreased physical function.23 Patients with moderate-to-severe osteoarthritis of the knee typically have a score of approximately 1000.24,33 A 20% improvement (typically, a decrease of about 200 points) in the total WOMAC score was considered clinically important.34-36 We also analyzed the three WOMAC subscales separately. The Physical Component Summary of the SF-36 was used to assess quality of life; scores can range from 0 to 100, with higher scores indicating better quality of life. The MACTAR and ASES are validated questionnaires that assess the symptoms and functional status of patients with osteoarthritis. MACTAR scores can range from 0 to 500; higher scores indicate greater disability. ASES scores can range from 10 to 100; higher scores indicate greater self-efficacy (i.e., perceived ability to cope with the consequences of arthritis). Health-related quality of life was assessed by the standard-gamble utility technique; scores can range from 0.0 (death) to 1.0 (perfect health).29
Statistical Analysis
Baseline characteristics were analyzed by descriptive statistics. For the primary analysis, the total WOMAC score at 2 years was compared between the two study groups with the use of analysis of covariance, with adjustment for the baseline score and disease severity (as measured by the Kellgren–Lawrence grade). A two-sided P value of 0.05 was considered to indicate statistical significance. Post hoc analyses of the total WOMAC score were also performed at 3, 6, 12, and 18 months. Missing values were not imputed. A similar approach was used to analyze the scores on the WOMAC subscales, the SF-36 Physical Component Summary, MACTAR, and ASES. Two prespecified subgroup analyses were performed. Patients with less severe disease (Kellgren–Lawrence grade 2) and patients reporting mechanical symptoms of catching, locking, or both catching and locking of the knee were hypothesized to derive greater benefit from surgery. The proportions of patients who received physical therapy and the average number of visits were compared with the use of the chi-square test and Student's t-test, respectively. The proportions of patients who received the various algorithm-specified medical therapies were compared with the use of the score-type test for simultaneous marginal homogeneity.37 Statistical comparisons were made with the use of SAS software, version 8.2.38 All analyses were performed according to the intention-to-treat principle.
On the basis of the results of a previous study,33 the standard deviation of the total WOMAC score was estimated to be 452. Assignment of 186 patients to treatment would provide 80% statistical power to detect a 200-point difference between the two treatment groups, with allowance for up to 15% of patients whose data could not be evaluated.
A single prespecified interim analysis was performed by an external data monitoring board when one third of the patients had completed 2 years of follow-up. This analysis was based on an O'Brien–Fleming boundary39 that specified a P value of 0.0007 to stop the trial because of superiority of treatment and a P value of 0.984 to stop the trial because of futility of treatment. Neither criterion was met.
Results
Figure 1. Figure 1. Enrollment of Patients and Completion of the Study.Table 1. Table 1. Baseline Characteristics of the Patients.
Figure 1 shows the disposition of the study participants. Between January 1999 and August 2005, 277 patients were assessed for eligibility. Fifty-eight patients were not eligible and 31 declined participation, resulting in a total of 188 patients who underwent randomization. Ninety-four patients were assigned to receive arthroscopic surgery and optimized physical and medical therapy and 94 to receive physical and medical therapy alone. Ten patients (two in the surgery group and eight in the control group) withdrew consent after randomization. Six patients assigned to surgery elected not to have the procedure; data from these patients were analyzed, according to the intention-to-treat principle, with data from the surgery group. Although the baseline characteristics of the groups were similar (Table 1), patients assigned to surgery had slightly higher total WOMAC scores.
Study Treatment
Table 2. Table 2. Use of Medical, Physical, and Surgical Therapy in the Patients.
The use of physical and medical therapy was similar in the two treatment groups. The majority of patients assigned to arthroscopic surgery underwent débridement of articular cartilage or meniscal lesions (Table 2).
Primary Outcome Measure
Figure 2. Figure 2. Total WOMAC Scores over Time According to Treatment Group.
Scores are shown for all patients (Panel A), those with less severe disease (Kellgren–Lawrence grade 2) (Panel B), and those with more severe disease (Kellgren–Lawrence grade 3 or 4) (Panel C). P values for the differences in 24-month scores were generated by analysis of covariance with adjustment for baseline score (Panels A, B, and C) and disease severity (Panel A). Error bars indicate the standard error. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) comprises three subscales (pain, stiffness, and physical function) composed of 24 questions. Total scores can range from 0 to 2400; higher scores indicate more severe disease.
Figure 2A shows the changes in the mean total WOMAC scores. At 3 months, scores in the surgery group had improved to a greater extent than those in the control group, but there were no significant differences between the groups at any visits thereafter. For patients assigned to surgery, the mean (±SD) total WOMAC score at 24 months was 874±624, as compared with 897±583 in the control group (absolute difference [surgery-group score minus control-group score], −23±605; 95% confidence interval [CI], −208 to 161; P=0.22 after adjustment for baseline score and radiographic grade of disease severity). A similar analysis performed in patients with less severe disease (Kellgren–Lawrence grade 2) at baseline also found no significant difference between the treatment groups (Figure 2B). Likewise, no benefit was conferred by surgery among the subgroup of patients with mechanical symptoms of catching or locking. A post hoc analysis of patients with more severe radiographic disease (Kellgren–Lawrence grade 3 or 4) also found no benefit of surgery (Figure 2C). We repeated these analyses on the basis of treatment actually received by including the data from the six patients assigned to surgery who elected not to undergo the procedure with the data from the patients in the control group. The results of these analyses were consistent with those of the intention-to-treat analyses.
Secondary Outcome Measures
Table 3. Table 3. Secondary Outcome Measures in the Patients.
No significant differences were observed between the treatment groups for any of the secondary outcome measures (Table 3). Specifically, patients assigned to arthroscopic surgery were no more likely to improve with respect to physical function, pain, or health-related quality of life than were those assigned to the control group. After 2 years, the SF-36 Physical Component Summary scores were 37.0±11.4 and 37.2±10.6, respectively (absolute difference, –0.2±11.1; 95% CI, –3.6 to 3.2; P=0.93).
Discussion
This study failed to show a benefit of arthroscopic surgery for the treatment of osteoarthritis of the knee. At the end of 2 years, patients assigned to arthroscopic treatment in addition to optimized physical and medical therapy had no greater improvement in WOMAC scores than did those who received only physical and medical therapy. Patients assigned to surgery did have a greater improvement in WOMAC scores within the first 3 months; however, this transient benefit was anticipated, since sham surgery is associated with a large, short-term placebo effect.5 WOMAC scores at all other time points did not significantly differ between the groups. In addition to WOMAC scores, a broad range of validated patient-reported outcomes was assessed at multiple time points. None of these instruments identified a benefit of arthroscopic treatment.
These negative results are in agreement with the previously published findings of Moseley and colleagues.5 That trial, which was conducted by a single surgeon at a Veterans Affairs hospital, was methodologically rigorous, since use of a sham-operation control allowed concealment of the treatment assignment. Nevertheless, several methodologic issues were raised that we believe are addressed in the current study. For example, the outcome measure in the study by Moseley et al., the Knee Specific Pain Scale,5 was not validated.9 We used the WOMAC score, a validated instrument that has been widely used in osteoarthritis research, as the primary measure of efficacy. Patients with substantial malalignment (varus or valgus deformity) and those with advanced disease, who might have a poorer response to surgical intervention,7,11 were included in the earlier trial; we excluded patients with more than 5 degrees of malalignment and stratified the randomization according to both surgeon and Kellgren–Lawrence grade of radiographic severity. Moseley and colleagues evaluated mostly older men who were treated in a Veterans Affairs Medical Center.6,7 In contrast, our study evaluated a more typical population of both men and women who were treated in a university hospital. Seven experienced arthroscopists performed lavage, débridement, or both at their discretion. Thus, we believe that our results are highly generalizable to usual orthopedic practice.
The results of the present trial, along with the results of Moseley et al., call into question the widespread use of arthroscopic treatment for osteoarthritis of the knee. Although some may argue that treatment is beneficial for patients with mechanical symptoms of catching or locking or those with early disease, prespecified subgroup analyses also failed to show efficacy in this population of patients. However, patients suspected of having large meniscal (“bucket handle”) tears, in whom arthroscopic surgery is considered an effective intervention, were excluded from this study.
Our study had limitations. Bias is possible because of the lack of a sham-surgery control. However, such bias would be expected to favor surgery and would not be expected to explain the present results. The observation that the two study groups had very similar exposures to drug treatment also suggests that bias, at least in the form of differential prescription of cointerventions, did not influence the results.
A second limitation is that only 68% of the patients who were evaluated for participation were deemed eligible and ultimately assigned to treatment. However, the majority of the excluded patients had substantial malalignment (38%) or declined participation (35%). The stringent inclusion and exclusion criteria ensured appropriate patient selection and optimized the chance of identifying a benefit of surgery. Thus, we do not believe that the participants in this trial were less likely to have a response to arthroscopic therapy than those treated in the community.
In summary, the results of this randomized, controlled trial show that arthroscopic surgery provides no additional benefit to optimized physical and medical therapy for the treatment of osteoarthritis of the knee.
Funding and Disclosures
Supported by the Canadian Institutes of Health Research.
No potential conflict of interest relevant to this article was reported.
This article (10.1056/NEJMoa0708333) was updated on November 10, 2009, at NEJM.org.
This article is dedicated to the memory of Dr. Alexandra Kirkley.
We thank Beverley Jasevicius for assistance in the preparation of the manuscript.
Author Affiliations
From the Fowler Kennedy Sport Medicine Clinic (A.K., T.B.B., R.B.L., J.R.G., K.R.W., S.H.G., L.M.D., P.J.F.); the Faculty of Health Sciences (T.B.B.); Robarts Clinical Trials, Robarts Research Institute (C.J.W., B.G.F., A.D.); and the Departments of Surgery (A.K., R.B.L., J.R.G., K.R.W., P.J.F.), Medicine (B.G.F., J.E.P.), and Epidemiology and Biostatistics (B.G.F., A.D.) — all at the University of Western Ontario; and St. Joseph's Health Care (J.E.P.) — in London, ON, Canada.
Address reprint requests to Dr. Litchfield at the Fowler Kennedy Sport Medicine Clinic, 3M Centre, University of Western Ontario, London, ON N6A 3K7, Canada, or at [email protected]; or to Dr. Feagan at Robarts Clinical Trials, Robarts Research Institute, University of Western Ontario, 100 Perth Dr., London, ON N6A 5K8, Canada, or at [email protected].
Dr. Kirkley is deceased.
Appendix
The following persons participated in the study: surgeons — A. Amendola, P.J. Fowler, J.R. Giffin, A. Kirkley, R.B. Litchfield, R. McCalden, K.R. Willits; members of the Data Coordinating Center, Robarts Clinical Trials, Robarts Research Institute — B. Sarazin, B. Bergman, H. Sun, G.Y. Zou, L. Liddiard, B. Annunziello, L. Smith, T. Clayton, M. Brine, N. Ng; data monitoring committee — D. Whalen, A. Donner.
References (39)
1. Felson DT. Osteoarthritis of the knee. N Engl J Med2006;354:841-848[Erratum, N Engl J Med 2006;354:2520.]
2. American College of Rheumatology Subcommittee on Osteoarthritis Guidelines. Recommendations for the medical management of osteoarthritis of the hip and knee: 2000 update. Arthritis Rheum2000;43:1905-1915
3. Hochberg MC, Altman RD, Brandt KD, et al. Guidelines for the medical management of osteoarthritis. II. Osteoarthritis of the knee. Arthritis Rheum1995;38:1541-1546
6. Gillespie WJ. Arthroscopic surgery was not effective for relieving pain or improving function in osteoarthritis of the knee. ACP J Club2003;138:49-49
14. Siparsky P, Ryzewicz M, Peterson B, Bartz R. Arthroscopic treatment of osteoarthritis of the knee: are there any evidence-based indications? Clin Orthop Relat Res2007;455:107-112
15. Hawker G, Guan J, Judge A, Dieppe P. Knee arthroscopy in England and Ontario: patterns of use, changes over time, and relationship to total knee joint replacement. J Bone Joint Surg Am (in press).
16. Altman R, Asch E, Bloch D, et al. Development of criteria for the classification and reporting of osteoarthritis: classification of osteoarthritis of the knee. Arthritis Rheum1986;29:1039-1049
19. Blackburn WD Jr, Bernreuter WK, Rominger M, Loose LL. Arthroscopic evaluation of knee articular cartilage: a comparison with plain radiographs and magnetic resonance imaging. J Rheumatol1994;21:675-679
21. Bhattacharyya T, Gale D, Dewire P, et al. The clinical importance of meniscal tears demonstrated by magnetic resonance imaging in ostoearthritis of the knee. J Bone Joint Surg Am2003;85:4-9
22. Ryzewicz M, Peterson B, Siparsky PN, Bartz RL. The diagnosis of meniscus tears: the role of MRI and clinical examination. Clin Orthop Relat Res2007;455:123-133
24. Bellamy N, Buchanan WW, Goldsmith CH, Campbell J, Stitt LW. Validation study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J Rheumatol1988;15:1833-1840
26. Verhoeven AC, Boers M, van der Liden S. Validity of the MACTAR questionnaire as a functional index in a rheumatoid arthritis clinical trial. J Rheumatol2000;27:2801-2809
27. Tugwell P, Bombardier C, Buchanan WW, Goldsmith CH, Grace E, Hanna B. The MACTAR Patient Preference Disability Questionnaire -- an individualized functional priority approach for assessing improvement in physical disability in clinical trials in rheumatoid arthritis. J Rheumatol1987;14:446-451
28. Lorig K, Chastain RL, Ung E, Shoor S, Holman HR. Development and evaluation of a scale to measure perceived self-efficacy in people with arthritis. Arthritis Rheum1989;32:37-44
31. van Baar ME, Assendelft WJ, Dekker J, Oostendorp RA, Bijlsma JW. Effectiveness of exercise therapy in patients with osteoarthritis of the hip or knee: a systematic review of randomized clinical trials. Arthritis Rheum1999;42:1361-1369
32. Lorig K, Fries JF. The arthritis helpbook: a tested self-management program for coping with arthritis and fibromyalgia. 5th ed. New York: Perseus Books, 2000.
34. Angst F, Aeschlimann A, Michel BA, Stucki G. Minimal clinically important rehabilitation effects in patients with osteoarthritis of the lower extremities. J Rheumatol2002;29:131-138
35. Ehrich EW, Davies GM, Watson DJ, Bolognese JA, Seidenberg BC, Bellamy N. Minimal perceptible clinical improvement with the Western Ontario and McMaster Universities Osteoarthritis Index questionnnaire and global assessments in patients with osteoarthritis. J Rheumatol2000;27:2635-2641
36. Goldsmith CH, Boers M, Bombardier C, Tugwell P. Criteria for clinically important changes in outcomes: development, scoring and evaluation of rheumatoid arthritis patient and trial profiles. J Rheumatol1993;20:561-565
37. Agresti A, Klingenberg B. Multivariate test comparing binomial probabilities, with application to safety studies for drugs. Appl Stat2005;54:691-706
Figure 1. Enrollment of Patients and Completion of the Study.
Figure 1. Enrollment of Patients and Completion of the Study.
Table 1. Baseline Characteristics of the Patients.
Table 1. Baseline Characteristics of the Patients.
Table 2. Use of Medical, Physical, and Surgical Therapy in the Patients.
Table 2. Use of Medical, Physical, and Surgical Therapy in the Patients.
Figure 2. Total WOMAC Scores over Time According to Treatment Group.
Figure 2. Total WOMAC Scores over Time According to Treatment Group.
Scores are shown for all patients (Panel A), those with less severe disease (Kellgren–Lawrence grade 2) (Panel B), and those with more severe disease (Kellgren–Lawrence grade 3 or 4) (Panel C). P values for the differences in 24-month scores were generated by analysis of covariance with adjustment for baseline score (Panels A, B, and C) and disease severity (Panel A). Error bars indicate the standard error. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) comprises three subscales (pain, stiffness, and physical function) composed of 24 questions. Total scores can range from 0 to 2400; higher scores indicate more severe disease.
Table 3. Secondary Outcome Measures in the Patients.
Table 3. Secondary Outcome Measures in the Patients.