The urine dipstick test useful to rule out infections. A meta-analysis of the accuracy
BMC Urology volume 4, Article number: 4 (2004)
Many studies have evaluated the accuracy of dipstick tests as rapid detectors of bacteriuria and urinary tract infections (UTI). The lack of an adequate explanation for the heterogeneity of the dipstick accuracy stimulates an ongoing debate. The objective of the present meta-analysis was to summarise the available evidence on the diagnostic accuracy of the urine dipstick test, taking into account various pre-defined potential sources of heterogeneity.
Literature from 1990 through 1999 was searched in Medline and Embase, and by reference tracking. Selected publications should be concerned with the diagnosis of bacteriuria or urinary tract infections, investigate the use of dipstick tests for nitrites and/or leukocyte esterase, and present empirical data. A checklist was used to assess methodological quality.
70 publications were included. Accuracy of nitrites was high in pregnant women (Diagnostic Odds Ratio = 165) and elderly people (DOR = 108). Positive predictive values were ≥80% in elderly and in family medicine. Accuracy of leukocyte-esterase was high in studies in urology patients (DOR = 276). Sensitivities were highest in family medicine (86%). Negative predictive values were high in both tests in all patient groups and settings, except for in family medicine. The combination of both test results showed an important increase in sensitivity. Accuracy was high in studies in urology patients (DOR = 52), in children (DOR = 46), and if clinical information was present (DOR = 28). Sensitivity was highest in studies carried out in family medicine (90%). Predictive values of combinations of positive test results were low in all other situations.
Overall, this review demonstrates that the urine dipstick test alone seems to be useful in all populations to exclude the presence of infection if the results of both nitrites and leukocyte-esterase are negative. Sensitivities of the combination of both tests vary between 68 and 88% in different patient groups, but positive test results have to be confirmed. Although the combination of positive test results is very sensitive in family practice, the usefulness of the dipstick test alone to rule in infection remains doubtful, even with high pre-test probabilities.
Testing for the presence of micro-organisms in the urinary tract, in order to diagnose asymptomatic bacteriuria or symptomatic urinary tract infections (UTI), is very common at all levels of health care. UTI are a common cause of fever in young children, often accompanied by subtle and non-specific clinical findings . In a small percentage of children this may lead to kidney scarring, and at a later age to hypertension, and even renal failure . In general practice, 2–3% of all consultations, and even 6% in the case of women, are due to symptoms suggesting UTI . The prevalence of asymptomatic bacteriuria is 4–7% in pregnancy, when it can progress to symptomatic UTI, postpartum UTI or pyelonephritis [4, 5]. Untreated bacteriuria during pregnancy has been shown to be associated with low birth-weight and premature delivery . Bacteriuria is more common with increasing age. Elderly non-institutionalised women and men show a prevalence rate of 6 – 30% and 11–13%, respectively, while in institutionalised elderly people the prevalence ranges from 25 to 50% .
Many tests are available for the diagnosis of bacteriuria or UTI. A (semi-) quantitative culture of a urine specimen is the only method that can provide detailed documentation of a bacterial urine infection. However, making a culture is costly, and takes at least 24 hours. An ideal test requires only limited technical expertise, is cheap and has a high accuracy, enabling a quick diagnosis in high-risk patients [2, 8]. One example is the dipstick test, where only nitrites and leukocyte esterase – and not proteins and blood – show fair accuracy, compared with a quantitative culture .
In the past 25 years, many studies have evaluated the accuracy of dipsticks tests as rapid detectors of bacteriuria and UTI in different populations and age groups. Several narrative reviews have been written [6, 9–12], and two meta-analyses [1, 13] have been performed. The meta-analysis by Hurlbut and Littenberg  did not report on sources of heterogeneity. The most recent meta-analysis  of 26 studies in children, showed major heterogeneity of diagnostic accuracy across studies, which could not be fully explained by differences in age, or by differences in the definition of the criterion standard. The lack of an adequate explanation for the heterogeneity of the dipstick accuracy stimulates an ongoing debate. Many elements and differences in the process of urine-collection and analysis, and in the selection of patients, may influence the presence of micro-organisms which can be detected by the dipstick, as well as the presence of substances that may give false results [10, 14–16]. The methodological quality of the studies might also be an important determinant of the reported accuracy .
The objective of the present meta-analysis was to summarise the available evidence on the diagnostic accuracy of the urine dipstick test, taking into account various pre-defined potential sources of heterogeneity.
Standardised searches were conducted in 1998 and 1999 in computerised databases (Medline and Embase), by reference tracking  and through personal contacts with experts in the field of research. In January 2000, the search was extended and updated by conducting an on-line Medline search at the PubMed website http://www.ncbi.nlm.nih.gov/pubmed Table 5 [see Additional file 1].
Two reviewers (WLJMD, JCY) selected the studies. The following inclusion criteria were applied: publications should concern the diagnosis of bacteriuria or urinary tract infections, investigate the use of dipstick tests for nitrites and/or leukocyte esterase, and present empirical data. Excluded were studies which focused only on sexually transmitted diseases, urethritis or schistosomiasis, studies with no accepted criterion standard (at least semi-quantitative or quantitative urine culture), studies which did not provide sufficient data for the reconstruction of a diagnostic two-by-two table, and studies which based test positivity on the combination of various other tests jointly with tests for nitrites and/or leukocyte esterase. Studies carried out before 1990 and studies in animals were also excluded. There were no language restrictions. When consensus was not reached, a third reviewer (NPD) was consulted to resolve disagreements.
Quality and applicability of studies
The checklist of the Cochrane Methods Working Group on Meta-analysis of Diagnostic and Screening Tests was used to assess the methodological quality of the selected studies  (available on request). Three reviewers (CJY, NPvD, WLJMD) independently assessed all selected publications. Disagreements were resolved in consensus meetings.
Internal validity criteria (IV) were scored as 'positive' (adequate methods), 'negative' (inadequate methods, potential bias) or 'no information'. External validity criteria (EV) were scored positive if sufficient information was provided to assess the generalisability of the findings. Sub-totals were calculated separately for internal validity (maximum 8) and external validity (maximum 15 for nitrites or 16 for leukocyte-esterase), and percentages of the maximum possible score were calculated. Estimates are presented with 95% confidence intervals (95% CI).
Potential sources of heterogeneity
For each publication detailed information was abstracted on: the colony count used to define UTI (cut-off used for the criterion standard), exclusion criteria, setting, level of care, symptomatic or asymptomatic bacteriuria, population sampled (children, general population, pregnant women, etc), age of the study population, urine-collection procedures, whether only first voided urine was collected, micro-organisms, procedures followed when urine was contaminated, duration of transport of the urine sample to the laboratory for culture, visual or automatic reading, and person who was reading the dipstick. In addition, information was collected on the year of study, disease prevalence at the setting, sample-size, country in which the study was performed, brand of dipstick and language of publication.
Data on sensitivity and specificity were derived from the original publications. If absolute data were not presented, published sensitivity and specificity data were used to reconstruct two-by-two tables. Sensitivity and specificity were pooled after natural logarithmic transformation. The average predictive values were calculated on the basis of geometric means of sensitivity and specificity using the weighted mean prevalence in the sub-group of studies at issue. The diagnostic odds ratio (DOR) of each individual study was calculated according to the following formula [20, 21]:
The DOR represents the ratio of the odds of a positive test result in the diseased group to the odds of a positive test result in the non-diseased group. A DOR of 1 means that the test has no discriminative power. When the DOR is more than one, the odds of a positive test result are higher in the diseased population. Pooling of the DOR was also performed after natural logarithmic transformation [ln(DOR)].
The statistical heterogeneity of sensitivity, specificity and the ln(DOR) across studies was tested by a χ2 test of independency with k-1 degrees of freedom (k = number of studies) . As the validity of weighting by the inverse of the variance of the DOR is still under debate for meta-analysis of diagnostic studies , only the results of fixed unweighted pooling are presented. Outliers were detected by means of the Galbraith plot . If a factor was significantly associated with outlying results (according to logistic regression), all studies with that factor were excluded from further analysis.
In case of negatively associated pairs of sensitivity and specificity, and a homogeneous ln(DOR), a regression line was fitted as a Summary ROC curve (SROC) [20, 25] in a scatter plot of the various studies included, with their sensitivity on the y-axis and (1 – specificity) on the x-axis. If sensitivity and specificity are negatively associated, it may be assumed that they represent a single DOR and that any variation between the pairs is caused by the use of different cut-off points for the test across studies. Dependency of the ln(DOR) on the cut-off point (S) can be tested using meta-regression analysis:
ln(DOR) = α + βS
If pairs of sensitivity and specificity still showed weak negative or no association, and if sensitivity or specificity was heterogeneous, sub-group analyses of the ln(DOR) were performed by means of ANOVA. All individual validity criteria, and all pre-defined potential sources of heterogeneity mentioned above, were used for sub-group analyses. Association with continuous variables was tested in univariate meta-regression analysis of the ln(DOR).
After sub-group analyses, all sources of heterogeneity associated with ln(DOR) up to p = 0.25 were selected for a multiple meta-regression analysis, to study the presence of independent factors associated with the ln(DOR).
Analyses were performed with SPSS 7.5 for Windows95 and with Meta-test . For a more detailed description of the model used in this analysis, reference is made to Midgette et al.  and Devillé et al .
The accuracy of the dipstick test for nitrite and leukocyte-esterase was studied both separately and in combination: positive results for either nitrites or leukocyte-esterase or for both.
The search strategy identified 220 publications, of which 70 [29–98] met the inclusion and exclusion criteria. Five selected publications [94–98] were only detected by the search in EMBASE (n = 1), by reference tracking (n = 1) or personal contacts (n = 3). See Table 6 [see Additional file 2] for the main characteristics of the publications included. 150 publications were excluded from meta-analysis for the following reasons: they did not report on the accuracy of the urine dipstick test (nitrites and/or leukocyte esterase) for the diagnosis of UTI or bacteriuria (n = 99), they were reviews (n = 22), they did not use culture as a criterion standard (n = 6), they did not base test positivity only on nitrites and/or leukocyte esterase (n = 6), or they did not include sufficient data to calculate the diagnostic two by two table (n = 17). The 70 selected publications represent studies from 18 countries in five continents, published in seven languages. Two selected publications present the results of two different studies. Therefore, 72 different studies were included, 17 of which studied nitrites only, and 2 studied leukocyte-esterase only. The other studies evaluated different combinations of both.
Quality and applicability of studies
The mean score for internal validity was 72% (95% CI 69 to 75). Nine publications used the culture on dipslide as a criterion standard. Nine publications (13%) concerned double-blind studies; only two were clearly hampered by verification bias. In 65% of the studies the dipstick test was evaluated with the help of clinical information.
The mean score for external validity was 69% (95% CI 65 to 73). Some outpatient departments provided care at primary level, resulting in 15 primary care studies (21%). 17 studies (24%) did not provide details about the general population studied. Sixty percent of the studies did not mention any exclusion criteria; 20% gave no information on the way in which urine was collected, and 86% did not state whether first-voided urine was collected. Information on mixed or contaminated cultures was not available in over 50% of the studies (details are available on request).
Nitrites (n = 58)
Sensitivity and specificity were poorly correlated (Spearman ρ = -0.377) and highly heterogeneous (Q = 776 and 9609, respectively, df 57). So was the ln(DOR) (Q = 145, df 57). On the Galbraith plot, 22 studies were outside the 95% bounds (+/-2Z) from the standardised mean ln(DOR). Univariate logistic regression revealed an association of outliers with lower categories of internal and external validity (internal ≤ 50%: OR = 15.9, 95% CI 1.1 to 233.2, external ≤ 75%: OR = 4.2, 95% CI 1.2 to 15.1). Therefore, studies in the lowest categories of internal or external validity (≤50%) were excluded from further sub-group analysis and meta-regression (n = 12, references: [36, 37, 44, 62, 71, 72, 77, 78, 82, 83, 88, 89]).
The ln(DOR) remained heterogeneous (Q = 125, df 45). Univariate sub-group analyses revealed statistically significant differences in the ln(DOR) between several sub-categories of internal validity (blinding and prospective versus retrospective data collection) and external validity (types of patient population and care setting) (Table 1).
The ln(DOR) was also univariately associated with the cut-off point of the dipstick used in the evaluations (β = -0.439, 95% CI -0.606 to -0.272), pre-test probability (β = -4.54, 95% CI -6.499 to -2.082) and year of publication (β = -0.197, 95% CI -0.197 to -0.013).
Further analysis within sub-groups showed the following results:
blinding: only in double-blind studies were sensitivity and specificity found to be highly negatively correlated (ρ = -0.647) with a homogeneous ln(DOR). In unblinded studies the ln(DOR) was associated with the cut-off point for a positive result of the dipstick, and in single blind studies it was associated with both the cut-off point and the general population;
patient populations: sensitivity and specificity were highly negatively correlated in studies involving general populations (ρ = -0.539), pregnant women (ρ = -0.559) and surgery patients (ρ = -1.00), resulting in a homogeneous ln(DOR). In multiple meta-regression, the ln(DOR) for studies in general populations was associated with the cut-off point of the dipstick, supra-pubic urine-collection and automatic or visual reading. For studies in pregnant women it was associated with the presence of clinical information, and for studies in children it was associated with the cut-off point of the dipstick only;
care setting: strong negative correlations existed between sensitivity and specificity in family practices (ρ = -0.714) and emergency departments (ρ = -0.400) with a homogeneous ln(DOR) in both sub-groups. In multivariate meta-regression analysis, the ln(DOR) was associated in family practices with the pre-test probability; in outpatient departments it was associated with the cut-off point of the dipstick, and in inpatient departments with the cut-off point, pre-test probability, automatic or visual reading, and the presence of clinical information.
Multiple meta-regression analysis of all studies revealed an independent association of the ln(DOR) with the cut-off point of the dipstick (β = -0.348, 95% CI -0.505 – -0.192), studies executed in pregnant women (β = 1.082, 95% CI 0.178 – 1.985), in general populations (β = -0.772, 95% CI -1.601 – 0.057) or in elderly people (β = 1.457, 95% CI 0.022 – -2.882) (adjusted R2 regression model: 0.55).
For details on sensitivity, specificity, odds ratios and predictive values, see Table 1. Post-test probabilities at different pre-test probabilities for different patient populations and care settings are shown in Table 4, and Figure 1 and 2.
Leukocyte-esterase (n = 42)
On the Galbraith plot 10 studies were outside the 95% bounds (+/-2Z) from the standardised mean ln(DOR). Univariate logistic regression revealed an association of outlier studies with lower categories of external validity (external ≤ 50%: OR = 32, 95% CI 2.3 to 447). Studies in the lowest category (≤50%) of internal validity (n = 1, reference: ) and external validity (n= 6, references: [37, 62, 71, 80, 83, 89]) were excluded from the analysis. Sensitivity and specificity were correlated after exclusion (Spearman ρ = -0.635), but remained heterogeneous (Q = 368 and 1799, df 34), as did the ln(DOR) (Q = 64, df 34).
Univariate sub-group analyses showed statistically significant differences in the ln(DOR) between sub-categories of external validity (Table 2): disease (UTI versus bacteriuria), type of patient population, care setting, method of urine-collection, reported exclusion criteria, and brand of dipstick. The ln(DOR) was not associated with the cut-off point for a positive leucocyte-esterase test. Further analysis of sub-groups showed that sensitivity and specificity were strongly negatively correlated in the non-urology studies (ρ = -0.798), as well as in the two urology studies (ρ = -1.00) resulting in a homogeneous ln(DOR) in the non-urology sub-group. Multiple meta-regression analysis in the non-urology studies showed an association of the ln(DOR) with the cut-off point of the dipstick, the disease and the family physician reading the test, but not with setting of care. At this level an interaction existed between disease and family physician reading the test (adjusted R2 regression model: 0.42). All other associations disappeared.
Nitrite and leucocyte-esterase: one or both positive (n = 39)
Eleven studies were outliers; low internal validity (n = 3, references: [29, 36, 82]) and supra-pubic urine-collection (n = 1, reference: ) were associated with outlying results: these studies were excluded. Sensitivity and specificity were weakly correlated (Spearman ρ = -0.227), and both remained heterogeneous. The ln(DOR) was homogeneous (Q = 41, df 34).
The ln(DOR) was univariately associated with the cut-off point of the criterion standard, the availability of clinical information, population groups and brand of dipstick (Table 3). Sensitivity and specificity were negatively correlated in the sub-group of the general population (ρ = -0.406), in children (ρ = -0.417), surgery patients (ρ = -1.0) and urology patients (ρ = -0.50). Sensitivity was homogeneous in pregnant women, surgery and urology, specificity was homogeneous in the later two groups. The ln(DOR) was homogeneous in all population groups.
Multivariate regression analysis retained the following independent factors: a cut-off point for the criterion standard of 1000 mcu/ml (1 study only, β = -1.823, 95% CI -3.629 – -0.017), studies in children (β = 1.176, 95% CI 0.477 – 1.875), studies in urology patients (β = 1.184, 95% CI 0.103 – 2.264) and the presence of clinical information (β = 0.893, 95% CI 0.259 – 1.527) (adjusted R2 regression model: 0.39). The model did not change when excluding the one study with the low criterion standard cut-off point (1000 mcu/ml).
Nitrite and leucocyte-esterase positive (n = 14)
Four studies were outliers, of which two had low external validity. As no factor was associated with the outliers, no studies were excluded. Sensitivity and specificity were negatively correlated (Spearman ρ = -0.275), and were both heterogeneous, as was the ln(DOR) (Q = 43, df 13).
The diagnostic odds ratio was associated with the cut-off point of the criterion standard and with population groups (Table 3). It was also associated with the cut-off point of the dipstick (β = -0.421, 95% CI -0.071 to -2.308), because of one study  that used a cut-off point of 1000 mcu/ml for the criterion standard. Sensitivity and specificity were negatively associated after exclusion of this study (Spearman ρ = -0.36), but remained heterogeneous, as did the ln(DOR). The ln(DOR) was only homogeneous in studies on children (Q = 9, df 5, Spearman ρ = -0.49).
In multivariate meta-regression, the independent factors were: studies in general populations, studies in surgery patients and one study with a criterion cut-off of 1000 mcu/ml. When excluding this last-mentioned study, studies in general populations (β = -2.312, 95% CI -3.950 to -0.675) and studies in surgery patients (β = -2.846, 95% CI -5.435 to -0.257) remained in the regression model (adjusted R2 regression model: 0.50).
Quality of the evidence
Before discussing the accuracy of the dipstick itself, one must take into account the amount and quality of the available evidence. The search was extensive, and identified a large number of studies published during the nineties. The quality of the research, as could be derived from the publications, was reasonable: 70% of the selected studies had an internal validity score which was approximately 70% of the maximum score. Only one in three publications had an external validity score of 75% or more. The importance of internal and external validity becomes clear from the fact that low scores were predominantly found among the outliers in this meta-analysis. A good description of the study population, using explicit selection criteria, is important: a major part of the existing heterogeneity in this meta-analysis could be explained by differences between study populations. The majority of the publications gave no information on important factors (such as the handling of contaminated samples or mixed cultures, or the micro-organisms cultured), which did not facilitate evaluation.
Overall, the sensitivity of the urine dipstick test for nitrites was low (45 – 60% in most situations) with higher levels of specificity (85 – 98%). The typically low pre-test probabilities resulted in high predictive values of negative test results. The test for nitrites had its highest accuracy in specific populations such as pregnant women, urology patients and elderly people. Only in the elderly did the test for nitrites reach a high sensitivity, while in pregnant women sensitivity was the lowest, confirming the results reported by Patterson . Although statistically not significant, the test for nitrites might perform better in asymptomatic patients and in patients who are not on antibiotics, confirming the results reported by Beer .
In multivariable analysis the accuracy of the dipstick for nitrites was affected only by the cut-off point for the nitrites and the population tested. The differences between the studies with regard to implicit cut-off points may be effected by human, instrumental or environmental factors.
Patient populations and care setting were highly correlated. Pre-test probabilities differed between some levels of care. While it is often expected that pre-test probability increases with each level of the health care system, in this study it was found to be higher in family physician or primary care studies, compared to hospital studies. Family physicians apparently use the dipstick test to diagnose an infection based on clinical signs and symptoms, while hospital-based physicians order a dipstick test to screen patients to exclude the presence of an infection.
Sensitivity of the urine dipstick test for leukocyte-esterase was, in general, slightly higher than for the dipstick test for nitrites (48 – 86%), while the specificity was slightly lower (17 – 93%). Generally, this resulted in a lower accuracy, compared to the test for nitrites, lower predictive values of positive test results and similar predictive values of negative test results.
The heterogeneity of the results of the urine dipstick test for leukocyte-esterase was only caused by factors related to external validity. Accuracy was higher for the detection of symptomatic UTI, compared to asymptomatic bacteriuria, as opposed to the test for nitrites. The leucocyte-esterase test had a much higher accuracy in urology patients, and consequently also in tertiary care, and when using a catheter for urine-collection. Sensitivity is highest in primary care, but requires further diagnostic work-up because of the high rates of false positives. In primary care negative results do not exclude the presence of infection.
Combination of nitrites and leukocyte-esterase
Combining the results of both parts of the dipstick tests with one or both showing a positive result increased sensitivity (68 to 88%), but had different effects on specificity. The considerable false positive rates weigh upon the predictive values of positive test results, as reported earlier . This resulted in different effects on accuracy, but increased the predictive values of a negative test result in all study populations, except studies in general populations. A negative dipstick test result excluded the presence of infection in most studies, contrary to the findings of Hurlbut et al. . Accuracy was highest in urology patients, surgery patients and in children. No differences were found between symptomatic UTI and asymptomatic bacteriuria, as was reported by Pelgrom . When both tests were positive specificity increased, also raising the predictive value of a positive result to an acceptable level in general populations.
Recommendations for practice
Care setting and patient population are the major sources of heterogeneity. Consequently, these factors should be taken into account for optimal test use in different clinical circumstances. In the general population a negative test result for one of both tests has a sufficient predictive value to exclude disease, and when both test results are positive there is sufficient evidence to rule in infection. Also in children, pregnant women, surgery or urology populations a negative result for both tests rules out infection, while a positive nitrite test still needs working-up, although the probability of infection increases considerably. In the elderly a negative test result for both tests rules out infection, while a positive nitrite test rules in infection. Post-test probabilities of positive leucocyte-esterase are low in all population subgroups.
A family physician should take these considerations in specific population groups into account, but in non-specific patients in a general practice a positive nitrite test rules in infection. On the other hand, if both tests are available and one of them is negative, confirmation remains necessary, because of the amount of false positive results. In other settings clinicians may exclude infection on the basis of a single negative test result.
For nitrites and leukocyte-esterase both separately or combined, the use of a more stringent definition of infection by increasing the cut-off point of the culture raised accuracy significantly. The lower cut-off point, at less than 1,000 mcu/ml, used mainly in supra-pubic urine-collection, resulted for nitrites in a higher accuracy through higher sensitivities. The present findings do not demonstrate systematically higher false positive rates with more stringent definitions of infection, as was observed by Gorelick . The lowest cut-off point had higher false positive rates, but not the cut-off point at 105 mcu/ml.
Research in this field can still be improved by implementing clear inclusion and exclusion criteria, and by double-blind study designs. Reporting on the distribution of micro-organisms, the way in which urine is collected, the time delay between collection and analysis, whether only first-voided urine was collected, the handling of mixed cultures and contaminated urine samples, and who was reading the test, may improve future systematic reviews of test accuracy. If sample-sizes are adequate, the publication of results for relevant sub-groups may also increase the quality of future diagnostic studies in this field. Although this meta-analysis covers the evidence published over the last decade, the validity of its results is also limited by the limited specifications given in the publications. As specific patient populations – a proxy-indicator for spectrum of disease – seem to be the major source of heterogeneity of accuracy, more details about patients in different clinical settings might increase the validity of a future meta-analysis.
Overall, this review demonstrates that the urine dipstick test alone seems to be useful in all populations to exclude the presence of infection if the results for nitrites or leukocyte-esterase are negative. Sensitivities of the combination whereby one or both test results are positive vary between 68 and 88% in different patient groups, but positive test results have to be confirmed or pre-test probabilities have to be high on the basis of the clinical history and/or a combination of other tests. In family practice, the combination of both tests with at least one positive result is very sensitive, but because of its low specificity remains the usefulness of the dipstick test alone doubtful, even with high pre-test probabilities.
Gorelick MH, Shaw KN: Screening tests for urinary tract infection in children: A meta-analysis. Pediatrics. 1999, 104: e54-
Brooks D: The management of suspected urinary tact infection in general practice. Br J Gen Pract. 1990, 40: 399-402.
Nazareth I, King M: Decision making by general practitioners in diagnosis and management of lower urinary tract symptoms in women. BMJ. 1993, 306: 1103-6.
Andriole VT, Patterson TF: Epidemiology, natural history and management of urinary tract infections in pregnancy. Med Clin N Am. 1991, 75: 359-73.
Patterson TF, Andriole VT: Detection, significance and therapy of bacteriuria in pregnancy. Update in the managed health care era. Inf Dis Clin N Am. 1997, 11: 593-608.
Romero R, Oyarzun E, Mazor M, Sirtori M, Hobbines JC, Bracken M: Meta-analysis of the relationship between asymptomatic bacteriuria and preterm delivery/low birth weight. Obstet Gynecol. 1989, 73: 576-
Wolfhagen MJHM, Hoepelman IM, Verhoef J: Urineweginfectie bij ouderen, wat is de betekenis? [Urinary tract infection in elderly people: what does it mean?]. Ned Tijdschr Geneeskd. 1990, 134: 470-2.
Cochat P, Dubourg L, Koch Nogueira P, Peretti N, Vial M: French: Urine analysis by dipstick. Arch Pédiatr. 1998, 5: 65-70. 10.1016/S0929-693X(97)83470-7.
Lohr JA: Use of routine urinalysis in making a presumptive diagnosis of urinary tract infection in children. Pediatr Infect Dis J. 1991, 10: 646-50.
The U.S. Preventive Services Task Force: Screening for asymptomatic bacteriuria, hematuria and proteinuria. Am Fam Physician. 1990, 42: 389-95.
Fihn SD: Lower urinary tract infection in women. Curr Opinion Obstet Gyneco. 1992, 4: 571-8.
Pelgrom J, de Maeseneer J: De dipstickmethode: vaarwel urinesediment? [The dipstick: goodbye microscopy?]. Huisarts Nu. 1995, 24: 8-11.
Hurlbut TA, Littenberg B, the Diagnostic Technology Assessment Consortium: The diagnostic accuracy of rapid dipstick tests to predict urinary tract infection. Am J Clin Path. 1991, 96: 582-8.
Beer JH, Vogt A, Neftel K, Cottagnoud P: False positive results for leucocytes in urine dipstick test with common antibiotics. BMJ. 1996, 313: 25-
Gallagher EJ, Schwartz E, Weinstein RS: Performance characteristics of urine dipsticks stored in open containers. Am J Emerg Med. 1990, 8: 121-3. 10.1016/0735-6757(90)90197-8.
Edwards A, Granier S: Packaging may lead to false positive results. BMJ. 1996, 313: 1010-
Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, van der Meulen JH, Bossuyt PM: Empirical evidence of design-related bias in studies of diagnostic tests. JAMA. 1999, 282: 1061-6. 10.1001/jama.282.11.1061.
van der Weijden T, Yzermans CJ, Dinant GJ, van Duijn NP, de Vet R, Buntinx F: Identifying relevant diagnostic studies in MEDLINE. The diagnostic value of the erythrocyte sedimentation rate (ESR) and dipstick as an example. Fam Pract. 1997, 14: 204-8. 10.1093/fampra/14.3.204.
Cochrane Methods Working Group on Screening and Diagnostic Tests: Recommended methods. Accessed 21 May 2004, [http://www.cochrane.org/cochrane/sadt.htm]
Moses LE, Shapiro D, Littenberg B: Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med. 1993, 12: 1293-1316.
Irwig L, Macaskill P, Glasziou P, Fahey M: Meta-analytic methods for diagnostic accuracy. J Clin Epidemiol. 1995, 48: 119-30. 10.1016/0895-4356(94)00099-C.
Fleiss JL: Statistical methods for rates and proportions. 1973, New-York: Wiley
Irwig L, McAskill P, Glasziou P, Fahey M: Meta-analytic methods for diagnostic test accuracy. J Clin Epidemiol. 1995, 48: 119-30. 10.1016/0895-4356(94)00099-C.
Galbraith RF: A note on graphical presentation of estimated odds ratios from several clinical trials. Stat Med. 1988, 7: 889-94.
Rutter CM, Gatsonis CA: Regression methods for meta-analysis of diagnostic test data. Acad Radiol. 1995, 2: S48-S56.
Lau J: Meta-test version 0.6. 1997, New England Medical Center, Boston
Midgette AS, Stukel TA, Littenberg B: A meta-analytic method for summarising diagnostic test performances: Receiver-operating-characteristic-summary point estimates. Med Decis Making. 1993, 13: 253-7.
Devillé WL, Buntinx F, Bouter LM, Montori VM, De Vet HC, Van Der Windt D, Bezemer PD: Conducting systematic reviews of diagnostic studies: didactic guidelines. BMC Med Res Methodol. 2 (1): 9-
Semeniuk H, Church D: Evaluation of the leukocyte esterase and nitrite urine dipstick screening tests for detection of bacteriuria in women with suspected uncomplicated urinary tract infections. J Clin Microbiol. 1999, 37: 3051-2.
Waisman Y, Zerem E, Amir L, Mimouni M: The validity of the uriscreen test for early detection of urinary tract infection in children. Pediatrics. 1999, 104: e41-
Munyi ST, Macharia WM, Alwar AJ, Njeru EK: Screening for urinary tract infection in children with cancer. East Afr Med J. 1998, 75: 264-7.
Sharief N, Hameed M, Petts D: Use of rapid dipstick tests to exclude urinary tract infection in children. Br J Biomed Sci. 1998, 55: 242-6.
Shaw KN, McGowan KL, Gorelick MH, Schwartz JS: Screening for urinary tract infection in infants in the emergency department: which test is best?. Pediatrics. 1998, 101: e1-
Tincello DG, Richmond DH: Evaluation of reagent strips in detecting asymptomatic bacteriuria in early pregnancy: prospective case series. BMJ. 1998, 316: 435-7.
Zaman Z, Borremans A, Verhaegen J, Verbist L, Blanckaert N: Disappointing dipstick screening for urinary tract infection in hospital inpatients. J Clin Pathol. 1998, 51: 471-2.
Edwards A, van der Voort J, Newcombe R, Thayer H, Verrier Jones K: A urine analysis method suitable for children's nappies. J Clin Pathol. 1997, 50: 569-72.
Hoberman A, Wald ER: Urinary tract infections in young febrile children. Pediatr Infect Dis J. 1997, 16: 11-7. 10.1097/00006454-199701000-00004.
Rivierre P, Dauphin L, Lemonnier JY, Rea C, Chavanne D, Gauvain JB: Infection urinaire en court séjour gériatrique: intérêt de la bandelette urinaire. [Urinary infection in geriatric short stay: value of urinary strips]. Rev Med Interne. 1997, 18: 765-8. 10.1016/S0248-8663(97)89965-1.
Shaw KN, McGowan KL: Evaluation of a rapid screening filter test for urinary tract infection in children. Pediatr Infect Dis J. 1997, 16: 283-7. 10.1097/00006454-199703000-00006.
Hagay Z, Levy R, Miskin A, Milman D, Sharabi H, Insler V: Uriscreen, a rapid enzymatic urine screening test: useful predictor of significant bacteriuria in pregnancy. Obstet Gynecol. 1996, 87: 410-3. 10.1016/0029-7844(95)00451-3.
Jellheden B, Norrby RS, Sandberg T: Symptomatic urinary tract infection in women in primary health care. Bacteriological, clinical and diagnostic aspects in relation to host response to infection. Scand J Prim Health Care. 1996, 14: 122-8.
Leanos-Miranda A, Contreras-Hernandez I, Camacho R, Villagomez-Salcedo E, Cervantes-Gorayeb I: Rendimiento diagnóstico de algunas pruebas en orina en las infecciones de vías urinarias. [Diagnostic yield of various urine tests in urinary tract infections]. Rev Invest Clin. 1996, 48: 117-23.
Osterberg E, Aspevall O, Grillner L, Persson E: Young women with symptoms of urinary tract infection. Prevalence and diagnosis of chlamydial infection and evaluation of rapid screening of bacteriuria. Scand J Prim Health Care. 1996, 14: 43-9.
Zainal D, Baba A: The value of positive nitrites in screening asymptomatic bacteriuria amongst Malaysian school children. Southeast Asian J Trop Med Public Health. 1996, 27: 184-8.
Bailey BL: Urinalysis predictive of urine culture results. J Fam Prac. 1995, 40: 45-50.
Holland DJ, Bliss KJ, Allen CD, Gilbert GL: A comparison of chemical dipsticks read visually or by photometry in the routine screening of urine specimens in the clinical microbiology laboratory. Pathology. 1995, 27: 91-6.
Mimoz O, Bouchet E, Edouard A, Costa Y, Samii K: Limited usefulness of urinary dipsticks to screen out catheter-associated bacteriuria in ICU patients. Anaesth Intensive Care. 1995, 23: 706-7.
Monane M, Gurwitz JH, Lipsitz LA, Glynn RJ, Choodnovskiy I, Avorn J: Epidemiologic and diagnostic aspects of bacteriuria: a longitudinal study in older women. J Am Geriatr Soc. 1995, 43: 618-22.
Nunns D, Smith AR, Hosker G: Reagent strip testing urine for significant bacteriuria in a urodynamic clinic. Br J Urol. 1995, 76: 87-9.
Reed RP, Wegerhoff FO: Urinary tract infection in malnourished rural African children. Ann Trop Paediatr. 1995, 15: 21-6.
Winkens RA, Leffers P, Trienekens TA, Stobberingh EE: The validity of urine examination for urinary tract infections in daily practice. Fam Pract. 1995, 12: 290-3.
Carroll KC, Hale DC, Von Boerum DH, Reich GC, Hamilton LT, Matsen JM: Laboratory evaluation of urinary tract infections in an ambulatory clinic. Am J Clin Pathol. 1994, 101: 100-3.
Fowlis GA, Waters J, Williams G: The cost effectiveness of combined rapid tests (Multistix) in screening for urinary tract infections. J R Soc Med. 1994, 87: 681-2.
Gerber B, Schmidt H, Ohde A: Zur Diagnostik von Harnwegsinfektionen im Wochenbett. [Diagnosis of urinary tract infections in puerperium]. Geburtshilfe Frauenheilkd. 1994, 54: 524-8.
Hiraoka M, Hida Y, Hori C, Tuchida S, Kuroda M, Sudo M: Rapid dipstick test for diagnosis of urinary tract infection. Acta Paediatr Jpn. 1994, 36: 379-82.
Villanustre Ordóñez C, Buznego Sánchez R, Rodicio García M, Rodrigo Sáez E, Fernandez Seara MJ, Pavón Belinchón P, Castro-Cago M: Estudio comparativo de los métodos semicuantitativos (leucocituria, test de nitritios y Uricult) con el urocultivo para el diagnóstico de infección urinaria en el lactante. [Comparative study of semi-quantitative methods (microscopy of leucocytes, nitrites, and Uricult) with urine culture for the diagnosis of urine infection in infants]. Anales Españoles de Pediatria. 1994, 41: 325-8.
Ravichandran D, Daltrey I, Uglow M, Johnson CD: Urine testing for acute lower abdominal pain in adults. Br J Surg. 1994, 81: 1460-1.
Anderson JD, Chambers GK, Johnson HW: Application of a leukocyte and nitrite urine test strip to the management of children with neurogenic bladder. Diagn Microbiol Infect Dis. 1993, 17: 29-33. 10.1016/0732-8893(93)90066-G.
Bachman JW, Heise RH, Naessens JM, Timmerman MG: A study of various tests to detect asymptomatic urinary tract infections in an obstetric population. JAMA. 1993, 270: 1971-4. 10.1001/jama.270.16.1971.
Dalton MT, Comeau S, Rainnie B, Lambert K, Forward KR: A comparison of the API Uriscreen with the Vitek Urine Identification-3 and the leukocyte esterase or nitrite strip as a screening test for bacteriuria. Diagn Microbiol Infect Dis. 1993, 16: 93-7. 10.1016/0732-8893(93)90001-N.
Perula de Torres LA, de Borja Ranz Garijo F, Martinez de la Iglesia J, Blanco Negrede A, Acasuso Diaz G, Crespo Crespo A, Lechuga Varona MT, Seco Pinero MI: Validacíon de un método de diagnóstico rápido de infección urinaria en población escolar. [The validation of a rapid diagnostic method for urinary infection in the school-age population]. Rev Clin Esp. 1993, 192: 209-13.
Etherington IJ, James DK: Reagent strip testing of antenatal urine specimens for infection. Br J Obstet Gynaecol. 1993, 100: 806-8.
Fabre R, Baudet JM, Cavallo JD, Crenn Y, Meyran M: Évaluation de tests rapides de dépistage dans le diagnostic des infection urinaires. [Evaluation of rapid screening tests in the diagnosis of urinary infections]. Pathol Biol (Paris). 1993, 41: 923-6.
Liptak GS, Campbell J, Stewart R, Hulbert WC: Screening for urinary tract infection in children with neurogenic bladders. Am J Phys Med Rehabil. 1993, 72: 122-6.
Lohr JA, Portilla MG, Geuder TG, Dunn ML, Dudley SM: Making a presumptive diagnosis of urinary tract infection by using a urinalysis performed in an on-site laboratory. J Pediatr. 1993, 122: 22-5.
Nauschuetz WF, Harrison LS, Trevino SB, Becker GR, Benton J: Two rapid urine screens for detection of bacteriuria: an evaluation. Curr Microbiol. 1993, 26: 43-5.
Ibarra OJF, Vera GE, Garcia AJL: Utilidad de dos pruebas para el diagnóstico presuntivo rápido en infección de vías urinarias y embarazo. [Usefulness of two tests for rapid diagnosis of urinary infections in pregnancy]. Ginecol Obstet Mex. 1993, 61: 290-4.
Woodward MN, Griffiths DM: Use of dipsticks for routine analysis of urine from children with acute abdominal pain. BMJ. 1993, 306: 1512-
Blum RN, Wright RA: Detection of pyuria and bacteriuria in symptomatic ambulatory women. J Gen Intern Med. 1992, 7: 140-4.
Cooper J, Raeburn A, Hamilton-Miller JM, Brumfitt W: Nitrite test for bacteriuria detection. Br J Gen Pract. 1992, 42: 346-7.
Graninger W, Fleischmann D, Schneeweiss B, Aram L, Stockenhuber F: Rapid screening for bacteriuria in pregnancy. Infection. 1992, 20: 9-11.
Hellerstein S, Alon U, Warady BA: Urinary screening tests. Pediatr Infect Dis J. 1992, 11: 56-7.
Kumawaza J, Matsumoto T: The dipstick test in the diagnosis of UTI and the effect of pretreatment catheter exchange in catheter-associated UTI. Infection. 1992, 20: S157-9.
Lachs MS, Nachamkin I, Edelstein PH, Goldman J, Feinstein AR, Schwartz JS: Spectrum bias in the evaluation of diagnostic tests: lessons from the rapid dipstick test for urinary tract infection. Ann Intern Med. 1992, 117: 135-40.
Madsen OR, Faber M, Philipsen L, Frimodt-Moller N: Påvisning af bakteriuri hos ældre indlagte patienter. Sammenligning af kombineret leukocyt- og nitritstix med dyrkning. [Demonstration of bacteriuria in elderly hospitalised patients. Comparison between leukocyte and nitrite strips and culture]. Ugeskr Laeger. 1992, 154: 3682-6.
Michie JR, Thakker B, Bowman A, McCartney AC: Evaluation of enzyme linked immunosorbent assay for screening urinary tract infection in elderly people. J Clin Pathol. 1992, 45: 42-5.
Mills SJ, Ford M, Gould FK, Burton S, Neal DE: Screening for bacteriuria in urological patients using reagent strips. Br J Urol. 1992, 70: 314-7.
Takagi S, Arakawa S, Matsumoto O, Kamidono S, Terasoma K, Mita T: Japanese: Usefulness of dipstick test for determining leukocytes and bacteria in urine. Hinyokika Kiyo. 1992, 38: 31-6.
Evans PJ, Leaker BR, McNabb WR, Lewis RR: Accuracy of reagent strip testing for urinary tract infection in the elderly. J R Soc Med. 1991, 84: 598-9.
Fulcher RA, Maisey SP: Evaluation of dipstick tests and reflectance meter for screening for bacteriuria in elderly patients. Br J Clin Pract. 1991, 45: 245-6.
Lejeune B, Baron R, Guillois B, Mayeux D: Evaluation of a screening test for detecting urinary tract infection in newborns and infants. J Clin Pathol. 1991, 44: 1029-30.
Shaw KN, Hexter D, McGowan KL, Schwartz JS: Clinical evaluation of a rapid screening test for urinary tract infections in children. J Pediatr. 1991, 118: 733-6.
Weinberg AG, Gan VN: Urine screen for bacteriuria in symptomatic pediatric outpatients. Pediatr Infect Dis J. 1991, 10: 651-4.
Berger SA, Bogokowsky B, Block C: Rapid screening of urine for bacteria and cells by using a catalase reagent. J Clin Microbiol. 1990, 28: 1066-7.
Ditchburn RK, Ditchburn JS: A study of microscopical and chemical tests for the rapid diagnosis of urinary tract infections in general practice. Br J Gen Pract. 1990, 40: 406-8.
Goldsmith BM, Campos JM: Comparison of urine dipstick, microscopy, and culture for the detection of bacteriuria in children. Clin Pediatr (Phila). 1990, 29: 214-8.
Harlass FF, Duff P, Herd M: The evaluation of urine pH in screening for asymptomatic bacteriuria in pregnancy. Mil Med. 1990, 155: 49-51.
Hiscoke C, Yoxall H, Greig D, Lightfoot NF: Validation of a method for the rapid diagnosis of urinary tract infection suitable for use in general practice. Br J Gen Pract. 1990, 40: 403-5.
Iitaka K, Sakai T, Oyama K, Izawa T, Igarashi S: Screening for bacteriuria in Japanese school children. Acta Paediatr Jpn. 1990, 32: 690-5.
Lévy M, Tournot F, Ledesert B, Muller C, Carbon C, Yeni P: Evaluation du dépistage de l'infection urinaire par la technique de la bandelette réactive. Chez les malades hospitalisés. [Evaluation of the reagent strip method to detect urinary tract infections. In hospital patients]. Presse Med. 1990, 19: 1359-63.
Lorentzon S, Hovelius B, Miorner H, Tendler M, Aberg A: The diagnosis of bacteriuria during pregnancy. Scand J Prim Health Care. 1990, 8: 81-3.
McGlone R, Lambert M, Clancy M, Hawkey PM: Use of Ames SG10 Urine Dipstick for diagnosis of abdominal pain in the accident and emergency department. Arch Emerg Med. 1990, 7: 42-7.
Pallares J, Casas J, Guarga A, Marquet R, Solans P, Muxi C, Ibars I, Grifell E: Evaluación de diferentes métodos de diagnóstico rápido en la detección de bacteriuria asintomática en la gestante. [The evaluation of different methods for rapid diagnosis in the detection of asymptomatic bacteriuria in pregnant women]. Aten Primaria. 1990, 7: 547-50.
Tuel SM, Meythaler JM, Cross LL, McLaughlin S: Cost-effective screening by nursing staff for urinary tract infection in the spinal cord injured patient. Am J Phys Med Rehabil. 1990, 69: 128-31.
García C, Gonzáles J, Arruebarrena D, Urbieta MA, Emparanza J, Arriola M, Aurtenetxe A, Mingo T, Areses R: Utilidad de la tira reactiva de orina en una consulta de nefrología pediátrica: depistaje de la bacteriuria. [Utility of the urine dipstick at a pediatric consultation of nephrology: screening of bacteriuria]. Nefrologia. 1997, 17: 250-6.
Timmermans AE, Walter AEGM, van Duijn NP, Timmerman CP: De diagnostische waarde van urineonderzoek in de huisartspraktijk. [Diagnostic accuracy of urine examination in family practice]. Huisarts Wet. 1996, 39: 165-8.
Sloos JH, Vreede RW, Floor M, Adam A: Urineweginfekties in de huisartspraktijk: diagnostiek, verwekkers en gevoeligheid voor antibiotica. [Urinary tract infections in family practice: diagnosis, organisms and sensitivity to antibiotics]. Med J Delft. 1995, 182-6.
Christiaens TCM, De Meyere M, Derese A: Disappointing specificity of the leukocyte-esterase-test for the diagnosis of urinary tract infection in general practice. Eur J Gen Pract. 1998, 4: 144-7.
Oxman AD, Guyatt GH: A consumer's guide to sub-group analyses. Ann Int Med. 1992, 116: 78-84.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2490/4/4/prepub
We thank Ms. Mika van der Leden for her translation of the Japanese publications.
WLJMD, JCY and NPvD conducted the searches, read the papers, extracted the data and analysed the methodological quality of the papers. Statistical analysis was done by WLJMD, and checked by PDB, DAWMvdW and LMB. All authors contributed equally to writing and reviewing the paper.
Electronic supplementary material
Table 5: PubMed search strategy for literature concerning the diagnosis of bacteriuria or urinary tract infections by measuring nitrites and/or leukocyte esterase with a dipstick, 1990 – 1999 (DOC 20 KB)
Additional File 6: Studies (n = 72)* included in the meta-analysis of the urine dipstick for diagnosing bacteriuria or urinary tract infections by measuring nitrites and/or leukocyte esterase. (DOC 28 KB)
About this article
Cite this article
Devillé, W.L., Yzermans, J.C., van Duijn, N.P. et al. The urine dipstick test useful to rule out infections. A meta-analysis of the accuracy. BMC Urol 4, 4 (2004). https://doi.org/10.1186/1471-2490-4-4
- Urinary Tract Infection
- Positive Test Result
- Diagnostic Odds Ratio