Health Education Research Advance Access originally published online on October 23, 2006
Health Education Research 2006 21(Supplement 1):i98-i109; doi:10.1093/her/cyl087
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2006 The Author(s).
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Does participation in an intervention affect responses on self-report questionnaires?
1 Department of Pediatrics, Baylor College of Medicine, Houston, Texas 77030, USA
2 College of Education, University of California Berkeley, Berkeley, California 94720, USA
3 University of British Columbia, Department of Pediatrics, Centre for Community Child Health Research, L408-4480 Oak Street, Vancouver, BC V6H 3V4, Canada
* Correspondence to: T. Baranowski. E-mail: tbaranow{at}bcm.tmc.edu
| Abstract |
|---|
|
|
|---|
There has been some concern that participation in an intervention and exposure to a measurement instrument can change participants' interpretation of the items on a self-report questionnaire thereby distorting subsequent responses and biasing results. Differential item functioning (DIF) analysis using item response modeling can ascertain possible differences in item interpretation by testing for differences in item location between groups. The DIF for treatment versus control group differences at post-intervention assessment and the Time 1 and Time 2 differences in a control group were analyzed using data from a dietary change intervention trial for Boy Scouts. The measures included fruit and vegetable (FV) frequency of consumption, preferences and self-efficacy. Treatmentcontrol group DIF at post-intervention assessment was detected in a higher percentage of items for FV frequency than for preference or self-efficacy. Time 1 to Time 2 differences in items for the control group were detected in one item for each of the three scales. Further research will need to clarify whether the obtained DIFs reflected true changes in frequency, preference or self-efficacy or some reinterpretation of items by participants following an intervention or merely after previous exposure to the measure.
| Introduction |
|---|
|
|
|---|
Theory-based behavioral interventions attempt to change selected behaviors through mediating variables [13]. Common psychosocial mediating variables in behavior change programs include preference [4] and self-efficacy [5]. Participants commonly reveal both behavioral outcomes and psychosocial mediators through self-report methods in intervention evaluations.
Some investigators have been concerned that encountering questions in self-report measures changes the participants' understanding of the behaviors targeted by the intervention and the factors influencing participants to do (or not to do) the behaviors. Along these lines, a mere exposure effect was proposed that simply being exposed to a message could change attitudes and even behavior [6]. Research since then has tried to specify the conditions under which this might be true. Short (5 ms) exposures produced more stimuli recognition than longer exposures (500 ms) [7]. Repeated subliminal exposures (five) produced more positive effect responses to stimuli [8]. Asking about a person's intention to engage in a behavior increased the likelihood of performance of the behavior, including the purchase of big-ticket items such as automobiles [911]. The conversion from intent to behavior was more likely to occur when the participant did mental homework and could more easily represent or imagine doing that behavior [12]. This literature has not differentiated between whether respondents experienced changes in the underlying dimension or their position on the scale item (i.e. their interpretation of the items on the dimension). The latter could suggest that change in item response occurred without any necessary change in the underlying behavior or attitudinal variable. If this happens, treatment group participants' understanding of the items in behavioral or psychosocial assessment tools would be different without indicating a difference in behavior due to the intervention. One of the purposes of a comparison group in a research study is to control for measurement effects, such as the effect of simple exposure to an instrument, to differentiate them from the intervention effect in the treatment group. Heretofore, only whole instrument differences noted in the control group have been noted and discussed, however. It may be that some of the effects of mere exposure relate to specific items rather than the summary score of a whole instrument.
Differential item functioning (DIF) is a method that educational researchers have used to examine bias at the item level [13]. DIF analyses have been conducted most often to identify items that function differently among subgroups of participants. Specifically, these methods have been used to determine if participants who have similar scores on a given construct have the same interpretation of specific items. DIF analyses consist of focusing on the individual items themselves as a method for locating which items function differently between the two groups. DIF can determine if the properties of the items have changed as a result of participating in an intervention or from repeating the measurement. Item response modeling (IRM) [14] is one of the methods used to assess DIF. The properties of IRM, including stability of item and test parameters [14], make this a powerful DIF method.
To demonstrate the concept, this paper reports on two simple applications of a DIF analysis of self-reported behaviors (consumption of fruit and vegetables) and related psychosocial variables (preferences and self-efficacy). IRM DIF analysis was used to compare treatment and control groups after an intervention. The effect of taking the scales twice in the control group was also examined. Participant outcomes were reassessed after correcting for DIF.
| Methods |
|---|
|
|
|---|
Design
Two interventions with Boy Scouts were conducted: one to increase their fruit and vegetable (FV) consumption (5ADay Badge program) [15] and the other to increase their physical activity (Fit for Life Badge program) [16]. Forty-two troops were assigned randomly to intervention groups. Each treatment group served as the attention placebo control for the other, thereby enabling all troops to receive an intervention, and providing meaningful attention to each control group. Baseline data were collected at the troop meeting the week before the start of the intervention, the ninth week of the intervention (right after the badge awards ceremony) and 6 months after that. This report includes the baseline and immediate post-data relevant to the 5ADay Badge program only.
Intervention
The 5ADay Badge program was implemented across 9 weeks. Each week included an
30-min in-troop educational experience and an
20-min Internet experience. Paid badge troop leaders were trained in the delivery of the in-troop experience to ensure fidelity of delivery. The troop experience involved activities requiring personal contact: e.g. weekly recipe preparation (skill building) and taste testing (multiple exposures). FV recipes were provided in a Boy Scout cookbook and were previously tested to be enjoyed by many children/adolescents. Fidelity of the Internet program to desired theoretically specified procedures was assured by the computer programming. The Internet experience involved a weekly comic strip of Boy Scouts facing and overcoming challenges (modeling of problem solving) to eating FV in situations from those reported to be common problems among children (thereby enhancing self-efficacy), anticipatory problem solving, personal goal setting, self-monitoring, goal attainment assessment, problem solving if goal not attained, point assignment (reward) and functional knowledge games (e.g. what counts as a fruit?). Logging on to the Internet component of the intervention was monitored by the paid troop leader and Scouts were encouraged to login if they had not. Badges were earned if 70% of maximum points were attained. The 5ADay Badge program has been described in greater detail elsewhere [15]. Analyses were performed at the individual scout level for simplicity, and because the same Internet program and trained badge troop leaders delivered all instruction.
Measures
Three FV-related measures were employed: frequency of consumption, preference and self-efficacy. Both the frequency and preference measures combined both FV items. The frequency of FV consumption measure (FvFreq) had 38 items. Participants responded with the categorized frequency of consumption of each item in the last week, with seven possible response categories ranging from none to 15 or more servings last week. We expect that these items assess a latent variable of perceived frequency of consumption of fruit and vegetables, with some items more frequently consumed by most people, even low consumers, and other items only consumed by more frequent consumers. Within the context of dietary change intervention, frequency of consumption is the primary dependent variable, and a DIF analysis would verify whether change in consumption of items occurred along this latent continuum, or is more random. The FV preference measure (FvPref) had 39 items, with three possible response categories, ranging from I do not like this to I like this a lot. We expect that these items assess a latent variable of preferences for fruits and vegetables with some items more commonly preferred by most people, even those not generally liking FV, and other items only preferred by those who generally prefer FV. The self-efficacy for eating FV measure (FvSE) had 21 items, with five possible response categories, ranging from disagree a lot to agree a lot. The preference and self-efficacy scales were previously construct validated using classical test theory procedures [5]. Cronbach's alpha varied from 0.72 to 0.87 [5]. As determinants of FV consumption, FV preferences and self-efficacy should be mediators of behavior change (but those analyses will not be reported here). Demographic characteristics were assessed with commonly used questions.
Psychometric analyses
The same procedures were used to analyze all three scales in ConQuest software [17]. IRM procedures for analyzing a scale have been described in detail elsewhere [14] and are only briefly summarized here.
Models were selected from the one-parameter Rasch family of item response models in order to estimate locations for both participants and items on the same scale. Pre-intervention and post-intervention data were analyzed first with a rating scale model and then with a partial-credit model [18] to determine whether the categories of responses were the same or different across items. The partial-credit model fit better for all measures, so for each item in each measure, the ordered categories of response (e.g. five ordered responses from disagree a lot to agree a lot) were modeled to have different values. The units for the relationship between participant characteristic and the endorsability of an item are expressed in logits (log of the odds) because the IRM procedures examine the probability of a particular response to an item.
To detect DIF, we introduce an interaction term in the model to estimate the probability of interaction between the item and the grouping variable (either treatment group or time in this study). The result of each DIF analysis is a list of estimated locations
for the participants (each individual
and their standard errors), a list of estimated locations
for the items i and categories k (
ik) with their standard errors, and a list of estimated additions
(with their standard errors) to the item locations for each group g that correct for differences in the item functioning between the groups: responses in logits =
(
ik + gig). By examining the gig additions and comparing them to their standard errors (e), we can see whether the DIF is statistically significant (gig > 1.96e). Since the average of the DIF effects is zero by definition, the overall difference in the locations of the participants should remain unchanged when comparing an analysis modeling DIF with a similar analysis without modeling DIF. We can check this observation by comparing the main effect of the grouping variable in the different analyses.
To examine item fit within the partial credit models used for these scales, ConQuest compared the observed item estimates with estimates that might have occurred if the model fit exactly, using a weighted fit and t statistic [17]. If both these values were outside the designated ranges (0.751.34 for weighted fit and 2 to +2 for t values), the item was considered misfitting. Items or categories that did not fit the model well were examined for possible substantive reasons. All analyses constrained mean item estimates to equal zero, and designated that ConQuest calculate full standard errors for items and steps from one category to the next.
For the initial assessment of participant outcomes, performed without accounting for DIF, we obtained group effects from pre- and post-intervention analyses for each of the three scales. The difference between the treatment and control groups' pre- and post-intervention was calculated as the difference between the mean estimated locations of each group. A difference was considered statistically significant (at
= 0.05) if the distance from zero to either group mean was >1.96 times the standard error of the estimate. We hoped to see no significant difference between the treatment and control groups' pre-intervention since scout troops were randomly assigned to these groups. We also hoped to see a significant difference in the group effect post-intervention for all the three scales to reflect changes in responses attributable to the intervention. Significant differences in the group effects would indicate differential impact, a term used to indicate that the groups have different amounts of the underlying trait. We contrasted this term with DIF which indicates a difference in the way a particular item functions for members of different groups having the same amount of the underlying trait.
In the DIF analysis, we compared post-intervention data from the treatment and control groups to see which, if any, items exhibited significant DIF following the intervention. Data were coded so that if gig for the treatment group was a positive number, then that item was more easily endorsed by the control group; if gig for the treatment group was a negative number, then that item was more easily endorsed by the treatment group.
The effect size of DIF was considered small if two times the gig was <0.426 logits, and moderate if it was between 0.426 and 0.638 logits. DIF >0.638 logits was considered large [19, 20]. Main effect of the grouping variable was obtained through the analysis that accounted for DIF. The main effect of group at post-intervention was expected to show significance for all three scales to reflect changes in the treatment group following intervention.
In the second analysis, pre- and post-intervention data (Time 1 and Time 2, 9 weeks apart, but no nutrition intervention) from the control group only were each calibrated to see which, if any, items exhibited significant difference following mere double exposure to the scales. Since the control group takes the items twice in these comparisons, jeopardizing the assumption of item independence needed to perform a DIF analysis as above, we used a method of comparing estimates from different occasions described by Wright and Masters [18]. The separately calibrated item estimates from pre- and post-intervention were compared using their means and variances and z-scores to set the pre-intervention estimates on the same post-intervention scale:
'prei = zprei (
posti) + µposti; where zprei = (
prei µprei)/
prei, µprei and µposti are the pre- and post-intervention item calibration means,
prei and
posti are the pre- and post-intervention item calibration standard deviations and
prei and
'prei are the pre-adjusted and post-adjusted item estimates for pre-intervention items. Then the standard errors (e12) were calculated: e12 = (e12+e22)1/2. By comparing the differences with their standard errors, we judged whether they were statistically significant (
prei
posti > 1.96e12).
| Results |
|---|
|
|
|---|
Demographics
Data were available on up to 473 Boy Scouts (Table I). There were no statistically significant differences between treatment and control groups for age, height, weight, body mass index (BMI), ethnicity, educational attainment or family owning a home. The data set included complete data on 228 participants in the treatment group and 230 in the control group.
|
Three measures were analyzed. Mean raw scores on each of the three measures are shown at the bottom of Table II with the maximum value of each measure appearing next to the measure name. There were no responses for FvFreq in item categories 5 and 7 for item 37 (cabbage); so the categories for this item were collapsed to make five categories.
|
As a precursor to the DIF analyses, we assessed the fit statistics of the items for their respective scales. Only a small number of items did not meet the fit criteria for these scales. In the FV frequency scale, consumption of French fries was misfitting (weighted mean square = 1.37, t = 6.2) suggesting that responses to this item were more random than expected for this model. The item was retained because of its unique role as a putative vegetable, but one whose consumption should have been minimized on the basis of the intervention. In the self-efficacy scale, both eating fruit and eating vegetables served in school lunches were misfitting (weighted mean squares = 1.43 and 1.50, respectively, t = 8.0 and 9.3, respectively), again suggesting that responses to these items were more random than expected. These items were retained because of the prevalence of school lunches in the diets of these participants.
Tables IIIV![]()
show the item estimates resulting from the FvFreq, FvPref and FvSE measure DIF analyses, respectively. Only items with statistically significant DIF have their DIF estimates listed under the DIF column. Although the DIF gig for the other items are not listed, the reader should keep in mind that all items have either a positive or a negative DIF gig, whether or not the magnitude is statistically significant, and that the sum of the positive gig must equal the sum of the negative gig in this model. A net positive or negative sum would indicate differential impact rather than DIF. For FvFreq, eight items showed significant DIF when analyzing treatment and control groups' post-intervention; all DIF were small and they were evenly distributed with four items showing positive DIF (grapes, apples, French fries and cantaloupe: the control group consumed these more frequently) and four items showing negative DIF (orange juice, carrots, oranges and pears: the treatment group consumed these more frequently). For FvPref, DIF for French fries was statistically significant and moderate in size, with the control group reporting more liking for them than the treatment group post-intervention. The control group also expressed more liking for watermelon (small DIF), whereas the treatment group expressed more liking for spinach, strawberries and lettuce post-intervention. For FvSE at post-intervention, the treatment group was more certain that they could eat fruit with a fruit dip, eat vegetables with a dip (both small DIF) or make a favorite fruit, juice or vegetable recipe (moderate DIF) than the control group.
|
|
|
Table VI shows the main effect of treatment group both pre- and post-intervention without accounting for DIF. Higher mean logit values indicated more of the underlying trait. The differences between treatment and control group main effects were significant (P < 0.05) for both FvFreq and FvSE post-intervention. No other main effect differences were significant. Main effects of the grouping variables were also estimated in the DIF analyses: treatment versus control group post-intervention (bottom of Table VI). After controlling for DIF, group membership was significant as a main effect in the frequency and self-efficacy analyses post-intervention, but not the preference analyses.
|
Mere double exposure to the three scales by the control group at Times 1 and 2 resulted in a significant difference between item calibrations for only one item in each of the three scales. For FvFreq, strawberries were consumed more frequently at Time 1 (0.33 logits, e12 = 0.15, P < 0.05); for FvPref, cabbage was better liked at Time 2 (0.47 logits, e12 = 0.17, P < 0.01) and for FvSE, I am sure I can eat 3 or more servings of vegetables every day was more easily endorsed at Time 2 (0.40 logits, e12 = 0.18, P < 0.05).
| Discussion |
|---|
|
|
|---|
Analysis of individual items (Table III) showed that eight of the 38 FvFreq items (21%) demonstrated DIF between treatment and control groups' post-intervention. The number of items with statistically significant positive and negative gig was evenly balanced with four items each. The DIF effects were statistically significant, but deemed negligible according to DIF standards of effect size [19]. For treatment and control groups at post-assessment, three of the 21 FvSE items (14%), and five of the 39 FvPref items (13%) showed significant DIF. Thus, DIF was found in more items of the self-report of FV intake, compared with the number of items showing DIF in the psychosocial correlates of FV intake, but all the intake DIFs were considered small, and thereby negligible.
The treatmentcontrol group DIF could indicate any of several things: random variations in item functioning; artifact of the model and the need for DIF estimates to sum to zero; true change in frequency of consumption (which was reflected in differences between treatment and control at post) or difference in understanding items in relation to the underlying measure as a mere exposure effect. Given that 21% of the FVFreq items demonstrated DIF, it seems unlikely that all of the observed DIF would be due to chance. The underlying measure reflects the self-reported frequency of consumption of the items, with items at the negative end of the logit scale reflecting more frequent consumption. The fact that the treatment group consumed some items more frequently than the control group after receiving an intervention that promoted increased consumption of all these items is not unexpected if the DIF reflects true changes in intake. The fact that the control group consumed some items more frequently than the treatment group is less explainable, although French fries were specifically addressed in the intervention as less desirable due to high fat and thus perhaps less frequently consumed by the treatment group post-intervention.
Future intervention studies should combine self-report methods and more objective markers (e.g. blood markers), and employ IRM and DIF techniques to correct for this problem, thereby obtaining a clearer assessment of the functioning of different items in comparison to the more objective markers. The greater preference for French fries at post-assessment in the control group appears to reflect an intervention effect, since scouts in the intervention group were clearly informed that French fries did not count as a vegetable. The moderate increase in self-efficacy for making vegetable recipes should reflect the scout participation in weekly recipe preparation in troop meetings. The small effects for eating fruit and vegetables with dip should also reflect the intervention emphasis on these issues. Thus, the DIF appears to have identified change in a few aspects of frequency, preference and self-efficacy specifically changed by the intervention. Given the negligible level of DIF on all the items, these analyses provide little support for a mere exposure effect on understanding of these items, but the possibility of mere exposure effect on the mean change in location (differential impact) on the latent variable for behavior and self-efficacy remains.
Table VI showed that statistically correcting for treatmentcontrol group DIF at post-intervention resulted in no change in outcome main effects. Thus, DIF did not mask differential impact.
In this paper, we assessed possible DIF as an artifact of taking the measure twice by assessing possible differences in item calibrations from two different occasions of taking the scales. A total of three items showed different calibrations from Time 1 to Time 2. One item in each of the FvFreq and FvSE scales showed a DIF analogue at the 95% confidence level, about what might be expected by chance for scales of
20 items. Thus, the evidence is weak for mere exposure effect differences in how these Boy Scouts responded to these two scales on a second occasion. The evidence is slightly stronger for a mere exposure effect on the FvPref scale because the DIF analogue is larger for liking for cabbage, significant at the 99% confidence level for this 39-item scale. Further research might confirm this finding and clarify its interpretation.
These analyses were kept as straightforward as possible to demonstrate the DIF analysis procedures within IRM. A better approach to determining differences in item calibrations from different occasions would be to assess person DIF but this analysis was deemed outside the scope of this paper. A more complex analysis that allows for simultaneous measurement, treatment and measurement by treatment DIF estimation is possible, but, again, was judged too complex for the purpose of demonstrating DIF procedures.
The strengths of this paper were the reasonably large sample, the use of measures that were previously validated using classical test theory procedures and application of sophisticated psychometric analysis procedures. The weaknesses include that these measures were not previously validated using item response theory methods, that the control group received a physical activity change intervention which may have confounded the mere measurement effect, and no hierarchical analysis was performed to account for the clustering of data by troop.
| Conclusion |
|---|
|
|
|---|
These analyses demonstrated that some items functioned differently after participants participated in an intervention and after some were exposed to a measurement scale but these effects were mostly small and could be considered negligible. This may be problematic for the usual interpretation of item functioning. Future research will need to replicate these results.
| Conflict of interest statement |
|---|
|
|
|---|
None declared.
| Acknowledgements |
|---|
|
|
|---|
This research was supported by a grant from the American Cancer Society (ACS TURSG-01) and a contract from the National Cancer Institute (NCI 263-MQ-31958). This work is also a publication of the USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine and Texas Children's Hospital, Houston, TX, USA. This project has been funded in part by federal funds from the United States Department of Agriculture/Agricultural Research Service (USDA/ARS) under co-operative agreement 58-6250-6001. The contents of this publication do not necessarily reflect the views or polices of the USDA or NCI, nor does mention of trade names, commercial products or organizations imply endorsement by the US Government.
| References |
|---|
|
|
|---|
1. Baranowski T, Lin LS, Wetter DW, et al. Theory as mediating variables: why aren't community interventions working as desired? Ann Epidemiol 1997 7:S8995.
2. Baranowski T, Anderson C, Carmack C. Mediating variable framework in physical activity interventions. How are we doing? How might we do better? Am J Prev Med 1998 15:26697.[CrossRef][Web of Science][Medline]
3. Baranowski T and Jago R. Understanding mechanisms of change in children's physical activity programs. Exerc Sport Sci Rev 2005 33:16368.[CrossRef][Web of Science][Medline]
4. Domel SB, Baranowski T, Davis H, et al. Fruit and vegetable preferences among fourth and fifth grade students. Prev Med 1994 22:86679.
5. Domel SB, Baranowski T, Thompson WO, et al. Psychosocial predictors of fruit and vegetable consumption among elementary school children. Health Educ Res Theory Pract 1996 11:299308.
6. Zajonc RB. Attitudinal effects of mere exposure. J Pers Soc Psychol 1968 8:129.[Medline]
7. Bornstein RF and D'Agostino PR. Stimulus recognition and the mere exposure effect. J Pers Soc Psychol 1992 63:54552.[CrossRef][Web of Science][Medline]
8. Monahan JL, Murphy ST, Zajonc RB. Subliminal mere exposure: specific, general, and diffuse effects. Psychol Sci 2000 11:4626.[CrossRef][Web of Science][Medline]
9. Sherman SJ. On the self-erasing nature of errors of prediction. J Pers Soc Psychol 1980 39:21121.[CrossRef][Web of Science]
10. Greenwald AG, Carnot CG, Beach R, et al. Increasing voting-behavior by asking people if they expect to vote. J Appl Psychol 1987 72:3158.[CrossRef]
11. Morwitz VG, Johnson E, Schmittlein D. Does measuring intent change behavior. J Consum Res 1993 20:4661.[CrossRef]
12. Levav J and Fitzsimons GJ. When questions change behavior: the role of ease of representation. Psychol Sci 2006 17:20713.[CrossRef][Web of Science][Medline]
13. Shepard LA. Definition of bias. Handbook of Methods for Detecting Test Bias.Baltimore, MD: Johns Hopkins University Press 1982.
14. Wilson M. Constructing Measures: An Item Response Modeling Approach.Mahwah, NJ: Erlbaum 2005.
15. Thompson D, Baranowski T, Cullen K, et al. 5 A Day Boy Scout Badge program: outcome results. Am J Prev Med.2006 in press.
16. Jago R, Baranowski T, Baranowski J, et al. Fit For Life Boy Scout badge: outcome evaluation of a troop & Internet intervention. Prev Med 2006 42:1817.[CrossRef][Web of Science][Medline]
17. Wu ML, Adams RJ, Wilson MR. ACER ConQuest: Generalised Item Response Modelling Software. American College of Educational Research 1998.
18. Wright BD and Masters GN. Rating Scale Analysis.Chicago, IL: Mesa Press 1982.
19. Paek I. Investigation of differential item function: comparisons among approaches, and Extension to a multidimensional context. Unpublished PhD Dissertation.Berkeley, CA: University of California 2002.
20. Longford N, Holland P, Thayer D. Stability of the MH D-DIF statistics across populations. In Holland P and Wainer H (Eds.). Differential Item Functioning.Hillsdale, NJ: Lawrence Erlbaum Associates, Inc. 1993 pp. 17196.
Received on January 10, 2006; accepted on July 19, 2006
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. C. Masse, M. Wilson, T. Baranowski, and L. Nebeling Improving psychometric methods in health education and health behavior research. Health Educ. Res., December 1, 2006; 21(suppl_1): i1 - i3. [Full Text] [PDF] |
||||
![]() |
D. D. Allen and M. Wilson Introducing multidimensional item response modeling in health behavior and health education research Health Educ. Res., December 1, 2006; 21(suppl_1): i73 - i84. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Watson, T. Baranowski, and D. Thompson Item response modeling: an evaluation of the children's fruit and vegetable self-efficacy questionnaire Health Educ. Res., December 1, 2006; 21(suppl_1): i47 - i57. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
