Short Report | Open | Published:
Methodological considerations regarding response bias effect in substance use research: is correlation between the measured variables sufficient?
Substance Abuse Treatment, Prevention, and Policyvolume 6, Article number: 1 (2011)
Efforts for drug free sport include developing a better understanding of the behavioural determinants that underline doping with an increased interest in developing anti-doping prevention and intervention programmes. Empirical testing of both is dominated by self-report questionnaires, which is the most widely used method in psychological assessments and sociology polls. Disturbingly, the potential distorting effect of socially desirable responding (SD) is seldom considered in doping research, or dismissed based on weak correlation between some SD measure and the variables of interest. The aim of this report is to draw attention to i) the potential distorting effect of SD and ii) the limitation of using correlation analysis between a SD measure and the individual measures. Models of doping opinion as a potentially contentious issue was tested using structural equation modeling technique (SEM) with and without the SD variable, on a dataset of 278 athletes, assessing the SD effect both at the i) indicator and ii) construct levels, as well as iii) testing SD as an independent variable affecting expressed doping opinion. Participants were categorised by their SD score into high- and low SD groups. Based on low correlation coefficients (<|0.22|) observed in the overall sample, SD effect on the indicator variables could be disregarded. Regression weights between predictors and the outcome variable varied between groups with high and low SD but despite the practically non-existing relationship between SD and predictors (<|0.11|) in the low SD group, both groups showed improved model fit with SD, independently. The results of this study clearly demonstrate the presence of SD effect and the inadequacy of the commonly used pairwise correlation to assess social desirability at model level. In the absence of direct observation of the target behaviour (i.e. doping use), evaluation of the effectiveness of future anti-doping campaign, along with empirical testing of refined doping behavioural models, will likely to continue to rely on self-reported information. Over and above controlling the effect of socially desirable responding in research that makes inferences based on self-reported information on social cognitive and behavioural measures, it is recommended that SD effect is appropriately assessed during data analysis.
Prompted by frequent media exposure of high profile doping cases and prevalence reports, the inadequacy of the detection- and sanction-based deterrence to prevent doping has been progressively recognised. Parallel to this development, anti-doping efforts have turned to developing a better understanding of the behavioural determinants that underline the decision to cross the line to the land of prohibited substances. As a result, the number of social science research projects investigating the social aspects of doping has increased, including several papers developing or testing behavioural models and social cognitive processes underlying doping use [1–21]. The comprehensive review commissioned by the World Anti-Doping Agency  showed that the overwhelming majority of social science research is based on self-reports with over 100 doping related papers in the social science domain, of which 69 focused on attitudes. Self-report questionnaires comprise over 97% of these studies, in which the potential effect of response bias was seldom mentioned.
Empirical testing of anti-doping interventions is somewhat lagging behind behavioural model work with only a few notable exceptions such as the ATLAS (Athletes Training & Learning to Avoid Steroids)  and ATHENA (Athletes Targeting Healthy Exercise and Nutrition Alternatives)  for high school athletes. Although empirical evidence has put forward to show the effectiveness of these well known and widely used, school based health promotion and substance abuse prevention programmes, the evaluation in all cases was based on self-reports at both baseline and interval measurement points [25–30].
Self-report is the most commonly employed method in psychological assessment. In addition to the well known benefits of ease of use and information richness, the method has attracted considerable criticism for potentially distorting effects arising from response set and styles . Most of these limitations stem from two fundamental assumptions that the respondent is i) able to self-report and ii) be willing to self-disclose. Hence, the respondent is assumed to have sufficient insight into what is being measured yet no intention to distort his or her responses. Violations of either of these two assumptions can seriously compromise the validity of self-report assessment. Origins of this distortion range from denial through self-deception to deliberate self-impression management, with varying effect on the construct being measured . Self-presentation (socially desirable responding) is one of these potential distortions. Social desirability, a tendency of respondents to reply in a manner that will be viewed favourably by others, is one of the common method variance mechanisms that can create artefactual association. Owing to this social desirability (SD) effect, respondents may deny or deflate their responses about undesirable whilst inflate their answers on desirable attributes and/or behaviour, in particular in situations when the questions drill into socially sensitive issues.
For example, the difficulty in establishing doping prevalence rate via direct self-reports is partly caused by the inconsistent approach to defining doping, setting timeframe and frequency; and partly due to the varying degree of SD effect present in the target populations .
On account of the popularity and convenience of self-report methods, in particular when large data set is required for robust statistical analyses, considerable efforts have been made to estimate, and potentially eliminate, the SD effect in research into socially sensitive issues. These endeavours have included ensuring anonymity, using indirect measures and developing tests that are less prone to manipulation. As a last resort, when SD bias is assumed to be present and cannot be eliminated, researchers often include a scale that measures respondents' tendency to give socially desirable answers and correlate these SD scale scores with the target measures.
When socially desirable responding is considered, typically a distinction is made between SD in response set (that is a property of a particular scale) and SD response style, which is an individual difference variable and as such, affects many if not all responses given by the individual . This distinction is important in dealing with SD responding with response set SD being less problematic in psychological assessments as it affects all respondents equally with information not used in absolute levels but compared to other groups' results. However, SD as an individual difference variable could distort the data obtained  and may lead to false interpretation if scores were taken at face value. For example, a recent study using objective verification of doping showed that those who falsely claimed abstinence performed on the social cognitive measures as it would be expected from a clean athlete .
Although people with certain personality characteristics (i.e. conscientiousness) are known to score high on the SD scales, studies using objective criteria show that in most cases SD scales do not measure individual differences, hence high correlation between the SD scale and other variables indicate significant shared substantive variance , thus indicating the presence of SD distortion. A recent review suggests that SD is a motivated process in which respondents deliberately alter the information they report and the extent of this distortion depends on whether the respondent has anything compromising to report and on design features of the survey . Notably, this distortion also presents to a degree when the reporting is done anonymously, i.e. when there is no danger to be embarrassed directly or having consequences of the admitted behaviour.
Despite the fact that methods for testing, controlling and/or managing response bias are available , research into doping attitude or other predictors of doping behaviour has seldom considered response bias or made an attempt to i) estimate or ii) partial out variability owing to this effect. The WADA commissioned literature review on the antecedents of doping behaviour concluded that social science doping research would benefit considerably from improvement in research methodology and measurements . In line with this recommendation, this report aims to draw attention to i) the potential distorting effect of SD and ii) the limitation of using correlation analysis between a response bias measure and the individual variables of interest.
For this study, a subset of the data used for testing the performance enhancement model  was re-analysed using structural equation modeling technique to include a SD variable. The proposed model focuses on opinion formation and is depicted in Figure 1 (baseline model), In line with the current concepts of SD [32, 34, 35, 38] the subsequent models were formulated to test the extent to which the SD is an independent variable that affects the other measured predictor variables at the construct level (Figure 2) and indicator level for the predictor variables (Figure 3); or SD is among the independent predictors affecting expressed doping opinion (Figure 4).
Methods to control SD effects have been widely discussed, with remedies ranging from anonymity to statistical procedures applied [39, 40]. At the individual measurement level, SD is either a context specific and temporary effect relating to the response set or consistent across situations related to the person [32, 34, 35, 38]. Although both can affect self-reported responses, it is the latter that may have serious effect on the conclusions drawn from the observed relationship between the measured variables of interest. At the model level, SD is conceptualized as one of the three possible effects: i) suppress genuine relationship, ii) create artefactual relationships or iii) moderate/mediate the relationship between the predictor and the outcome variable [41, 42].
Statistical approaches suggest partial correlation and latent variable modeling to test whether SD leads to spurious or suppressor effect [40, 41], with a distinction made between suppressor variables and moderator/mediator effects . Notably however, psychometricians speak out against post hoc attempts to statistically partial out SD effect claiming that if doing so, part of the genuine and possibly important variance is also lost [31, 38]. Omitting SD when it is a theoretically important variable yields an inadequate model fit  and may lead to incorrect conclusions.
Unfortunately, the information on the SD effect on self-reports, particularly in field studies, is limited owing to the difficulty having objective information available on the same person to contrast self-reports on behaviour . Whilst the research has been done on the validity of self-reports on behaviour (i.e. being involved in an act such as drug use, smoking, drinking, etc.), the results are inconclusive. Reassuring validity reports for methods such as the Timeline Follow-Back procedure ([44, 45], Drug Abuse Screening Test , the CAGE for excessive alcohol consumption  or the Cannabis Use Problems Identification Test  are counterbalanced by studies using objective verification via biomarkers showing considerable under-reporting of substance use [49–51]. Whilst people may deny their undesirable behaviour for fear of consequences (in case of illegal behaviour), it is equally plausible that such denial is driven by self-presentation. Research showing that SD effect is present even under anonymity  supports this notion. Self-presentation plays a particularly important role in research relying on self-reported measures of various psychological constructs such as social cognition and personality. A recent investigation into doping behaviour, benefitting from synergy between social and analytical science, showed that those who denied their compromising behaviour provided answers on the related psychological assessments tapping into attitudes, beliefs and social projection that were congruent with the self-reported (but untrue) behaviour .
Therefore the work presented in this paper focuses on the potential SD distorting effect on self-reported measures of various psychological constructs. We used opinion for outcome variable as a construct that results from the combination of someone's beliefs, attitudes, desires, as well as knowledge, understanding and perceptions of a particular situation, including perceived control. Predictor variables were the general doping attitude (Performance Enhancements Attitude Scale (PEAS) ), tendency for self impression management (Marlowe-Crowne Social Desirability scale , referred to as SD measure in this paper, external and internal deterrence factors and opinion regarding allowing restricted (top level athletes only) and unrestricted (all athletes) use of doping in competitive sport. External deterrence factors were doping control, affordability, perceived use/abstinence of the opponent and disapproval of important others in the athlete's life such as family, friends and coaches. Internal deterrence factors were based on moral values (i.e. doping is cheating, disapproval of drugs) and health concerns.
Correlation coefficients were calculated between SD and other measures. The doping opinion model was tested using structural equation modeling, with and without the self-impression management variable. Scale reliability was assessed using Cronbach's alpha and the KR-21 coefficient. Relationships between the SD and other variables were tested using Pearson and correlation coefficients. Structural equation modeling was performed using AMOS 18 in the PASW package and the R statistical computing software  with the SEM package . For further analysis, participants were categorised by their SD score into high- and low SD groups using k-means clustering. All statistical analyses were performed using PASW 18.0.
The data set comprised of 278 college athletes (71.6% male) from Division I & II with the majority from Division II and II/IAA (88.9% combined), mean age 20.1 ± 1.9. Eighty-nine percent of the athletes claimed not having any personal experience with doping, which is congruent with other self-reports but most likely under-reported. Interestingly, 27% of the non-users would support having doping legalized under restricted conditions and a further 3% would even support unrestricted use for high performing athletes. The reliability coefficient values in the present sample ranged from satisfactory to good. Descriptive statistics, along with the scale reliability measures where applicable, are shown in Table 1. Cluster analysis based on SD scores resulted in two distinct groups with a naturally occurring divide at score 17, with cluster centres of 20.39 and 12.18 for high- and low SD groups, respectively. (For details, see Additional file 1: Cluster analysis creating high and low SD groups).
Correlation coefficients between SD and the other measured variables are shown in Table 2. The relationships between social desirability and other doping related measures were in the expected directions. Whilst some were statistically significant, their low value (<|0.22|) suggests that the predictor variables for doping were not strongly affected by socially desirable responding at the measurement level and exhibited an even lower level of correlation (<|0.11|) in the low SD group. However, at the model level, covariances between the latent predictor variables were statistically significant with the covariance between Attitude and Control being considerably larger than the other two (Table 3). Estimated correlations between the predictor latent variables were 0.202, -0.736 and -0.735 for Deterrence - Control, Attitude - Control and Attitude - Deterrence, respectively. Although the model fit could be improved significantly by imposing correlations between the predictor latent variables, we posit that this relationship is largely influenced by the spurious effect of SD. To test this assumption, we tested the models with and without the SD variable where correlation between Deterrence, Control and Attitude were not allowed (Figures 1 and 2). Goodness of fit statistics and fit indices, along with their corresponding customary cut-off values, are summarised in Table 4. Additional file 2 and 3 provides the correlation and covariance matrices (Additional file 2: Pearson r and Additional file 3: Covariance matrix). To facilitate comparison between the models, standardised regression weights and correlations between the latent variables in the doping opinion models are shown in Table 5.
As Table 4 shows, the model without SD variable showed poor fit and had substantial amount of unexplained covariances in the observed data. Including SD dramatically improved the model fit. The overall fit index (chi-square statistics testing H0: implied covariance structure is the same as the observed covariance matrix) has changed from very poor fit to a good fit. In an ideal scenario, a good fitting model expected to have non-significant chi-square statistics, but owing to its conservative nature, it is seldom achieved. As an alternative approach, the χ2/df ratio is used to assess overall fit where the value for good fitting model is expected to be less than 3. This ratio has dropped from 8.5 to 2.6 when SD was included. All comparative fit indices showed improvement but apart from the Bentler Comparative Fit Index (CFI), they did not quite reach the desired level suggesting that the model can be further improved with imposing further or alternative relationships with the SD variable.
In order to encapsulate the effect SD individually has on the measured predictor variables and whether SD can be considered as an independent predictor for explicitly expressed opinion, we also tested models depicted in Figures 3 and 4, respectively. Both models showed good fit (Table 6). The best model fit was achieved when SD was included as an independent predictor with covariances allowed between the endogenous variables (Figure 4). We also tested this model under high and low SD conditions using data from the high SD group and low SD group independently. Interestingly, both models independently showed good, even improved, fit. This is despite the split sample results showed no significant correlation with the SD (Table 2) and the baseline model without SD (Figure 1) did not show adequate fit (χ2 = 73.40, df = 13, p < 0.001) for the low SD group, The baseline model fit for the high SD group was slightly better but far from being adequate (χ2 = 48.77, df = 13, p < 0.001). Model fit indices for the split sample analysis are shown in Table 7. Descriptive statistics on the measured variables by SD groups are displayed in the Additional file 1: Cluster analysis creating high and low SD groups. This peculiar pattern may be suggesting two things: i) that SD has a model level effect even in cases where SD reported low and ii) SD results in giving deliberate and goal oriented strategic responses to sensitive questions, hence the measures (including the SD) are more congruent within the respective groups than in the pooled data. The latter assumption could be further tested in experimental conditions where the need for giving strategic response is manipulated.
The larger than 1 regression weights (Table 5) suggest a suppressor relationship, a statistical phenomenon that often present in social science research using latent variables if collinearity is present in the data , affecting the self-reported Attitude measure the most. The high negative correlation between Attitude and SD, which clearly exists and strong in the high SD group (-0.681) but dramatically reduced in the low SD group (-0.293), indicates that SD acts as a suppressor for Attitude measure the most with other indicators are also affected to a lesser degree. Further research is required to determine whether SD effect is a common method variance [39–41] or a theoretically meaningful component [35, 43].
In conclusion, despite that the relationship between social desirability and other doping related measures appeared to be reassuringly low, the SEM analysis revealed that the model without the SD variable contained a large amount of unexplained variances resulting in a poor model fit. Including SD increased the proportion of observed covariances explained by the model; improved the fit indices to the desirable level for a satisfactory model fit. Whilst the social desirability bias at the individual variable level was not concerning, the results showed that the accumulated effect at the model level can be quite significant. Large measurement error can result in failing to find robust relationship; hence correlation coefficients may not be able to reflect accurately the effect of socially desirable response in research based on self-report survey data. The presence of social desirability was clearly evidenced when the data were subjected to appropriate statistical tests. This is in line with a recent study showing mediating and moderating effect of social desirability between doping attitudes and susceptibility .
Based on the results reported here and in keeping with previous work , we propose that conclusions drawn on behavioural models with several determinants of doping (or drug), relying solely on self-reports, should be interpreted cautiously. Repeating some key research with the inclusion and measure of SD effect to provide further evidence for (or falsify) the assumption that SD is a substantial part of the explicit measures of the social cognitive determinants of doping would be a worthwhile endeavour, with a potential to advance the current standing of social science research on doping significantly. In addition to coalescing disparate analytical and social approaches to create a unique platform to investigate sensitive behaviour, progress has also been made in identifying methods that may overcome the limitations associated with the sole use of self-report methodology such as introspective limits and social desirability . In this study, combining self-reported measures with implicit associations in the in the context of objective behavioural information, a distinctive cognitive patterns emerged for those athletes who denied their doping use.
In the absence of direct observation of the behaviour in question (i.e. doping use), evaluation of the effectiveness of future anti-doping campaign, along with behavioural model testing, will likely to continue to rely on self-reported information. Controlling the effect of socially desirable responding is recommended in research that makes inferences based on self-reported information on social cognitive and behavioural measures. Considering SD in study design where it is feasible is strongly recommended . Situations with reduced demand for giving SD responses where respondents are not fully aware of the purpose of the investigation or the options for giving strategically selected responses are not overtly available by the questionnaire design could help reducing SD distortion. For example, implicit social cognition research investigating automatic process underlying social judgements and behaviour has steadily gained popularity in social psychology . The implicit association test (IAT) procedures, relying on latency differences measured on carefully crafted lexical sorting tasks [59–61] are thought to overcome, at least to a degree, the limits associated with and has shown predictive power over and above explicit self-reports for future behaviour . Upon further refinement, a combined explicit and implicit assessment approach can be successfully used in to improve self-report methodology. In cases where SD effect cannot be mitigated via study designs, including statistical analyses to estimate the extent and magnitude of the SD effect in research on the determinants of socially sensitive behaviours is strongly recommended.
Findings from this research should be extended to other variables used for predicting doping. These constructs include but not limited to vulnerability/susceptibility, willingness, motivation and self-efficacy. Owing to the increasing requirement to move from output-based to outcome-based evaluation in drug-prevention, findings and recommendations of this report may be of interest to researchers and practitioners beyond sport and doping.
The authors declare that they have no competing interests.
Denham BE: Determinants of anabolic-androgenic steroid risk perceptions in youth populations: a multivariate analysis. J Health Soc Behav. 2009, 50: 277-292. 10.1177/002214650905000303.
Dodge T, Jaccard JJ: Is abstinence an alternative? Predicting adolescent athletes' intentions to use performance enhancing substances. J Health Psychol. 2008, 13 (5): 703-711. 10.1177/1359105307082460.
Donovan RJ, Egger G, Kapernick V, Mendoza J: A conceptual framework for achieving performance enhancing drug compliance in sport. Sports Med. 2002, 32: 269-284. 10.2165/00007256-200232040-00005.
Dunn M, Mazanov J, Sitharthan G: Predicting future anabolic-androgenic steroid use intentions with current substance use: findings from an Internet-based survey. Clin J Sport Med. 2009, 19 (3): 222-227. 10.1097/JSM.0b013e31819d65ad.
Goulet C, Valois P, Buist A, Côté M: Predictors of the use of performance-enhancing substances by young athletes. Clin J Sport Med. 2010, 20 (4): 243-248. 10.1097/JSM.0b013e3181e0b935.
Laure P, Lecerf T, Friser A, Binsinger C: Drugs, recreational drug use and attitudes towards doping of high school athletes. Int J Sports Med. 2004, 25: 133-138. 10.1055/s-2004-819946.
Laure P, Favre A, Binisinger C, Mangin G: Can self-assertion be targeted in doping prevention actions among adolescent athletes? A randomized controlled trial. Serbian J Sport Sci. 2009, 3 (3): 105-110.
Litt D, Dodge T: A longitudinal investigation of the Drive for Muscularity Scale: predicting use of performance enhancing substances and weightlifting among males. Body Image. 2008, 5 (4): 346-351. 10.1016/j.bodyim.2008.04.002.
Lucidi F, Grano C, Leone L, Lombardo C, Pesce C: Determinants of the intention to use doping substances: An empirical contribution in a sample of Italian adolescents. Int J Sport Psychol. 2004, 35: 133-148.
Lucidi F, Zelli A, Mallia L, Grano C, Russo PM, Violani C: The social-cognitive mechanisms regulating adolescents' use of doping substances. J Sport Sci. 2008, 26 (5): 447-456. 10.1080/02640410701579370.
Peretti-Watel P, Guagliardo V, Verger P, Mignon P, Pruvost J, Obadia Y: Attitudes toward doping and recreational use among French elite student-athletes. Soc Sport J. 2004, 21: 1-17.
Petroczi A: Attitudes and doping: A structural equation analysis of the relationship between athletes' attitudes, sport orientation and doping behaviour. Subst Abuse Treat Prev Policy. 2007, 2: 34-10.1186/1747-597X-2-34.
Petróczi A, Aidman E: Psychological drivers in doping: The life-cycle model of performance enhancement. Subst Abuse Treat Prev Policy. 2008, 3: 7-
Rees CR, Zarco EPT, Dawn K, Lewis DK: The steroids/sports supplements connection: pragmatism and sensation-seeking in the attitudes and behavior of JHS and HS students on Long Island. J Drug Educ. 2008, 38 (4): 329-349. 10.2190/DE.38.4.b.
Sas-Nowosielski K, Swiatkowska L: Goal orientations and attitudes toward doping. Int J Sport Med. 2008, 29 (7): 607-612. 10.1055/s-2007-965817.
Shakeri J, Parvizifard AA, Sadeghi K, Kaviani S, Hashemian AH: Cognitive correlations and psychological morbidities of doping in adolescent athletes in Kermanshah, Iran. Iranian J Psychiatry Behav Sci. 2009, 3 (1): 38-43.
Strelan P, Boeckmann RJ: A new model for understanding performance enhancing drug use by elite athletes. J Appl Sport Psychol. 2003, 15: 176-183. 10.1080/10413200305396.
Strelan P, Boeckmann RJ: Why drug testing in elite sport does not work: perceptual deterrence theory and the role of personal moral beliefs. J Appl Soc Psychol. 2006, 36: 2909-2934. 10.1111/j.0021-9029.2006.00135.x.
Wichstrom L, Pedersen W: Use of anabolic-androgenic steroids in adolescence: winning, looking good or being bad?. J Stud Alcohol. 2001, 62 (1): 5-13.
Wiefferink CH, Detmar SB, Coumans B, Vogels T, Paulussen TGW: Social psychological determinants of the use of performance-enhancing drugs by gym users. Health Educ Res. 2008, 23 (1): 70-80. 10.1093/her/cym004.
Zelli A, Mallia L, Lucidi F: The contribution of interpersonal appraisals to a social-cognitive analysis of adolescents' doping use. Psych Sport Exerc. 2010, 11 (4): 304-311. 10.1016/j.psychsport.2010.02.008.
Backhouse S, McKenna J, Robinson S, Atkin A: Attitudes, behaviours, knowledge and education - Drugs in sport: past, present and future. 2007, http://www.wada-ama.org/rtecontent/document/Backhouse_et_al_Full_Report.pdf
Goldberg L, Elliot D, Clarke GN, MacKinnon DP, Moe E, Zoref L, Green C, Wolf SL, Greffrath E, Miller DJ, Lapin A: Effects of a multidimensional anabolic steroid prevention intervention. The Adolescents Training and Learning to Avoid Steroids (ATLAS) Program. JAMA. 1996, 276 (19): 1555-1562. 10.1001/jama.276.19.1555.
Elliot DL, Goldberg L, Moe EL, Defrancesco CA, Durham MB, Hix-Small H: Preventing substance use and disordered eating: initial outcomes of the ATHENA (athletes targeting healthy exercise and nutrition alternatives) program. Arch Pediatr Adolesc Med. 2004, 158 (11): 1043-1049. 10.1001/archpedi.158.11.1043.
Elliot DL, Goldberg L, Moe EL, DeFrancesco CA, Durham MB, McGinnis W, Lockwood C: Long-term outcomes of the ATHENA (Athletes Targeting Healthy Exercise & Nutrition Alternatives) Program for female high school athletes. J Alcohol Drug Educ. 2008, 52 (2): 73-92.
Goldberg L, Elliot DL, MacKinnon DP, Moe EL, Kuehl KS, Yoon M, Taylor A, Williams J: Outcomes of a prospective trial of student-athlete drug testing: the Student Athlete Testing Using Random Notification (SATURN) study. J Adolesc Health. 2007, 41: 421-429. 10.1016/j.jadohealth.2007.08.001.
Goldberg L, Elliot DL: Preventing substance use among high school athletes. The ATLAS and ATHENA Programs. J Appl School Psychol. 2005, 21 (2): 63-87. 10.1300/J370v21n02_05.
Goldberg L, MacKinnon DP, Elliot DL, Moe EL, Clarke G, Cheong J: The adolescents training and learning to avoid steroids program: preventing drug use and promoting health behaviors. Arch Pediatr Adolesc Med. 2000, 154 (4): 332-338.
Kuehn BM: Teen steroid, supplement use targeted. Officials look to prevention and better oversight. JAMA. 2009, 302 (21): 2301-2303. 10.1001/jama.2009.1711.
Ranby KW, Aiken LS, Mackinnon DP, Elliot DL, Moe EL, McGinnis W, Goldberg L: A mediation analysis of the ATHENA intervention for female athletes: prevention of athletic-enhancing substance use and unhealthy weight loss behaviors. J Pediatr Psychol. 2009, 34 (10): 1069-1083. 10.1093/jpepsy/jsp025.
Paulhus DL, Vazire S: The self-report method. Handbook of research methods in personality psychology. Edited by: Robins RW, Fraley RC, Kruger RF. 2010, New York: Guilford, 224-239.
Paulhus DL: Measurement and control of response bias. Measures of personality and social psychological attitudes. Edited by: Robinson JP, Shaver PR, Wrightsman LS. 1991, San Diego, CA: Academic Press, 1: 17-59.
Lentillon-Kastner V, Ohl F: Can we measure accurately the prevalence of doping?. Scand J Med Sci Sports. 2010
Paulhus DL: Socially desirable responding the evolution of a construct. The role of constructs in psychological and educational measurement. Edited by: Braun HI, Jackson DN, Wiley DE. 2002, Mahwah, NJ: Erlbaum, 49-69.
McCrae RR, Costa PT: Social desirability scales: more substance than style. J Consult Clin Psychol. 1983, 51 (6): 882-888. 10.1037/0022-006X.51.6.882.
Petróczi A, Aidman EV, Hussain I, Deshmukh N, Nepusz T, Uvacsek M, Tóth M, Barker J, Naughton DP: Virtue or pretense? Looking behind self-declared innocence in doping. PLoS One. 2010, 5 (5): e10457-
Tourangeau R, Yan T: Sensitive questions in surveys. Psychol Bulletin. 2007, 133: 859-883. 10.1037/0033-2909.133.5.859.
Uziel L: Rethinking social desirability scales: from impression management to interpersonally oriented self-control. Perspectives Psychol Sci. 2010, 5: 243-10.1177/1745691610369465.
Podsakoff PM, MacKenzie SB, Lee JY, Podsakoff NP: Common method biases in behavioural research: a critical review of the literature and recommended remedies. J Applied Psychol. 2003, 88 (5): 879-903. 10.1037/0021-9010.88.5.879.
Ganster DC, Hennessey HW, Luthans F: Social desirability response effects: Three alternative models. Acad Management J. 1983, 26: 321-331. 10.2307/255979.
Williams LJ, Anderson SE: An alternative approach to methods effects by using latent-variable models: applications in organizational behavior research. J Applied Psychol. 1994, 79 (3): 323-331. 10.1037/0021-9010.79.3.323.
Baron RM, Kenny DA: The moderator-mediator variable distinction in social psychological research: conceptual, strategic and statistical considerations. J Pers Soc Psychol. 1986, 51: 1173-1182. 10.1037/0022-35184.108.40.2063.
Kline TJB, Sulsky LM, Rever-Moriyama SD: Common method variance and specific errors: a practical approach to detection. J Psychol. 2000, 134: 401-421. 10.1080/00223980009598225.
Sobell LC, Sobell MB: Timeline Follow-back: a technique for assessing self-reported ethanol consumption. Measuring alcohol consumption: psychosocial and biological methods. Edited by: Allen J, Litten RZ. 1992, Totowa, NJ:Humana Press, 41-72. 1992.
Sobell LC, Brown J, Leo GI, Sobell MB: The reliability of the Alcohol Timeline Followback when administered by telephone and by computer. Drug Alcohol Dependency. 1996, 42: 49-54. 10.1016/0376-8716(96)01263-X.
Yudko E, Lozkhina O, Fouts A: A comprehensive review of the psychometric properties of the Drug Abuse Screening Test. J Subst Abuse Treatment. 2007, 32: 189-198. 10.1016/j.jsat.2006.08.002.
Dhalla S, Kopec JA: The CAGE Questionnaire for alcohol misuse: a review of reliability and validity studies. Clin Investigative Med. 2007, 30 (1): 33-41.
Bashford J, Flett R, Copeland J: The Cannabis Use Problems Identification Test (CUPIT): development, reliability, concurrent and predictive validity among adolescents and adults. Addiction. 2010, 105: 615-625. 10.1111/j.1360-0443.2009.02859.x.
Musshoff F, Driever F, Lachenmeier K, Lachenmeier DW, Banger M, Madea B: Results of hair analyses for drugs of abuse and comparison with self-reports and urine tests. Forensic Sci Int. 2006, 156: 118-123. 10.1016/j.forsciint.2004.07.024.
Legerwood DM, Goldberger BA, Risk NK, Lewis CE, Price RK: Comparison between self-report and hair analysis of illicit drug use in a community sample of middle-aged men. Addictive Behav. 2008, 33: 1131-1139. 10.1016/j.addbeh.2008.04.009.
Delaney-Black V, Chiodo LM, Hannigan JH, Greenwald MK, References and further reading may be available for this article. To view references and further reading you must this article, Janiss J, Patterson G, Huestis MA, Ager J, Sokol RJ: Just say "I don't": lack of concordance between teen report and biological measures of drug use. Pediatrics. 2010
Petroczi A, Aidman EV: Measuring explicit attitude as an indicator of athletes' engagement in doping: Review of the psychometric properties of the Performance Enhancement Attitude Scale. Psych Sport Exerc. 2009, 10: 390-396. 10.1016/j.psychsport.2008.11.001.
Crowne DP, Marlowe D: A new scale of social desirability independent of psychopathology. J Cons Psychol. 1960, 24: 349-354. 10.1037/h0047358.
R Development Core Team: R: A Language and Environment for Statistical Computing. 2010, R Foundation for Statistical Computing, Vienna, Austria, ISBN: 3-90051-07-0
Fox J: Structural equation modeling with SEM package in R. Struct Equat Model. 2006, 13: 465-486. 10.1207/s15328007sem1303_7.
Maassen GH, Bakker AB: Suppressor variables in path models. Soc Methods Res. 2001, 30: 241-270. 10.1177/0049124101030002004.
Gucciardi DF, Jalleh G, Donovan RJ: Does social desirability influence the relationship between doping attitudes and doping susceptibility in athletes?. Psych Sport Exerc. 2010, 11 (6): 479-486. 10.1016/j.psychsport.2010.06.002.
Payne BK, Gawronski B: A history of implicit social cognition: where is it coming from? Where is it going?. Handbook of implicit social cognition: measurement, theory, and applications. Edited by: Gawronski B, Payne K. 2010, New York, NY:Guilford Press, 1-8.
De Houwer J, Teige-Mocigemba S, Spruyt A, Moors A: Implicit measures: a normative analysis and review. Psychol Bulletin. 2009, 135 (3): 347-368. 10.1037/a0014211.
Gawronski B: Ten frequently asked questions about implicit measures and their frequently supposed, but not entirely correct answers. Canadian Psychol. 2009, 50 (3): 141-150.
Schnabel K, Asendorpf JB, Greenwald AG: Assessment of individual differences in implicit cognition. Eur J Psychol Assessment. 2008, 24: 210-217. 10.1027/1015-57220.127.116.11.
Greenwald AG, Poehlman TA, Uhlmann E, Banaji MR: Understanding and using the Implicit Association Test: III. Meta-analysis of predictive validity. J Personality Social Psychol. 2009, 97: 17-41. 10.1037/a0015575.
AP initiated the study and collected the data. AP and TN performed the statistical analyses contributed equally to drafting the manuscript and both approved the final version.