madd message effects: a twelve-year ... - UNM Digital Repository [PDF]

Dec 1, 2010 - Woodall for encouraging me to engage in this dissertation topic and his support over the .... Company of N

0 downloads 5 Views 3MB Size

Report

Download PDF

PNG Network

Recommend Stories

A Madd

Forget safety. Live where you fear to live. Destroy your reputation. Be notorious. Rumi

Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ

Where there is ruin, there is hope for a treasure. Rumi

Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ

Ego says, "Once everything falls into place, I'll feel peace." Spirit says "Find your peace, and then

Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ

Be like the sun for grace and mercy. Be like the night to cover others' faults. Be like running water

Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ Digital Repository UNEJ

Don't ruin a good today by thinking about a bad yesterday. Let it go. Anonymous

Digital Repository Universitas Jember

Seek knowledge from cradle to the grave. Prophet Muhammad (Peace be upon him)

Digital Repository Universitas Jember

Don't fear change. The surprise is the only way to new discoveries. Be playful! Gordana Biernat

Digital Repository Universitas Jember

The wound is the place where the Light enters you. Rumi

Digital Academic Repository

If your life's work can be accomplished in your lifetime, you're not thinking big enough. Wes Jacks

Digital Repository Universitas Jember

Silence is the language of God, all else is poor translation. Rumi

Idea Transcript

University of New Mexico

UNM Digital Repository Communication ETDs

Electronic Theses and Dissertations

12-1-2010

MADD MESSAGE EFFECTS: A TWELVEYEAR RANDOMIZED TRIAL Una E. Medina

Follow this and additional works at: http://digitalrepository.unm.edu/cj_etds Recommended Citation Medina, Una E.. "MADD MESSAGE EFFECTS: A TWELVE-YEAR RANDOMIZED TRIAL." (2010). http://digitalrepository.unm.edu/cj_etds/21

This Dissertation is brought to you for free and open access by the Electronic Theses and Dissertations at UNM Digital Repository. It has been accepted for inclusion in Communication ETDs by an authorized administrator of UNM Digital Repository. For more information, please contact [email protected].

MADD MESSAGE EFFECTS: A TWELVE-YEAR RANDOMIZED TRIAL BY UNA MEDINA B.A., Communication, The University of New Mexico, 2002 M. A., Communication, The University of New Mexico, 2004

DISSERTATION Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy Communication The University of New Mexico Albuquerque, New Mexico December, 2009

DEDICATION This dissertation is dedicated to our beloved friend and mentor Dr. Everett M. Rogers, who taught us to ask questions about not only what and who is included, but to also ask questions about who and what have been excluded, and why? Another question Dr. Rogers loved to ask was, ― Where did you get that idea?‖ He led me to realize that everything I am, every idea and work of scholarship has grown from the fertile soil of the innovations of others. For Ev, and through you To all teachers, mentors, scholars From the vantage of your shoulders We, blessed, stand on tiptoes Peering over the mountain of mundane We see destiny, Our Mandala The vast blue canopy of the sky

iii

ACKNOWLEDGMENTS MADD VIP groups were found to have lower moods following MADD VIP presentations than before the MADD VIP (Woodall, Delaney, Rogers, & Wheeler, 2007). why?‖ He discovered in the 2-year follow-up that MADD Dr. W. Gill Woodall asked ― VIP, participants‘ recidivism rates were 30% higher than their DWI School comparison group, trending toward significance at p = .0583. Probing the meaning of these findings, Dr. Woodall conceived of the possibility that there may be a message effect operating within the MADD VIP audiences, and he suggested this dissertation study. I thank Dr. Woodall for encouraging me to engage in this dissertation topic and his support over the two years spent on this study. He has shared his data and opened his archive files. Dr. Woodall generously purchased 12 years of DWI recidivism traffic safety data used in this dissertation. His professional and mentoring style will remain with me as I continue my career. I thank my committee members, Dr. Janice Schuetz, Dr. Mario Rivera, Dr. Virginia McDermott, and Dr. Harold Delaney, and earlier committee member Dr. Ken Frandsen, for their recommendations in this study and support of my professional development. Dr. Janice Schuetz, in conversation with Dr. Woodall and me, suggested Brehm‘s (1966) theory on reactance to explain the MADD VIP message effect. She has deeply influenced this dissertation and the development of my scholarship. She thoughtfully guided the structure of the literature review, suggesting its organization by types of reactance variables. Her advice and editing on this dissertation, her leadership in my comprehensive examination and her constant support has filled the sails of my scholar ship with gentle and steady breezes. Dr. Schuetz introduced me to grounded theory method, which I used in both my M.A. thesis and this dissertation. She invited me to journey into the rich and rewarding seas of qualitative analysis. She supplied me with the tools to discover theoretical relationships among constructs, and to believe I could develop theory. Dr. Mario Rivera has propelled my professional development from our first meeting on April 24, 2002 at Dr. Everett Rogers‘ Distinguished Lecturer Award ceremony. Dr. Rivera encouraged me to write with Dr. Rogers on the relationships between complex adaptive systems theory and diffusion of innovations theory, a writing project that became my debut publication in The Innovation Journal. Since then Dr. Rivera has been a dear mentor and an insightful and thorough co-author in three more peer-reviewed journal publications. During this dissertation process, Dr. Rivera has engaged me in long discussions about the articulation of my work and probed my conceptualizations. He has generously spent many hours, beyond what any candidate could hope, advising me and teaching me how to improve the quality of writing within this manuscript. Dr. Ken Frandsen, who began on the committee and then retired to Colorado, introduced me to the concept of a ― committee‖ in 2001 when we met in my early stages of my undergraduate McNair Fellowship. He offered to serve on my committee and I was such a greenhorn scholar that I had to ask someone, ― What is a committee?‖ Dr. Rogers‘ parting words included the advice to ― listen to Ken.‖ Did he somehow guess that Dr. iv

Frandsen would open doors to innovative research? After Dr. Rogers‘ passing, Dr. Frandsen‘s mentorship led to my receipt of a scholarship to attend the New England Complex Systems Institute at MIT and later earn a graduate certificate in computer modeling of complex systems. Systems thinking, a nonlinear approach to data analysis influenced some of the unique problem-solving approaches that led to breakthroughs in this dissertation. When Dr. Frandsen chose me to assist him in developing group activities for ― Introduction to Communication‖ little did I realize that exercise would develop skills that later would be exercised again in an invited textbook proposal at Sage Publications. He taught me how to analyze student response metrics on computerized tests, and shared his extremely useful and standardized method for grading student papers. Dr. Frandsen served on the prospectus committee and, upon retirement, suggested Dr. Virginia McDermott as his replacement on this dissertation committee. Dr. Virginia McDermott has been an inspiration to my quantitative scholarship since we taught CJ507 ― Quantitative Methods‖ together, with my role as her lab assistant. I attended every one of her statistics lectures and she impressed me with her thoroughness and the depth of her statistics knowledge. She has gone out of her way to encourage and develop my professional identity. Dr. McDermott helped me organize and write my first curriculum vitae. Her original vitae document remains the core of the vitae I use today. She has been an enthusiastic and energetic supporter throughout the dissertation process. I owe a debt of gratitude to Dr. Harold Delaney, statistician and author of ― Designing Experiments and Analyzing Data: A Model Comparison Perspective,‖ which he co-authored with Scott Maxwell. Dr. Delaney taught me advanced ANOVA procedures in his litmus-test course for the University of New Mexico Psychology Ph.D. program. He taught me how to calculate many types of ANOVA by hand, and when to apply considerations for ANOVA contrasts. Because of Dr. Delaney‘s course, I can calculate an unequal ― n‖ ANOVA by hand, using, in the case of this study, a weighted grand mean, which was useful in determining that I did not have to conduct a hierarchical linear model analysis. This method was critical for weighting means to verify equal distribution of DWI predictor characteristics between groups for age and number of prior arrests. He gave me the tools to conduct a contrast using psi to contrast two low reactance-inducing VIP Groups against eleven other high-reactance VIP Groups. This calculation enabled me to justify my categorizing the VIP groups into low and highreactance groups, enabling me explore whether a change in reactance levels was consistent with a change in DWI recidivism. Dr. Delaney was a weekly statistical sounding board. He and Dr. Woodall closely supervised the methods and results sections. Eric Erhardt, Ph.D. Candidate in statistics and 2008 Statistics Lab Director, consulted on data transformation considerations, proportional hazard assumption tests and considerations, the meaning of a time-dependent covariate, and the realization that if time-dependence is not detected in a variable then its effect may be interpreted as continuous (not diminishing or increasing) over time. Time dependence is a very important test for message effects studies that is seldom, if ever, used. Alvaro Nosedal-Sanchez, Ph.D. Candidate in statistics and 2009 Statistics Lab Director, meticulously explained the method for computing odds versus the method for computing probability values of odds, both from the frequency table and from the loglinear logit coefficient (the log odds ratio), as a means to confirm results obtained v

through Cox Proportional Hazards (survival analysis) and loglinear logit regression. He met with me during the 2009 months of analysis twice per week. I am grateful to his commitment in seeing me through the process of choosing rationales, ruling out alternative approaches, and seconding my decisions. He applied his sea of knowledge in a broad range of statistics, satisfying my quest to find the best methods and most accurate and clear interpretations given the complexity of the data structure. Zoe Johnson of University of New Mexico ITS loaded SAS on my computer and pointed me to the online procedure manuals. Robert Hudson at PNM (Public Service Company of New Mexico) helped with the translating of the SPSS and Excel files of an early version of the dissertation data into SAS. He worked with me on the replication of SPSS results in the SAS environment. He co-wrote the SAS code with me on an early process of COX repeated measures procedure PHREG (Proportional Hazards Regression) that offered output not available in SPSS. Those analyses must be reconducted on the final dissertation data and as such are reserved for a future research project. During that SAS process, I learned much about the best way to prepare and read files into SAS and how to configure PHREG arguments to attain conversion. This will be useful when I return to SAS to conduct more analysis. SAS is the preferred statistical package used by Dr. Hongwei Zhao, who consulted on the survival analysis. Dr. Hongwei Zhao, Department of Epidemiology and Biostatistics, Texas A&M Health Science Center, a specialist in survival analysis and longitudinal data analysis, consulted on the survival analysis procedures. She lent her expertise in survival analysis based on her experience with the data set through which she has coauthored several publications with Dr. Woodall and Dr. Delaney on DWI prediction, prevention, and intervention in New Mexico (Delaney, H. D., Kunitz, S.J., Zhao, H., Woodall, W.G., Westerberg, V., Rogers, E. & Wheeler, D.R., 2005; Kunitz, S. J., Woodall, W. G., Zhao, H., Wheeler, D. R., Lillis, R., & Rogers, E., 2002; Kunitz, S.J., Zhao, H., Wheeler, D.R., & Woodall, W.G., 2006; Woodall, W.G., Delaney, H.D., Kunitz, S.J., Westerberg, V.S. & Zhao, H., 2007; Woodall, W.G., Kunitz, S.J., Zhao, H., Wheeler, D.R., Westerberg, V. & Davis, J., 2004). Her generous collegial assistance in this dissertation has impressed and warmed my heart. The data used in this study were complex to derive. A number of researchers and professionals provided data and helped link MADD VIP exposure to subsequent recidivism records. I extend my gratitude to UNM CASAA for providing the videotapes, transcripts, dates, and list of offender participants in the MADD VIP presentations. Thanks are expressed to James W. Davis, Associate Director of UNM Division of Government Research, Institute for Applied Research Services, for procuring and matching DWI recidivism data to MADD VIP participants. I wish to thank my research associates. Christina Shapiro, M.A., L.M.H.C. She provided professional interpretations of the DSM-IV classifications of levels of severity of mental illness. This understanding was necessary in order to interpret Jacob Cohen‘s (1968) weighted kappa example on levels of ordinal disagreement between psychologists rating mental health patients. Understanding the case study example was important in making preparations for applying Cohen‘s kappa (1960) for calculating agreement of coders on ordinal reactance-inducing codes in the content analysis of MADD VIP message qualities. vi

Eight graduate scholars worked diligently as MADD VIP transcript messagequality coders in this project. Phase 1 coders each coded 433 statements in 12 documents: Laura Burton, Ph. D. student, Santhosh Chandrashekar, Ph.D. student, Haibin Dong, Ph.D. Candidate, Yen Dong, M.A. student, Doris Fields, Ph.D., Lissa Knudsen, Ph.D. Candidate, Taura Mangone, Ph.D. Candidate, and Keena Neal, M.A. Taura Mangone and Keena Neal, due to their consistent judgments, earned the highest interrater reliability1 (kappa = .90) of all eight coders in phase 1. In Phase 2, Mangone and Neal then generously recoded all 2,021 VIP statements with a degree of accuracy that is considered strong agreement (weighted kappa = .83), improving the reliability of the analysis and the reliability of the operationalizations and measurements of reactance theory constructs as independent variables in this message effects research. Mia Logan, Ph.D. of Ltd. Unlimited coached me, cheering me onwards during the most critical three months of the data analysis process. She supported my cancellation of intervening activities, appointments, and distractions. She and her business partner Charlotte Hendrix, Ph.D. hosted a Dissertation Boot Camp during which I was able to make tremendous progress for seventy hours during six days with no distractions. This ― camp‖ enabled me to focus in depth. It enabled me to identify and correct bad data, and see possibilities in the data segmentation and analysis that would not have been evident without such focused and uninterrupted thought. I thank my friends, Taura Mangone, Christina Shapiro, Radoslava Simeonova, and Spence Shaw who kept the vigil throughout my dissertation process with camaraderie and support. Christina Shapiro, Spencer Shaw, and my cousin Michelle Maestes read the entire manuscript and identified terms that should be defined. Greg Mechels offered explanations to untangle early and mystifying results, muscled fifty-pound data storage boxes into numeric order, and seconded an investigation of hard copies, validating the electronic database against the original study documents. Your gift of time is always remembered and appreciated. Monica Maasin double-checked citations against references with meticulous care, uncovering instances where cascading revisions had resulted in inconsistencies in citations and a need to update the reference section. To my children Jean-Philippe Medina Senart, Rex Addison Davis, and Eleanor Marie Davis, who offered immeasurable incentive and inspiration over the years, thank you for sharing this journey. I look forward to attending your dissertation defenses. Finally, to my husband John Davies Olmsted, thank you for encouraging and making this academic journey possible. Our marriage is continually renewed, a string of moments, pearls strung together in one shared life: patience, devotion, and faith.

1

Interrater reliability is the degree to which two or more analysts agree upon the ratings they assign to elements within a body of data. The higher the interrater agreement, or agreement between raters, the greater is the reliability of their merged scores.

vii

MADD MESSAGE EFFECTS: A TWELVE-YEAR RANDOMIZED TRIAL BY UNA MEDINA

ABSTRACT OF DISSERTATION Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy Communication The University of New Mexico Albuquerque, New Mexico December, 2009

MADD MESSAGE EFFECTS: A TWELVE-YEAR RANDOMIZED TRIAL

BY UNA MEDINA B. A., Communication, The University of New Mexico, 2002 M. A., Communication, The University of New Mexico, 2004 Ph.D., Communication, The University of New Mexico, 2009 ABSTRACT One out of three Americans undergoes drunk-driving crashes; 23% result in death. To deter DWIs (Driving While under Influence), MADD (Mothers Against Drunk Drivers) created VIPs (Victim Impact Panels) where victims impact offenders with gory stories, photos, and threats of punishments and loss of freedom, hoping this message will deter DWIs. It is remarkable that although the VIP message is considered a primary DWI intervention, yet no studies have investigated VIP message effects. VIP message effects, their persistence and decay, are chronicled here over the course of 12 years. This study extends an empirical investigation of VIPs, conducted by Woodall, Delaney, Rogers, and Wheeler (2007) (n = 833) during 1994-1996. At 2 years, these researchers found MADD VIP participants‘ recidivism rates were 30% higher than their DWI School comparison group, trending toward significance at p = .0583. This study supports those results as significant at 12 years. As an extension, it investigates whether reactance theory explains VIP message effects failure. Reactance theory research, a subset of message effects research, explains how emotional, confrontational, ix

and threatening messages induce psychological reactance in the mind of the message receiver, who then seeks to preserve his or her sense of freedom by behaving contrarily (Brehm, 1966). Hierarchically intensifying effects of these theoretical reactance antecedents are studied here in an unusual manner, as they occur in vivo, in real life. The same intervention was observed to have different effects depending on prior conditions and demographics. The emotional high-threat, high-confrontation MADD VIP message coincided with significantly shorter time to recidivism (p = .009, d = 1.64) and significantly higher number of subsequent arrests (p < .0001, d = 1.64) among recent prior offenders, and those with no priors under age 30 (p = .01, d = 0.35). Younger offenders may be associated with more iconoclastic2 behavior than older offenders (Beirness & Simpson, 1997; Greenberg, 2005; NHTSA, 2008), partially explaining the under-30 age effect. This study furthers persuasive message design as a science and suggests a messagebased approach to intervention analysis. There was no effect when MADD VIP was analyzed simply as an intervention. However, there were highly significant effect sizes when the same MADD VIP intervention was analyzed as a message. This study concludes by offering MADD VIP best practice recommendations.

2

Icon is a symbol that stands for something else. Clastic means shattering or smashing. Thus iconoclastic refers to a propensity to tear down, destroy, or at the very least disregard conventional symbols, practices, and mores.

x

TABLE OF CONTENTS LIST OF FIGURES ......................................................................................................... xix LIST OF TABLES ............................................................................................................ xx PREFACE ........................................................................................................................ xxi CHAPTER 1: INTRODUCTION ....................................................................................... 1 PROBLEM STATEMENT: BACKGROUND OF THE MADD VIP MESSAGE EFFECTS PROBLEM .......................................................................................................... 4 Prevalence and significance of the repeat drunk-driving problem .................. 6 Economic impact of drunk-driving behavior ................................................... 8 Courts mandate thousands to MADD VIPs without evidence of efficacy ...... 9 Methodological problems involving previous research ................................. 13 PURPOSE AND RATIONALE ....................................................................................... 14 Theoretical import, framework, and scope .................................................... 15 CHAPTER 1 SUMMARY.............................................................................................. 19 CHAPTER 2: REVIEW OF RELATED LITERATURE ................................................. 23 EVIDENCE THAT MADD VIPS WORK ...................................................................... 25 EVIDENCE THAT MADD VIPS DO NOT WORK ........................................................ 26 DRUNK DRIVING INTERVENTIONS ............................................................................ 29 Intervention effect sizes ................................................................................. 29 Most effective interventions .......................................................................... 30 GENERAL VARIABLES IN DWI RESEARCH................................................................ 32 Age .............................................................................................................. 33 Gender............................................................................................................ 35 xi

Number of prior arrests .................................................................................. 36 GENERAL VARIABLES IN MESSAGE EFFECTS RESEARCH.......................................... 37 Message context............................................................................................. 38 Message content............................................................................................. 43 Message function ........................................................................................... 44 Message intensity........................................................................................... 44 Message frequency ........................................................................................ 46 Message frequency and intensity: a combination of metrics ......................... 46 Message pathos .............................................................................................. 47 Message decay rate ........................................................................................ 48 REACTANCE THEORY VARIABLES INVOLVING MESSAGE SENDERS.......................... 50 Strong intent to persuade ............................................................................... 51 Confrontation ................................................................................................. 63 Public censure ................................................................................................ 66 REACTANCE VARIABLES INVOLVING MESSAGE RECEIVERS .................................... 69 Message receiver‘s confidence he possesses freedom to comply or not ....... 71 Message receiver‘s perceived import of freedom .......................................... 72 Message receiver‘s belief that MADD threatens other freedoms .................. 74 REACTANCE THEORY‘S USEFULNESS IN EXPLAINING MADD MESSAGE EFFECTS... 77 Anti-abuse Messages Induce Reactance ........................................................ 78 Confrontational Messages Induce Reactance ................................................ 79 FIELD NOTES FROM A MADD VIP PARTICIPANT OBSERVATION PILOT STUDY ....... 82 Strong intent to persuade ............................................................................... 82 xii

Forewarning ................................................................................................... 82 Confrontation and public censure .................................................................. 83 Signs of perceived threat ............................................................................... 83 CHAPTER 2 SUMMARY.............................................................................................. 84 Need to test message effects of the MADD VIP ........................................... 84 Need to test reactance theory constructs and assumptions in context of MADD ................................................................................................. 85 RESEARCH QUESTIONS ............................................................................................. 87 CHAPTER 3: METHODOLOGY .................................................................................... 89 POPULATION AND SAMPLING ................................................................................... 93 Participants .................................................................................................... 93 Recruitment, consent, and non-adherence to condition ................................. 95 Random assignment ....................................................................................... 99 DESIGN, METHODS, AND PROCEDURES .................................................................... 99 2x5 Mixed Factorial Design .......................................................................... 99 Operationalization of reactance theory constructs into variables ................ 101 How literature informed the study design; Study‘s contribution to literature ............................................................................................................ 104 How the literature informs methods chosen for the design ......................... 105 How the present study extends message effects research methods ............. 112 How the interaction between qualitative and quantitative analysis informed each other in this methodological symbiosis. .................................... 115 Methods ....................................................................................................... 116 xiii

Procedures.................................................................................................... 133 INSTRUMENTS AND DATA SOURCES ....................................................................... 146 Instrument: The questionnaire ..................................................................... 147 Questionnaire reliability and validity .......................................................... 148 Secondary data source ................................................................................. 149 Public arrest records .................................................................................... 149 THE VARIABLES ..................................................................................................... 149 Covariate operationalizations and measure of constructs ............................ 149 Independent variable operationalizations and measures of theoretical constructs ........................................................................................... 150 Identification of reactance constructs in VIP transcripts ............................. 151 Dependent variable operationalization and measure of reactance outcomes ............................................................................................................ 152 THE DATASETS ....................................................................................................... 155 Priors separate from no priors...................................................................... 155 Censored cases separate from non-censored cases ...................................... 155 SOFTWARE AND ITS USE ......................................................................................... 155 QSR N6 ........................................................................................................ 155 SPSS ............................................................................................................ 157 Microsoft Office Excel ................................................................................ 157 METHODS LIMITATIONS ......................................................................................... 157 Under-identification of prior offenders ....................................................... 157 Attrition due to deaths ................................................................................. 158 xiv

Nonrepresentative sample ............................................................................ 159 Bimodal distribution of independent variables indicate conversion to dichotomous variables ........................................................................ 160 Variable categorization increased power ..................................................... 160 CHAPTER 3 SUMMARY............................................................................................ 162 CHAPTER 4: RESULTS ................................................................................................ 164 STATISTICAL TESTS CONDUCTED FOR EACH RESEARCH QUESTION ....................... 164 Necessity of splitting data into levels of prior arrests.................................. 165 CALCULATION OF DEPENDENT VARIABLES ............................................................ 169 Time to recidivism ....................................................................................... 169 Number of subsequent arrests ...................................................................... 169 Emotional change scores ............................................................................. 170 IDENTIFICATION AND REMOVAL OF OUTLIERS ....................................................... 171 General considerations regarding outliers ................................................... 171 Identification of outliers in the present data ................................................ 172 RESEARCH QUESTIONS AND RESULTS .................................................................... 174 1.

At what levels are reactance antecedents present in MADD VIP presentations? ..................................................................................... 174

2.

Do the 15 different MADD VIP presentations have different reactance message dosages? ............................................................................... 178

3.

Does the reactance message dosage (level of reactance-inducing statements and proportion of reactance-inducing statements) predict

xv

direction of emotional change score in the MADD VIP plus DWI School intervention group? ................................................................ 182 4.

Does the reactance message dosage predict survival time to first recidivism within the MADD VIP plus DWI School intervention group, while controlling for covariates age, gender, and number of priors?. 182

5.

Does the reactance message dosage predict number of subsequent arrests within group for the MADD VIP plus DWI School intervention group, while controlling for covariates age, gender, and number of priors?................................................................................................. 187

6.

Are there different predictor variables for recidivism for those study participants with DWI arrests before the study (who arguably believe they have the freedom, a reactance theory assumption, to drink and drive) versus those participants with no prior arrests? ....................... 187

7.

What are the demographic covariates that predict positive or negative message effects of MADD VIPs? ...................................................... 190

8.

Are MADD VIP messages effective in terms of lengthening time to recidivism and reducing number of subsequent arrests?.................... 195

CHAPTER 4 SUMMARY............................................................................................ 204 CHAPTER 5: DISCUSSION......................................................................................... 208 IMPLICATIONS FOR DWI INTERVENTION DESIGN ................................................... 212 Interaction effect between age and message type ........................................ 212 Life cycles of intervention message effects and effect sizes ....................... 214 Identification and different treatments for repeat offenders ........................ 219 xvi

Pre-DWI interventions may be more effective than post-DWI interventions ............................................................................................................ 221 IMPLICATIONS FOR PERSUASIVE MESSAGE DESIGN................................................ 222 Effect of message intensity .......................................................................... 224 Effect of pathos versus fear appeals ............................................................ 225 The effect of message strength .................................................................... 226 The effect of forewarning and confrontation ............................................... 227 The effect of public censure and anger ........................................................ 228 THE IMPORT OF CONSIDERING MESSAGE RECEIVER FUNCTION IN MESSAGE DESIGN ..................................................................................................................... 230 IMPLICATIONS FOR MESSAGE EFFECTS STUDY DESIGN .......................................... 233 Incorporation of time dependence into a message effects study design ...... 236 Benefits of incorporating mixed methods in message effects study designs236 Standardization of message effects terms and definitions, internal validity 237 IMPLICATIONS FOR TESTS OF THEORETICAL MESSAGE CONSTRUCTS..................... 238 A test of reactance theory antecedents ......................................................... 239 A test of message effects theory that includes specific message types ....... 240 VALIDATION OF GENERAL VARIABLES IN MESSAGE EFFECTS RESEARCH .............. 240 Use of hard end-point data to measure message theory constructs ............. 241 LIMITATIONS .......................................................................................................... 241 Under-identification of prior offenders ....................................................... 242 Levels of independent variables were observed, not manipulated .............. 242 Marginal sample size for prior offenders..................................................... 243 xvii

Nonrepresentative sample ............................................................................ 244 Do intervening factors bias DWI demographics? ........................................ 245 FUTURE RESEARCH ................................................................................................ 246 Mixed effects experimental study with random selection ........................... 246 Larger sample size for prior arrest participants ........................................... 247 Matching victims and offenders by age and gender .................................... 248 Investigation of causes of deaths among participants.................................. 248 Proposal for analysis of data using Repeated Measures Cox Regression ... 249 SUMMARY .............................................................................................................. 249 REFERENCES ............................................................................................................... 251 APPENDIX 1: PRE-POST MADD VIP INSTRUMENT .............................................. 272 APPENDIX 2: VIP INTENSITY OF REACTANCE-INDUCING STATEMENTS HISTOGRAMS OF STATEMENTS‘ INTENSITY BY FREQUENCY ............. 278

xviii

LIST OF FIGURES Figure 1-1: DAMM: Drunks Against Mad Mothers t-shirt. ............................................... 7 Figure 3-1: Screen shot of Excel spreadsheet from which coders coded MADD VIP transcripts. ............................................................................................................. 140 Figure 3-2: Flow chart of order of procedures. ............................................................... 146 Figure 4-1: No priors versus priors: Survival function. .................................................. 167 Figure 4-2: VIP Level of Reactance-inducing Statements by VIP Intervention Date. ... 177 Figure 4-3: Histograms for VIP 13 (low reactance-inducing VIP) and VIP15 (highreactance VIP). ...................................................................................................... 180 Figure 4-4: Effect of MADD VIP reactance-inducing level upon categories of offenders. ............................................................................................................................... 184 Figure 4-5: Effect of MADD VIP level of reactance-inducing statements upon offenders with priors. ............................................................................................................ 203

xix

LIST OF TABLES Table 2-1: Drinking as a function of age. ......................................................................... 34 Table 2-2: Reduction in High-BAC (0.10+) Drivers in Fatal Crashes by Sex, 1982-1998. ................................................................................................................................. 35 Table 3-1: 2x5 Mixed Factorial Design .......................................................................... 100 Table 3-2: Example of Units of Analysis Coding from Codebook ................................ 120 Table 3-3: Guidelines for Interpretation of Logistic Regression Coefficient ................. 124 Table 3-4: Set of eight ordinal reactance intensity codes used to code the 2,021 statements by 56 presenters in 15 MADD VIPs. .................................................. 137 Table 4-1: Priors: Covariates in the Cox Regression Equation ...................................... 168 Table 4-2: Set of eight ordinal codes used to code the 2,021 statements by 56 presenters in 15 MADD VIPs................................................................................................. 176 Table 4-3: Raw scores and transformed values for reactance-inducing level and reactance. ............................................................................................................................... 181 Table 4-4: Categorical Variable Codings for Cox PH Regression ................................. 189 Table 4-5: Variables in the Cox PH Regression Equation .............................................. 189 Table 4-6: All Cases: Number of Participants in Demographic Categories ................... 191 Table 4-7: Summary of Demographic Risk Factors that Influence Time to Recidivism 192 Table 4-8: Summary of MADD Message Effects........................................................... 196

xx

PREFACE I am drawn to this research both personally and professionally. I am personally drawn to this research because my brother-in-law died a victim of a drunk driver. Further, my father died of alcoholism after four DWIs. I understand the strong undertow that alcoholism can have on recidivism and the effects upon both the victim and the offender. I am professionally drawn to this project because, after a marketing career practicing persuasive communication, I now have the rare opportunity to analyze message effects in a randomized experimental context. I base my study on data collected in an original experiment designed by Rogers, Woodall, Rao, Polacsek, and Milan (1994) (n = 833). This is the first study to examine MADD VIP in terms of message effects. It is the first study to conduct content analysis of the MADD VIP presentation messages. It is also the first study to link VIP message effects variables to long-range outcomes. The data collected on MADD VIP messages and the experimental design by Woodall et al. offer a rare opportunity to study 12-year post recidivism data MADD VIP participants. Such hard outcome data is rarely if ever available to test message effects. Further, the randomized design provides a rare opportunity to study message effects on recidivism regardless of the degree of alcoholism in the offender. The study design controls for participant level of alcoholism through random assignment to group conditions.

xxi

CHAPTER 1: INTRODUCTION Will a heavy drinker, having a good time out on the town, get into his car and drive drunk? Alternatively, will he remember a gory slide show featuring drunk-driving victims that he saw at a Mothers Against Drunk Driving (MADD) Victim Impact Panel (VIP)? Will his memory of DWI (driving while impaired) victims‘ angst and emotional admonitions stop him from driving drunk? During a MADD VIP intervention, which is a one-time and typically a one-hourlong intervention, victims of drunk drivers and their families project photos of accidents, gore, and before and after photos disabled or dead loved ones. Victims, often crying, share their tragic stories and plead with DWI offenders to stop drinking and driving. MADD hopes the DWI offender remembers the MADD VIP experience, and hopes that he or she, when drinking, asks a sober person for a ride. Judges rely on MADD VIP efficacy. Judges have thus been mandating thousands of DWI offenders to MADD VIP interventions each week for the past 26 years3. Judges hope that MADD VIPs will reduce drunk driving. But research on efficacy of MADD VIP interventions is mixed and inconclusive. Results are inconclusive due to their quasi-experimental nature. There is a need for empirical studies on MADD VIP message effects, conclusive studies where DWI offenders are randomized to intervention and control groups. This dissertation comprises a twelve-year continuation and expansion of a randomized study (n = 833) conducted by Woodall, Delaney, Rogers, and Wheeler (2007). The purpose of this investigation is to extend the Woodall et al. study by

3

Number of participants in MADD VIPs is not known. The absence of a centralized MADD data system results in research notes that data is ―n ot readily available‖ (NHTSA, 2008).

1

conducting the first MADD VIP message effects study. What is the effect of the MADD VIP message and what observations are associated with that effect? This study investigates whether psychological reactance, a state of anger and negative cognitions (Quick & Stephenson, 2007), explains the contrary behavior of higher DWI recidivism4 (Delaney, Kunitz, Zhao, Woodall, Westerberg, Rogers, & Wheeler, 2005) among drunk drivers who were exposed to the MADD VIP message. Reactance is a mediating state between the act of receiving a message and contrary behavior. Reactance research is a subcategory of message effects research. Reactance antecedents are those precursors, such as reactance-inducing statements, that precede a state of psychological reactance. Reactance antecedents have been hypothesized and observed (Brehm, 1966) as producing a state of psychological reactance, a causal influence, upon the message effect: a negative reactance behavior. The state of psychological reactance is therefore an intermediating state between any number of reactance antecedents, as defined by Brehm, and the negative contrary behavior. The present study measures the latent variable, intermediating reactance, as a quantity of the relationship between intensity of reactance-inducing statements for 15 VIPs and levels of negative outcome behavior as measured by recidivisms among participants who attended those 15 VIPs. Intensity of reactance antecedents is operationalized5 as message attributes that reside within VIP messages. This research

4

Recidivism is the return to a previous behavior, in this case the return to driving drunk. Theoretical constructs are defined through the operations that scientists use to measure their instances in the observable world. Percy Williams Bridgman articulated operationalization as the method of translating abstract theoretical constructs into variables that can be measured, quantified, and analyzed. A scientist should explain how (s)he is operationalizing theory into variables. One‘s verbal acuity in operationalizing constructs into variables should be sufficiently transparent, logical, and repeatable by others (Everett M. Rogers, personal communication, August 27, 2001). 5

2

investigates the relationship between levels of intensity of reactance antecedents6 (levels of mild to strong threat and anger in the VIP message) and levels of negative outcomes (levels of recidivism among receivers of the VIP message. These variables were suggested by Janice Schuetz and Gill Woodall (personal communication, April 18, 2007) and previous researchers who designed and analyzed the original MADD VIP study (Delaney, Kunitz, Zhao, Woodall, Westerberg, Rogers, & Wheeler, 2005; Kunitz, Zhao, Wheeler, & Woodall, 2006). In the present study the earlier researchers‘ suggestions for future research, VIP messages as antecedents and recidivism as their outcomes, are observed and quantified. The outcome quantifications of 15 VIP interventions, each VIP‘s participant levels of DWI recidivism over a 12-year period, are regressed upon levels of reactance antecedents present in those same 15 VIPs. According to reactance theory, a message receiver‘s level of reactance is measured in terms of the degree to which they react through contrary behavioral outcomes: how contrary are their outcome behaviors in relation to the message they have received? Reactance theory explains why VIP audiences might behave contrary to the anti-drinking-driving messages they have received. If a message receiver believes (1) that he or she has the freedom to drive drunk, and (2) that this freedom to drink and drive is threatened by public censure and threats of legal and social punishment during MADD VIPs, then (3) the message receiver will drink and drive more often after receiving the MADD VIP message. If VIPs prompt increased reactance, then those who exercise the freedom to drive drunk more often should have received higher doses of VIP reactanceinducing statements. If higher levels of reactance outcomes (shorter time to recidivism 6

Antecedents are precursors, which come before, and in some instances can be predicted as causal of an outcome.

3

and greater numbers of rearrests) are observed within the MADD VIP plus DWI School intervention group versus lower levels of reactance outcomes within the DWI School Only comparison group, then an increase in levels of reactance-inducing statements and increases in drunk driving behavior are related. This first chapter of the dissertation (1) presents the problem statement and provides background information on the import and context of the MADD VIP efficacy problem. (2) The purpose and rationale section discusses how the present experimental design is the best approach to research the problem. (3) The theoretical import, framework and scope section discusses this study‘s theoretical implications for reactance theory. Problem Statement: Background of the MADD VIP Message Effects Problem The efficacy of an intervention refers to its capacity to produce a desired result, such as rehabilitation or deterrence of future instances of the undesired behavior. The standard for measuring efficacy of an intervention is measurement of a treatment and control group, contrasting the two, where the only difference between groups is the intervention. The assumption is that all members of the treatment group receive the same intervention. The validity of such an assumption is questioned in this study. This study investigates whether the interventions received by MADD VIP participants were of the same quality. Such a qualitative measure depends upon analysis of the intervention at a finer level of scale—the components within the intervention. Within the intervention, the present study investigates the intervention message to determine whether all instances of the VIP are the same type of intervention. Are all VIPs actually delivering the same uniform message? If significantly different types of intervention messages were 4

administered to people in the treatment group, then these differences might confound the determination of the intervention‘s efficacy. The intervention treatment, not being uniform, would have different effects that should not be measured together as if they were the same effect. Clearly, instances of interventions need to be analyzed to determine whether their messages are uniform or not. This question has never been addressed with regard to MADD VIP interventions. This particular deficit in the research may partially explain why MADD VIP efficacy research has been mixed and inconclusive. There may be another explanation for why MADD VIP efficacy research has been mixed and inconclusive. As indicated earlier in this chapter, quasi-experimental studies of MADD VIP efficacy cannot yield clear results because they do not employ the treatment/control design or participants are not randomly assigned to groups. There is no comparison group, or the groups could contain different types of people. Quasiexperimental designs present a problem: one cannot draw a reliable conclusion. The unreliable nature of quasi-experimental studies may explain why some find support for the efficacy of MADD VIPs, and others do not. Despite a lack of evidence on MADD VIP efficacy, public policymakers, judges, and many public health workers have expressed confidence that MADD VIPs are effective. Traffic safety experts have published reports advocating confidence that MADD VIPs are effective. Is confidence in MADD VIP efficacy warranted? If MADD VIPs have no effect or worse, if MADD VIPs increase drunk driving, then they increase an array of related expensive and devastating social problems (C‘de Baca, Lapham, Liang, & Skipper, 2001). In New Mexico, social programs that care for 5

drunk driving victims, combined with other programs that attempt to mitigate the farreaching effects of drunk driving, drain state revenues at the rate of $600 per year for every man, woman, and child in the state. It is remarkable that MADD VIPs have not been studied in terms of their message effects. Without a message effects study of MADD VIPs, public policy makers and courts cannot determine whether sentencing DWI offenders to MADD VIPs increases or decreases drunk driving, or whether MADD VIPs have any effect at all. Prevalence and significance of the repeat drunk-driving problem Statistics from the National Highway Traffic Safety Administration (NHTSA, 2001) indicate, ― 30% of Americans will be involved in an alcohol-related crash at some point in their lives‖ (NHTSA, 1999; NTSA, 2001a, 2001b, 2005, 2007; Rojek, Coverdill, & Stuart, 2003). In 2002, 41% of U.S. fatal crashes involved a drunk driver (Brunson & Knighten, 2004). Significantly, drunk drivers do not usually consider the impact of their behavior upon others. In fact, drunk drivers are more likely to exhibit antisocial behaviors. They exhibit poor interpersonal relationships and aggression, both when preparing to drive drunk and when behind the wheel (Beirness & Simpson, 1997; Eby, 1995; Snow, 1996a, 1996b; Veneziano & Veneziano, 1992 As discussed in the review of literature, substance abusers demonstrate high levels of sociopathy or anti-social behavior, and this population is highly reactive to confrontational interventions. Confrontation induces substance abusers to react negatively to appeals to stop drinking and driving, according to Miller (2000), and Miller, Benefield, and Tonigan (1993). According to Brehm (1966), substance abusers perceive ― anti-substance-abuse messages‖ as threatening to their freedom to perform pleasurable 6

self-serving acts. Instead of reducing a substance abuse behavior, an anti-substance-abuse message is more likely to increase abuse, a reactance behavior. Brehm would categorize MADD‘s message ― MADDer than hell‖ (Mac, 1996; MADD, 2002) as a threatening and reactance inducing message. Reactance theory may explain the seemingly negative reactance to MADD‘s message by members of ― DAMM: Drunks Against Mad Mothers,‖ who wear t-shirts bearing that message. One interpretation of the anti-MADD t-shirt wearing behavior is that it could be a sign of anti-MADD negative reactance among drinkers, a ― pushing back‖ message directed at MADD that reinforces drinkers‘ sense of personal freedom to drink (Room, 1989).

Figure 1-1: DAMM: Drunks Against Mad Mothers t-shirt. Brehm (1966), author of reactance theory, would explain the wearing of the DAMM t-shirt as an expression of negative reactance to the MADD campaign to reduce drunk driving.

Repeat drunk drivers are 4.5 times more likely to be involved in a fatal crash than intoxicated drivers with no priors (no prior DWI arrests). Repeat drunk drivers cause 3080% of alcohol-related fatal crashes (Brunson & Knighten, 2004; Fell, 1995; Jones &

7

Lacey, 2000; NHTSA, 1992, 2006; Peck, Arstein-Kerslake, & Helander, 1994). Researchers in the state of Louisiana determined that once a driver has one DWI they are 50% more likely to be involved in any kind of crash, alcohol related or not (Gould & Gould, 1992). Punishing repeat offenders with sanctions that revoke or suspend driver licenses appears to have little or no effect. At the time of their arrest, half of repeat offenders are discovered driving with no license, a revoked license, or a suspended license (Beirness & Simpson, 1997; Eby, 1995; Snow, 1996a, 1996b; Veneziano & Veneziano, 1992). The likelihood of driving with a suspended license increases as number of arrests increase. In one study, 32% of offenders with 2 priors and 61% of offenders with 3 priors who were cited for traffic violations were found to be driving with suspended licenses (Brunson & Knighten, 2004). Repeat offenders voluntarily report that they are not affected by sanctions such as license suspension (Freeman et al., 2006). Since sanctions do not change their behavior, why would threats of sanctions at MAD VIPs change their drunk driving behavior? Are MADD VIP messages, messages that rely heavily on appeals to offender altruism and threats of sanctions, effective in deterring repeat DWI offenders? Economic impact of drunk-driving behavior Drunk driving exacts a high death and injury cost on New Mexico. Alcohol abuse accounts for 40% of vehicle deaths in New Mexico (National Highway Traffic Safety Administration, 2002, 2004, 2005). The total economic impact for drunk driving in New Mexico is roughly $1.2 billion annually. Each year, alcohol-related crashes in the United States cost between $45-51 billion (Blincoe et al., 2002; Polacsek et al., 2001). Alcohol-related crashes cost 8

employers $55 billion in 1994 (Network of Employers for Traffic Safety, 1994). The United States economy shoulders the cost of an estimated $166 billion annually because of the costs of alcohol dependence (US Health and Human Services, 2007). New Zealand researchers estimated the cost of drunk driving to society equals $0.75 for each drink consumed yet typically DWI offenders pay for only half the costs of their crashes (Miller & Blewden, 2001). The balance of the DWI debt lands upon government. Because of the weighty economic impact on public and private resources, the public has called for stronger sanctions against repeat DWI offenders. Courts mandate thousands to MADD VIPs without evidence of efficacy Trusting a prevailing belief that MADD VIPs are effective, judges mandate drunk drivers to attend MADD VIPs. Judges believe that VIPs will deter future DWI offenses (C‘de Baca, Lapham, Paine, & Skipper, 2000). MADD considers the VIP a ― healing opportunity‖ for offenders (Mercer, Lorden & Haris Lord, 1999). According to MADD (Fell & Voas, 2006). MADD considers VIPs successful if they attain the following goals (Lord, 1990): The goal of the VIP is to influence DWI offenders on an emotional level to change their attitudes about drunk driving, thus reducing the likelihood of recidivism. This is accomplished in four ways, by: (1) exposing offenders to the consequences of drinking and driving; (2) helping offenders move beyond bad luck‖; (3) serving as a first step in breaking down the focusing on their own ― denial of alcoholics/drug addicts; and (4) imprinting images of real people in the offenders‘ minds that may replay when he or she considers drinking and driving. (p. 1421) 9

Implied in the goal of the VIP is that offenders will switch their orientation from selfcentered pleasure and freedom-seeking to being ― other-centered‖ after exposure to a dosage of MADD confrontations. Also implied in the goal of the VIP is that rational thinking occurs in the mind of a drunk. However, according to researchers Wiliszowski, Murphy, Jones, and Lacey (1996), it is not valid to assume, as MADD does, that drunk drivers are rational. Repeat offenders (n = 182) from Arizona, Colorado, and Pennsylvania, participated in a study at approximately the same time period as the present study. Wiliszowski et al. report that these drunk drivers demonstrated lack of forethought and lack of rationality surrounding their decision to drive drunk:

Reasons For Driving After Drinking

% of Responses

Thought he/she was OK to drive

32.2

Just did not think about it

21.0

Lacks control over him/herself after drinking

18.6

No one available to drive for him/her

14.4

Would be OK if careful (to avoid accident/arrest)

13.8

Another study found a negative correlation between drunk drivers‘ attitudes and safe behaviors such as calling a friend for a ride or taking a taxi (Turrisi & Jaccard, 1992). Repeat offenders are usually arrested after a drinking episode at a restaurant or bar, often justifying their drunk driving behavior as a matter of convenience, or ― just not thinking about it‖ (Beirness & Simpson, 1997; Eby, 1995; Snow, 1996a, 1996b;

10

Veneziano & Veneziano, 1992). Drunk drivers‘ reasons for driving drunk demonstrate a lack of judgment, lack of forethought, and impulsiveness that researchers have referred to as alcohol myopia (Steele & Josephs, 1990). Not surprisingly, repeat DWI offenders typically share the following characteristics. They demonstrate relatively irresponsible, impulsive, and sensation seeking behavior (MacDonald, Zanna, & Fong, 1966; Mayhew & Simpson, 1991; McMillen, Pang, Wells-Parker, & Anderson, 1992; McMillen, Adams, Wells-Parker, Pang & Anderson, 1992). Cavaiola, Strohmetz, Wolf, and Lavender (2003) found that DWI offenders scored significantly higher on the K, Psychopathic Deviate (Pd) Scale, Over-Controlled Hostility (O-H) Scale, and MacAndrews Alcoholism Scale—Revised (MAC-R) than a non-offender comparison group. Multiple offenders scored significantly higher than first offenders, and both scored significantly higher than the nonoffenders. Given these characteristics, the repeat offender is not likely to respond responsibly to moral appeals to curb their sensation seeking or impulsiveness. Their alcohol myopia would, in terms of the MADD perspective, translate into a moral myopia. According to reactance theory, the repeat offender is not likely to respond rationally to fear appeals if those fear appeals are threatening or confrontational, two antecedents of reactance. In fact, if reactance is induced through threat or confrontation then reactance theory predicts that the repeat offender is more likely to increase drunk driving (Brehm, 1966; USA Today, 1992). Given the foregoing, it is unlikely that repeat offenders will respond to appeals to be more responsible to society. Instead, to drown out MADD‘s imprinting of victim images in their minds, they are more likely to react by drinking more. 11

MADD considers its confrontation and threats successful if it imprints images of DWI victims in the offenders‘ minds. MADD believes this victim imprint will stop drunks from driving by increasing their awareness of probability of being in an accident and hurting a victim after choosing to drive drunk. MADD‘s reasoning assumes the drunk has a sense of responsibility, full reasoning capacity, thinks about risk of having an accident or being caught, and values safety of unknown other people over sensation seeking and the personal convenience of driving drunk. However, research on attitudes of drunk drivers indicates that drunk drivers‘ perceptions of their risk of arrest, and their perceptions of the probability of an accident while driving drunk, only persist for a short time following an educational message about the risk and negative outcomes of drunk driving (Turrisi & Jaccard, 1992). Do MADD VIPs succeed in their goals of reducing drunk driving? Research is inconclusive. Some researchers have found evidence that MADD VIPs do work (Badovinac, 1994; Fors and Rojeck, 1999; O‘Laughlin, 1990; Rojek, Coverdill, & Fors, 2003; Sprang, 1997). Other researchers have found evidence against the efficacy of MADD VIPs (C‘de Baca, Lapham, Liang, & Skipper, 2001; C'deBaca, Lapham, Paine, & Skipper, 2000; Marin and Marin, 1991; Theriot, 2006; Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, & May, 2008). Shinar and Compton (1995) found mixed results within the same study. VIPs appeared to have an impact in one panel but not in another. To date there is no conclusive evidence on the effectiveness of MADD VIP messages. This is remarkable since, believing MADD VIPs are effective, courts mandate thousands of DWI offenders to MADD VIP interventions each year.

12

Methodological problems involving previous research There has been no direct research on MADD VIP message effects that regresses message effects onto message content, or that determines whether all VIPs deliver the same type of intervention message. Additionally, researchers have not evaluated MADD VIP messages for presence of antecedents to reactance. This study evaluates MADD VIP messages for presence of antecedents to reactance. This study‘s measurement of presence and levels of antecedents to reactance, the causal precursors of contrary behavior as explained by reactance theory, enables a theoretical framework, a framework of reactance theory that undergirds this study. Reactance antecedents present in MADD VIP content might explain cases where MADD VIPs increased drunk driving levels among participants, as Woodall et al. 2-year follow-up study has suggested. There have been other serious problems with previous MADD research. Other than the Woodall research, previous MADD VIP researchers have limited their research to quasi-experimental designs, as discussed previously. How quasi-experimental designs fail to meet assumptions of statistical tests is discussed more closely here. In quasiexperimental designs the differences in the outcome variable cannot be attributed to a causal variable. Random assignment, which results in normal and equal distribution of traits among groups, is necessary to meet important statistical test criteria. Random assignment to group condition, and inclusion of a control group, are assumptions that must be met in order for statistical tests to draw logical and compelling inferences from the data (Maxwell & Delaney, 2004). For example, because in past research offenders have not been randomly assigned to group condition, the demographics of those who are mandated to VIPs versus other 13

interventions have a certain probability of being skewed. As a case in point, C‘de Baca et al. (2001) found that judges were least likely to refer low-educated minority men to MADD VIPs. Judges were more likely to refer unmarried white offenders. Drinking cultures may be different for different demographics, thereby further skewing any quasiexperimental results. C‘de Baca et al. (2001) also found that MADD VIP referral ―did not increase recidivism rates but lowered them marginally or not at all‖ (p. 1420). Was this marginal or neutral effect an effect of the drinking culture of unmarried white offenders, rather than an effect of MADD VIP? C‘de Baca conceded that her finding might be a result of judges‘ biased group assignment. She and her coauthors stressed the need for an experimental, randomized design, such as this study, to measure MADD VIP efficacy. Of particular interest in the present research is whether the VIP message is consistent in its level of intensity and strength. If the MADD message consists of varying levels of intensity and strength, then do those levels make a difference in how the VIP message works and how it should analyzed? Are differences in message strength statistically significant? Is the VIP indeed only one type of intervention or do VIPs vary in the type of intervention they administer? Does analyzing the VIP as a message-based intervention offer insights on whether VIPs are one or many types of intervention and can a message-based analysis determine VIP efficacy? Purpose and Rationale The purpose of the present study is to analyze MADD VIP message effects to determine MADD VIP intervention efficacy. This study also proposes to test the appropriateness of reactance theory in explaining MADD VIP message effects. It 14

evaluates whether insights gained from this study can contribute to the body of reactance literature, a subset of message effects literature. This study researches whether reactance can at least partially explain MADD VIP participants‘ trend toward increased recidivism in the Woodall 2-year post study. At the intervention level of scale, this study examines whether at twelve-years post the original study (Rogers, Woodall, Rao, Polacsek, & Milan, 1994), the MADD VIP participants display significantly more DWI recidivisms than their DWI School comparison group. The research questions presented at the end of chapter 2 specify the details of this investigation, but painting with a broad-brush stroke, the aims of this study are as follows: 

To investigate whether antecedents to reactance are present in the VIP messages.



To identify levels of message reactance ― dosages‖ for each VIP.



To regress twelve years of VIP participants‘ and controls‘ DWI recidivism records onto VIP message dosages.



To investigate whether reactance theory can explain the anti-VIP effect, which trended toward significance in the Woodall et al. (2007) study at 2-years post intervention.

The present experimental design, because it is a randomized trial, offers the best possible problem space for concluding whether MADD messages have a positive or negative effect on drunk driver recidivism. Theoretical import, framework, and scope Theoretical import 15

This study contributes to message effects theory by operationalizing theoretical message effects constructs, reactance antecedents (Brehm, 1966), as specific message types, while considering the relationship between mean message type and hard behavioral outcome data. This study‘s hard outcome data approach contrasts with use of less reliable soft outcome measures, often participant self-report data, employed in some DWI studies to test theoretical constructs in the DWI literature (Gosling, John, Craik, & Robins, 1998). The theoretical literature on message effects lacks empirical support from research designs that test specific message types (Brashers & Jackson, 1999; Jackson, 1992). Previous research has not clearly set forth the operationalization of theoretical constructs as variables or specific message types. Thus, previous research has not linked theoretical message effects constructs to in vivo (real world) messages, provided taxonomy of reactance message types, or linked message types and levels of message strength to real world behavioral outcomes. This study provides operationalization of reactance constructs as a variables with each reactance construct represented by an archetypal in vivo message, provides a taxonomy of reactance message types, and links these message types and their levels of message strength, or intensity, to real world behavioral outcomes. The VIP efficacy literature may be characterized as lacking reliably operationalized message effects constructs, and generally lacking in reliable outcome measures. In this study, the operationalization of reactance theory constructs are articulated and validated via two methods, both methods offering a high level of

16

empirical reliability with corroboration between both methods of operationalization. These methods are overviewed in the next section on theoretical framework. Theoretical framework The present study draws its theoretical framework from the foundational message effects research of Shannon and Weaver (1963). Shannon and Weavers‘ sender-messagereceiver constructs are drawn upon to organize message variables into three main categories: 1) message sender variables, 2) message-related variables, and 3) message receiver variables. Nested within those three general variable categories, the reactance antecedent constructs of reactance theory (Brehm, 1966) are distributed. Reactance antecedent constructs are clustered according to whether they are sender, message, or receiver constructs. Within Shannon and Weavers‘ categories the study operationalizes and tests accuracy of operationalization of reactance theory constructs. This study uses two methods to operationalize and test operationalization of reactance theory constructs. In the first method of operationalization of theoretical constructs into variables, the reactance antecedent constructs are operationalized by translating their theoretical definitions into a message type variable. The message type variable consisted of categories of message types into which in vivo, real world, messages were assigned, by independent coders. The coders‘ highly consistent identification of the face value of VIP messages in terms of reactance theory-defined categories was validated by a high level of inter-rater reliability. The high level of inter-rater reliability among independent coders, who employed coding definitions designed from reactance theory constructs, validated the consistency of the operationalization of reactance theory constructs in terms of VIP message types. 17

In the second method of operationalization of reactance theory constructs into VIP message types, VIP message types were matched with laboratory-created messages (Brehm & Cohen, 1962; Brehm & Cole, 1966; Brehm, 1966, 1972; Buller et al., 2000; Campo & Cameron, 2006; Dillard & Shen, 2005; Dillard & Shen, 2005;Dillard and Shen, 2005; Engs & Hanson, 1989; Festinger & Carlsmith, 1959; Festinger, 1957; Freedman, 1965; Goranson & Berkowitz, 1966; Hollander, 1971; Hong & Faeda, 1996; Miller et al., 2006; Quick & Stephenson, 2004; Quick & Stephenson, 2007; Quick, 2003). These laboratory-tested messages had been empirically tested for their levels of reactance inducement. The laboratory messages had also been designed to operationalize theoretical definitions of reactance antecedents with message types. Both VIP operationalizations and laboratory operationalizations of message types matched each other, and the laboratory message types had been empirically validated as operationalizations of reactance theory constructs. The present findings determined that in vivo archetypes of these laboratory messages that had been created to match reactance theory constructs, again induced the levels of reactance that the theory predicted. The process of testing accuracy of operationalization of the reactance theory constructs is complex. The process is complex due to the progression of tests of the data against theoretical constructs at three different levels of scale. At each level of scale, the data are tested to insure that the only difference between variables in the final test, at the final level of scale, is indeed the level of presence and level of intensity of reactance constructs. Thus, the accuracy at which the theoretical constructs were operationalized into VIP message variables is determined in an articulated three-step process, each step

18

executed at a different level of scale for the data. This multiscale operationalization process is described in chapter 3: Methodology. Theoretical scope Reactance theory, disaggregated into its 8 component reactance antecedent constructs, is the main theory tested in this dissertation. Additionally, the dissertation refers to supporting theories that inform and deepen the understanding of general message theory constructs and specific reactance theory constructs. These supporting theories illuminate a discussion of the theoretical implications of the results in Chapter 5. In addition to reactance theory (Brehm, 1966), the following theories provide additional theoretical scope to this investigation: cognitive dissonance theory (Festinger, 1957; Festinger & Carlsmith, 1959), message-context theories (Miller, 2002; vanDijk, 2008), grounded theory (Strauss & Corbin, 1990; Thomas & James, 2006), message function theory (Jeong, 2004), theories about influences of message intensity (Aristotle, 2006; Dillard & Shen, 2005), theories about how audience pathos reduces reactance (Aristotle, 2006; Burke, 1965; Corbett, 1984), and the theory of reasoned action (Fishbein & Ajzen, 1975). Chapter 1 Summary Drunk driving incidents, including crashes, are self-reported at least once for every licensed driver in the U.S. Concerning crashes alone, one out of every three adults will be involved in a drunk driving crash during their lifetime. Of those drunk driving crashes, one out of four will result in death. New Mexico spends $600 every year for every man, woman, and child in the state to cover the social costs of drunk driving. The drunk driving problem is both prevalent and significant in this state. 19

The question of whether MADD VIPs are effective then becomes an important question, not only from the standpoint of individual citizen safety but also from an economic standpoint. If MADD VIPs are effective, then the program should be encouraged. If MADD VIPs are ineffective, then society would be better served by redirecting DWI intervention resources to interventions that are more effective. The primary purpose of this study is to analyze MADD VIP efficacy in a twelveyear randomized field study. This study will evaluate if MADD VIPs are effective and if not, why not, in order to inform future DWI intervention program designs. Does reactance theory explain MADD VIP message effects? What other persuasion theories can inform this study on MADD VIP message effects? This study possesses the secondary goal of contributing to the literature on message effects studies. It contributes a real life application of empirical certainty to a body of theory and investigations that, according to analysts (Jackson, O'Keefe, Jacobs, & Brashers, 1989), is over laden with quasi-experimental and inconclusive designs, an underrepresentation of practical examples of specific message types and detailed analysis of under what conditions a message type is or is not persuasive. This study offers insights into measuring treatment or intervention efficacy. In the literature, manipulation checks are sometimes employed, though not always, to check the researcher‘s expectation of the accuracy of an operationalization against the subject‘s perception of the meaning of the operationalization. Manipulation checks can work to identify subgroups that would respond differently and even oppositely to the same treatment or manipulation. Researchers use manipulation checks not only to check the

20

accuracy of their operationalizations, or parameter checks, but also to determine whether their sample has come from the same or different populations. Sometimes, however, a manipulation check is not possible. In the collection of in vivo field data, such as the MADD VIP data collection process, the researcher does not manipulate the message source. The message source delivers the message in a natural manner. Whereas field research is valuable because the environment is natural, yet it is often considered that manipulation checks of field variables are not possible. Where field experiment results are generalizeable to natural situations in the real world, yet because the field is difficult to isolate from confound influences, often a laboratory-generated experiment with controls and manipulation checks, despite limited generalizeability, is considered to be a cleaner science. This study advances a method to conduct manipulation checks on field research involving message variables. It offers a useful message check procedure, introduced here. Like any type of experiment, field research is a test of a theory and needs some type of check for accuracy of theoretical construct operationalizations. A limitation of field experiments is that operationalization of constructs into variables, where variables contain data from natural sources, may be confounded by different levels of treatment (different theoretical constructs operating) through different sources. In this case, a message analysis, if the treatment is message-dependent, is useful to determine whether all message sources are delivering the same treatment (and same theoretical operationalization) or different treatments and operationalizations. This may be accomplished by assigning levels of the variable strength, including zero strength where a trait or treatment is absent. This method is used in genetic trait analysis with datasets 21

containing missing data, a method adapted for the present study from Kelly and Di Marzo Serugendo (2009). The method is discussed more fully in chapter 3, Methodology. Communication scholars specialize in message analysis and can offer a significant contribution to increase accuracy of theoretical operationalization and ― manipulation checks‖ in field studies where the treatment channel is a message. Researchers in disciplines other than communication may find benefit in including a communication scholar in their research team when they have message-related variables in their study design. A communication scholar may thus assist social researchers in avoiding a Type II error. For example, previous field research in the DWI literature has found low or no treatment effect for in vivo data sets. The absence of detection of an in vivo effect might occur due to lack of intervention message analysis, as this study demonstrates. Thus, this study makes a contribution to the practice of field research across disciplines in the social sciences. It demonstrates the usefulness of communication analysis in avoiding a Type II error in field studies.

22

CHAPTER 2: REVIEW OF RELATED LITERATURE The determination of efficacy of drunk driving interventions on a national level is stymied by lack of coordination between states. Differences in arrest and court procedures and data firewalls between states lead to discontinuous and relatively unreliable data. This is true especially in the case of data concerning repeat DWI offenders. Data between states are fire-walled: one state‘s driver records usually will not include arrests a driver received in another state. Data between states are discontinuous: each state has different definitions of what constitutes drunkenness, different numbers of years for which arrest records are available, different laws, and different norms for practices such as record purging and plea bargaining. A plea bargain usually changes a repeat DWI offender‘s number of prior offenses to zero when in fact they may have three to six or more priors. Some states only track prior offenses for three years, others for ten years. Different states have different blood alcohol content cutoff points that determine whether a driver is drunk or not (Streff, Spradlin, & Eby, 2001). These differences make it difficult to confidently compare results of research conducted in different states (Jones & Lacey, 2000; Breer, 1998; Yu & Williford, 1991). When it comes to DWI data and particularly DWI data for repeat offenders, state differences skew the accuracy of data on the national level. Researchers often consider national statistics to be relatively accurate, but in actuality national aggregate data is only as accurate as state reporting. As considered previously, state reporting is inaccurate, so it follows that national data aggregated from state reports is also inaccurate. This inaccuracy on the state and national reporting level influences the accuracy of meta23

analysis on DWI interventions. The inaccuracy of the underlying data and of state and national reports, and the challenge of comparing different metrics from different states, is often overlooked in meta-analyses. The meta-analytical method compares studies based on methodological rigor and offers integrated research finding summaries, syntheses, and comparisons. With these considerations in mind, the following discussion addresses two related questions. These questions concern the efficacy of drunk-driving interventions and the rigor of methods used to assess their efficacy. The pros and cons of previous research are examined based on their methodologies, without probing the inaccuracies of underlying data, which is beyond the scope of the present study. Other than the Woodall research, previous MADD VIP researchers have worked with quasi-experimental designs, reporting mixed and inconclusive results. Some studies found MADD VIPs effective (Badovinac, 1994; Fors and Rojeck, 1999; O‘Laughlin, 1990; Rojek, Coverdill, & Fors, 2003; Sprang, 1997). Other studies found no evidence of MADD VIP efficacy (C‘de Baca, Lapham, Liang, & Skipper, 2001; Marin & Marin, 1991; Shinar & Compton, 1995; Theriot, 2006). At two years post intervention Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, and May (2008) found that MADD VIP participants‘ recidivism rates were 30% higher than their control group, trending toward significance at p = .0583. All of these studies are discussed in more detail in this chapter. This chapter also covers literature on DWI intervention effect sizes, components of the most effective DWI interventions, research on general message effects variables, and research on reactance theory variables. It concludes with a section highlighting

24

ethnographic field notes from a MADD VIP session that contain field observations of reactance antecedents. Evidence that MADD VIPs Work Badovinac (1994) found that 62 imprisoned DWI offenders displayed a significantly greater number of plans to use designated drivers or taxis because of their MADD VIP experience than did their imprisoned controls (n = 46). Sprang (1997) found that MADD VIP participants (n = 103) displayed an intent not to drive drunk that was significantly different from their control group (n = 75). Further, Sprang found two times as much recidivism for controls than MADD VIP participants during the 12-month period following the previous arrest. Fors and Rojeck (1999) reproduced Sprang‘s study and found similar conclusions: during the post 12-month period following a previous DWI, Controls (n = 431) displayed significantly more arrests than MADD VIP intervention participants (n = 404). Rojek, Coverdill, and Fors (2003) reported at the five-year follow-up (n = 404) that ― VIPs are associated with a 55.7% overall decrease in the hazard of recidivism; the VIP effect is strong in the first two years.‖ O‘Laughlin (1990) found similar results. However, the demographics of these population samples may have limited their MADD VIP outcome conclusions. Rojek et al. conducted their experiments in Clarke County, Georgia, a community without a significant Hispanic population. Fors and Rojek (1999) analyzed the same data, obtaining significant results only with white men, ages 26-35 years old, who had only one prior DWI arrest. For nonwhite men, older men, and those with multiple priors there was no effect. Their results were inconclusive as ethnicity, age, and number of priors were not manipulated variables. The design was not empirical and 25

causal inferences cannot be derived. Rojek et al. (2003) discussed the possibility that different VIP styles and different audience cultures may cause different VIP efficacy rates. Their results point to the possibility that culture may explain differences in participant responses to the same intervention. Whether or not culture or racial profiling may be responsible for these findings is considered in detail in chapter four, The effect of ethnicity on survival and in chapter four summary, Demographic risk factors, and in chapter five, Do intervening factors bias DWI demographics. In any case, as discussed previously, the above researchers cannot claim causal results because these studies employed quasi-experimental designs. In quasiexperimental designs, study outcomes cannot be conclusively stated to be a result of observed effects. Another possible reason for inconclusive results is the intervening factor of judge group assignment bias, a biased assignment of certain demographic cultural types to MADD VIPs (C‘de Baca, Lapham, Liang, & Skipper, 2001), as discussed in the introduction. A complication in existing research, as discussed in the introduction and again in this section, is that previous researchers, other than Woodall et al. (2008), did not conduct randomized designs. Additionally, none of the research designs controlled for all three important predictor covariates: gender, age, and number of prior arrests. The role of these three important predictor variables in DWI recidivism is discussed in detail later in this chapter. Evidence That MADD VIPs Do Not Work There is evidence that MADD VIPs do not work. Theriot (2006) found no significant correlation between MADD VIP attendance and recidivism (n = 247). Shinar 26

and Compton (1995) studied Oregon and California DWI offenders (n = 2000). They matched MADD VIP intervention and control groups on age and gender and found MADD VIP had no impact on recidivism. C‘de Baca et al. (2000) found ― there was no evidence of a recidivism-reducing effect of the VIP‖ (p. 1425) for the VIP audiences in Bernalillo County, New Mexico, the majority of whom self-identified as of Hispanic ethnicity. However, they did not identify the message qualities or archetypal message types in the VIP presentation as a possible influence upon this failure. Instead, they discussed the possibility of confounding cultural causes, confusing culture with ethnicity. The present study considers whether VIP message design, as defined by message qualities, could have caused reactance, i.e., a negative and opposite reaction to the message than was desired by the MADD message designers. C‘de Baca, Lapham, Liang, & Skipper (2001) found no statistical association between MADD VIPs and first-time offender recidivism. However, ― female repeat offenders referred to VIPs were significantly more likely to be re-arrested‖ (p. 615) compared to nonVIP controls. This finding suggests that gender and number of previous DWI arrests should be covariates in a MADD VIP study. Woodall, Delaney, Rogers, and Wheeler (2007) and Wheeler, Rogers, Tonigan, and Woodall (2004) reported results from the only randomized study on MADD VIPs. They found mixed and conflicting results on MADD VIP efficacy, depending on the length of time between intervention and post measurement. In a one-year follow-up study, these researchers initially found evidence that MADD VIPs were effective in reaching their goal of changing offenders‘ attitudes and aims. Immediately after 27

completion of fifteen different VIPs, conducted in 1995 through 1996, most participants reported a raised awareness of the DWI problem and 26% stated they would never drink and drive again (Woodall et al.). The Wheeler et al. study compared the MADD VIP participant recidivism to an age and gender matched control group of drivers. Preliminary findings, based on self report data, were encouraging. However, a quasi-experimental study that similarly contrasted age-sex matched VIP attendees against VIP no-shows in California found no difference between the two groups, both of which shared the same intent to treat7 MADD VIP court mandate (Shinar & Compton, 1995). The quasiexperimental Shinar Compton findings left room for doubt regarding the New Mexico findings, even though the New Mexico MADD VIP age-sex matched findings had been obtained from an empirical study. Then a 2-year follow-up by Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, and May (2006), based on recidivism data, found that length of follow-up made a difference in the results. After two years, MADD VIP participants‘ recidivism rates were 30% higher than their control group, trending toward significance at p = .0583. The 2year follow-up findings from the empirical study, although not significant, continued to cast a doubt on the efficacy of MADD VIPs. Because the follow-up duration of the MADD VIP study appears to have influenced study results, and the self-reports differed from the arrest records, the present study entails a twelve-year follow-up that evaluates longer-duration DWI arrest outcomes for the same original participants studied by Woodall et al. 7

Intent to treat approach means that those who were assigned to VIP were treated as if they had attended, even if they did not show up (Gross & Fogg, 2004). This is a conservative approach, the ramifications of which are discussed in the methods section.

28

In summary, Woodall et al., in their singular randomized study of MADD VIPs, found contrary results with different lengths of follow-up time. Other researchers conducted quasi-experimental studies that supported either side of the MADD VIP efficacy argument. None of the quasi-experimental studies controlled for more than one of the three known covariates (gender, priors, and age) except C‘de Bacaet al. (2005), who controlled for two of the covariates: gender and priors. These flaws in research design limit the validity and generalizeability of previous results. In contrast, the present study segments data for all three known covariates: gender, age, and number of priors. The Woodall et al. original randomized design is used again in the present study; random assignment to group condition has the advantage of controlling for unknown confounds. Drunk Driving Interventions An ever-increasing array of interventions is aimed at reducing alcohol abuse, producing a large body of literature. Early reviewers (Foon, 1988; Institute of Medicine, 1990) could not determine DWI interventions‘ effect size. However, Wells-Parker, Bangert-Drowns, McMillen and Williams (1995) conducted a meta-analysis of 215 DWI interventions that included a study of DWI treatment effect sizes. The results of their meta-analysis are discussed in the following section. Intervention effect sizes Wells-Parker et al. (1995) found that individual effect sizes differed by offender categories. Low-risk (no prior arrests) and high-risk offenders (multiple prior arrests) did not respond to DWI intervention (t (19) = 0.68; t (22) = 0.20, respectively, p > 0.05). Moderate risk offenders, however, did respond to DWI intervention. ― Average effect size for moderate risk offenders differed from zero (t (16) = 3.33, p < 0.05)‖ (Wells-Parker, 29

Bangert-Drowns, McMillen & Williams, 1995, p. 917). The difference between the low + high and the moderate risk groups was not statistically significant (t (58) = 1.6; p = 0.06). Effect sizes for 200-day follow-ups were significantly more varied than for 1600day follow-ups (p 0.05). Moderate risk offenders, however, responded to DWI intervention. ― Average effect size for moderate risk offenders differed from zero (t (16) = 3.33, p < 0.05)‖ (p. 917). The difference in recidivism between the low, high, and moderate risk groups was not statistically significant (t (58) = 1.6; p = 0.06). The number of priors was also found to have a contributing effect in a quasiexperimental study by C‘de Baca et al. (2005). In order to conserve power (Cohen, 1988) the final data set used in the survival analysis portion of the study was two-tiered: those 36

with prior arrests and those with no priors. The stepwise regression model fitting in the Cox Regression survival analysis, discussed in the results section, demonstrated that each of these groups demonstrated a different covariate model. In the present research, there is no distinction made between prior offenders who were high-risk drinkers who drive versus high-risk drivers who drink. Voas (2000) makes this distinction as a means to illustrate a broad continuum of different risk types who become repeat offenders. The present study does not subtype DWI offenders into clinical subpopulations. The small sample of recent prior offenders (n = 47) precludes such subdivision. General Variables in Message Effects Research Because behavioral health interventionists dispense all DWI interventions using language, photos, signs, or symbols, it may be argued that the most common independent measure in DWI intervention is the message. Message is defined as either verbally articulated language or a nonverbal sign, photo, or symbol sent by a message sender to a message receiver. As discussed previously in this chapter, education, a message-based intervention, was the most common independent measure in DWI intervention research. Based on the work of Wells-Parker et al. (1995), coupling education with psychotherapy (both of which are message-based interventions), made for a successful combined intervention. Education, when combined with contact-probation and psychotherapy counseling—all message-based interventions—had the best effect of all. Thus, it is argued that the most successful interventions involved some type of message intervention. Therefore, the present study looks systematically into the message as an independent, or causal, variable. 37

Several theoretical constructs reappear often in the literature on message effects. These constructs are (1) context, (2) content (including deductive and inductive arguments), (3) function, and to a lesser extent (4) intensity, (5) frequency, and (6) pathos or the strength of emotional appeal elicited by the message. Message context Linguists, communication scholars, and scholars engaged in discourse analysis define message context as the dimensions of setting and of the given roles, goals, plans, intentions, and prior knowledge of the participants (see the reactance antecedent Forewarning). A message researcher needs to examine the context for both the speaker and the listener when studying message meaning. A brief description of a VIP message context and the message content follows. Typically, DWI offenders arrive at their court-mandated VIP and pay a $20.00 cash fee to support the MADD organization. Afterwards they enter the auditorium. Once inside, DWI offenders sit and listen while victims and families of drunk driving victims present eight-foot high projections of photos of their dead or disfigured loved ones and of crash scenes. During delivery of their message, presenters describe their anguish and paint lurid pictures amid the loss-framed messages that highlight the social costs of drinking and driving. Loss-framed messages are designed to emphasize the disadvantages of offenders‘ failure to comply. Message context influences function. Message context transfers meaning to a message and contexts vary according to the function served by the communication (Burleson, 1987; Greene & Raney, 2003). In other words, the same communication delivered in a different context may serve a different function. For example, if a peer 38

lovingly said to a drinker, ― Don‘t drive,‖ as opposed to a MADD VIP presenter aggressively demanding the same thing, then the difference between the two contexts might elicit different meanings—interpretations—and serve different functions. A peer message context is intimate and likely presented close in time to the driving decision, often near the car that will be driven. Such a peer message, because it occurs in an intimate setting, may function as a statement of caring and support. A MADD VIP message context is public, censorious, and refers to some future time and place of making the driving decision. From the perspective of functional linguistics (Leckie-Terry, 1995), which is concerned with the way that language functions, it is argued that for MADD VIP presenters, the VIP message functions as punishment and a statement of public censure; it is not intended to function as an expression of care and nurturing. Public censure is a construct in reactance theory identified as a variable in this literature review. Public censure is an emotionally intensive message context. Several researchers (Dillard & Solomon, 2000; Solmsen, 1954) have studied message context and its persuasive influence on a message. Researchers have found context to be like a gate that can either open or close opportunities to persuade. For DWI offenders, the context of the MADD VIP message is one of punishment and public censure. This context may close opportunities to persuade, influencing the offender to be predisposed against, inattentive, and resistant to the MADD VIP message. Context may also influence a drinker‘s attention to an intervention message. At the moment when a drinker is deciding to drive, the context entails social approval that comes not from the MADD VIP presenters but from his drinking and driving peers. Therefore the context in which a person decides to drink and drive is different from the 39

MADD VIP intervention context. It is argued that popular culture, television, film, and print media, which tend to glamorize the free exercise of drinking behaviors, influence the context of the offender‘s decision to drink and drive. For example, some Southern Comfort whiskey television commercials highlight social conviviality amid surreal kaleidoscope visuals that reinforce the link between freedom of choice and intoxication. It is argued that the media (either wittingly or unwittingly) promotes drinking for pleasure and seldom if ever reinforces restraint in drinking and driving contexts. Pragmatic conversational analysts, though they recognize context as a variable, often fail to consider context in their analyses of meanings (Geis, 1995). Context analysts pursue a suitable and accessible scholarly method for environmental analysis of a message meaning. Context analysis is a method of assessing meaning of a message from the environmental context in which it occurs. The environmental context may include body language as well as the physical environment, the chronological environment, and messages that preceded and followed the message of interest (Scheflen, cited in Wertz, 1973). These elements of context are interpreted and understood to create meaning in the course of their relationships to one other (Kendon, 1990). For example, in the present study the relationships between environmental context and other factors were considered in coding message meaning. The immediate context was a large room populated with DWI victims who were privileged to speak and who spoke in confrontational, emotional, and threatening tones and content. Court mandated attendance for offenders preceded the event and court order compliance followed the event. This was the larger chronology of the environment. The environment was also comprised of message receivers, namely DWI offenders. They were seated together in a group, beneath the raised dais of the VIP 40

speakers, and they were not privileged to tell their stories. The coders who coded the text of the VIP presentations knew these contexts. The coding consisted of matching presenter statements with downward codes such as ― I am angry‖ and ― You should change.‖ There was very little variance between coders and agreement was high in categorizing the meaning of the VIP statements from the perspective of the message receivers. Hopper (1981) suggested that analysts make use of frame analysis (Goffman, 1974) when analyzing communication contexts. In the present study, the context and frame of reference is reactance theory and empirical reactance research. Jacobs and Jackson (1983) adapted the context analysis method of conversation analysis to consider context as a variable in speech acts. Variables in discourse analysis of context are sounds (intonation), gestures, rhetoric, meanings, speech acts, and turn taking as tactics to gain a strategic goal. These variables offer starting points for the constant comparison analysis of the MADD VIP messages. Constant comparison analysis is a process conducted within an inductive method known as the grounded theory method (Strauss, 1987). In grounded theory applications, inductive research is conducted in reverse of the manner in which traditional deductive research occurs. While, at first glance, inductive and deductive methods may appear to be contradictory approaches (Clarke, 2005; Kelle, 2005), it will become apparent upon inspection and comparison that they are two sides of the same coin, that is, complementary avenues to knowledge acquisition. In grounded theory applications, instead of beginning with a hypothesis, as in traditional science, the first step is to begin with an analysis of the data. These data may have been collected through a variety of data collection processes (Stebbins, 2001). After many re-reads of the data, a process that is referred to as constant comparison, key 41

themes are identified based on their frequency of occurrence and intensity of occurrence in the data. These data themes are conceptualized as codes (Charmaz, 2006; Glaser, 1992; Goulding, 2002; Mey & Mruck, 2007). After researchers have compared and re-compared the codes to the data, and after engaging in some reflection, these same researchers determine whether the codes are sufficient in number and parsimonious in number. The researchers determine that the codes adequately describe the data. They determine that codes do not overlap and are not redundant—i.e., the codes must represent unique elements. This process of developing unique and sufficient numbers of codes involves a manipulation of codes. Some codes are merged; while others are split into two or more codes. When satisfied with the foregoing steps, researchers then group codes into similar or related concepts in order to ease manipulation. From these concept groups, categories are formed. If appropriate, these categories are hierarchically arranged. In the present study, as will be discussed in the methodological design section, categories of archetypical message types were hierarchically ordered from least to most reactance-inducing messages. An archetypical message category was assigned a level of reactance-inducing intensity according to level at which similar message types had been found to induce reactance in germane empirical laboratory-conducted research. These previous empirical studies based their categorization of reactance message types upon theoretical reactance antecedents, such as forewarning or anger. Thus, theory played an important role in determining the labeling of message types, and the hierarchical relationships between message types were determined by previous empirical research. 42

At this point, it is useful to briefly consider how theory informs empirical research and how the grounded theory method, also referred to as constant comparison analysis, was employed in the present study. It was used in this study to identify reactanceinducing message types and their hierarchical order of reactance-inducing intensity. Theory is an explanation of phenomena through description of elements and their relationships. Relationships between elemental categories are discovered, through observations that become theoretical propositions, and through empirical testing of theoretical propositions. The grounded theory approach begins with interpretation of an undifferentiated ground of observations. It ends with an articulated theory about relationships and perhaps a hypothesis. This grounded theory process is the converse of the traditional research process, in which the researcher begins with a theory, forms a hypothesis, operationalizes theoretical constructs as variables, quantifies those operationalizations through frequency or level of occurrence in the data, conducts analysis, and ends with interpretation of the results. In grounded theory, the endpoint is development of theory based on interpretive analysis of raw data. The two approaches, the traditional inductive method and the relatively newer grounded theory method may be considered as compliments to each other in the same manner that a coin is comprised of two complementary sides. In both inductive and deductive methods, the goal is knowledge acquisition and generation. In the service of knowledge acquisition, both methods are employed in the present study. Message content The present study analyzes message content as a distinct variable that contributes to a message‘s independent location on an ordinal hierarchy, a scale of levels of 43

reactance-inducing statements. Each message within a dataset of VIP messages is analyzed from the perspective of its content and the function that content serves for the message sender. This method has been tested in genetic trait analysis to overcome the limitations of message noise, or variance, and missing data (Kelly & Di Marzo Serugendo, 2009). Message function The same message can vary in function, depending on whose perspective is being considered, whether the perspective is that of the message sender or that of the receiver. For example, in the VIP, variability in message function is demonstrated as follows. The MADD VIP message serves a cathartic mood-elevating function for presenters (Mercer, 1990), while lowering the mood for the receiver (Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, & May, 2006). The present study investigates whether participants had significantly lowered mood levels following high-reactance VIPs where a high proportion of reactance-inducing statements were delivered. Message intensity Message intensity is the degree to which the message sender‘s attitude deviates from neutrality (Bowers, 1963; Hamilton & Stewart, 1993). The intensity the message sender uses to deliver a message is important to persuasive message analysis; it can increase either positive or negative message effects in the message receiver (Buller, Borland, & Burgoon, 1998), depending on whether the message has a happy spin (positive valence) versus an angry spin (negative valence).8 8

A valence is a directional indicator. Valence is an adjective that indicates a noun‘s placement on a continuum from negative to positive. For example, if a message is negatively valenced then it has inherent qualities that place it in a negative context.

44

Degree of message intensity is related to the degree of emotion, the strength of aggression, and the opinionatedness displayed by the message sender (Burgoon, Pauls Denning, & Roberts, 2002). The degree of aggression expressed by the message sender is an intensity-related construct that has an influence on persuasion (Burgoon, 1989, 1990). In other words, the message sender‘s intensity of expression influences the persuasiveness of a message (Miller & Lobe, 1967). Language intensity may have long-term treatment effects (Buller, Burgoon, Hall, Levine, Beach, Buller, & Melcher, 2000). For example, if the victim presenters in the MADD VIP presentations express their messages with highly emotional intensity and anger-evident aggressiveness, then they may induce a reciprocating strongly emotional effect on the DWI offender audience. Dillard and Shen (2005) studied the effect of message senders‘ language intensity on the outcome, as measured by message receivers‘ negative reactance. The researchers tested two different levels of message sender intensities: high threat and low threat. They found that the more intense high-threat messages9 induced reactance, whereas the less intense low-threat messages did not. A message receiver experiences message intensity differently depending on his stage of behavior change (Buller, Borland, & Burgoon, 1998, Rogers, 2003).10 Buller et al. found people in the ― contemplation‖ stage of behavior change are receptive to a dissonant message. 11 Highly graphic and intense messages positively influence these

9

A high-threat message is defined as a message that the receiver perceives as threatening, whether or not the message sender intends the message to convey a threat. 10 Stage of behavior change refers to Prochaska and DiClemente (1984) stages of change model (this is the colloquial name for the model; Prochaska and DiClementes‘ official name for the model is the transtheoretical model). 11 Dissonance is an uncomfortable state that occurs when one recognizes a discrepancy between one‘s beliefs about reality and a contrary observation.

45

contemplation-stage offenders. These contemplation-stage12 drinkers‘ results equate to the Wells-Parker et al. (1991) ― moderate‖ offenders. On the other hand, people who are not receptive to a message (career drinkers) view highly graphic and intense messages as cognitively dissonant (Festinger, 1957; Festinger & Carlsmith, 1959). Dissonance prompts nonreceptive offenders to consider a message as inappropriate. This dynamic may explain why career drinkers react negatively to the MADD VIP message. For nonreceptives, a graphic and emotionally intensive message induces psychological reactance. Message frequency Frequency is the number of times message content occurs (Straus & Corbin, 1990). In the methods section, procedures for quantifying the frequency of reactanceinducing statements are discussed, as well as the quantification of proportion of reactance-inducing statements. Dillard and Shen (1995) report the proportion of reactance-inducing statements as the frequency of reactance-inducing statements divided by the number of statements. Message frequency and intensity: a combination of metrics Reactance-inducing statements can be evaluated by a measure that combines level of intensity with frequency (Dillard & Shen, 1995; Strauss & Corbin, 1990). Strauss and Corbin recommend dual consideration of frequency and intensity because an infrequent message that is intense may deliver a strong affect. Dillard and Shen noted that intensity

12

According to the transtheoretical model, [delete ― also known colloquially as the stages of change model,‖] the contemplation stage is a stage in behavior change when the patient begins to consider changing their behavior (Prochaska & DiClemente, 1984).

46

of effect covaried with intensity of message. For example, an angry message was more reactance inducing than a pleasant message. In the present study, both frequency and intensity of reactance-inducing statements were measured for 15 VIP intervention groups. The intervention groups were categorized into two levels, low versus high reactance-inducing VIPs, depending on the levels of intensity of reactance-inducing statements and frequency of reactance-inducing statements. A third comparison group that had not been exposed to the VIP reactanceinducing messages consisted of the control group, DWI School Only, from the original Woodall et al. (2007) study. DWI recidivism outcomes were then regressed onto these three groups: (1) no reactance-inducing VIP exposure (DWI School Only), (2) low reactance-inducing VIPs, and (3) high reactance-inducing VIPs. Support for the use of categorization to increase robustness of data from variance and noise, and to account for missing data as in the case of no treatment is found in the study of genetic trait analysis by Kelly and Di Marzo Serugendo (2009), discussed in detail in chapter 3, Methodology. If there were no differences between any of these groups, then reactance-inducing statements would not make a difference in intervention outcomes, as measured by DWI recidivism. The categorization of groups into no reactance, low reactance, and high reactance dosages, and the regression of DWI recidivism outcomes onto these three levels of reactance-inducing dosages, enabled an evaluation of whether a change in reactance levels was consistent with a change in DWI recidivism. Message pathos Pathos is the strong emotional appeal in a message that a speaker uses to evoke emotion in a receiver. Strong emotional appeals typically incite high levels of arousal 47

(Kuhl, 1983) and can sometimes favorably influence social judgments (Bless, Bohner, Schwarz, & Strack, 1990; Bodenhausen, 1993). At other times, researchers (Baron, Inman, Kao & Logan, 1992; Janis & Feshbach, 1953; Jepson & Chaiken, 1990; Liberman & Chaiken, 1992) found strong that fear appeals (Rogers, 1983) incite a boomerang or backfire effect. A backfire occurs when the receiver is nonreceptive to the sender‘s intended effect. He or she discounts the message and behaves in a way that is contrary to the expectations of the source of the message. Fear appeals fail if they are too strong. If receivers feel threatened by a message they may become defensive, disagree with the message, lampoon it, and refuse to think rationally about the message. Through ridiculing and posturing, they exceed neutrality; they validate their oppositional behavior, even though they may base their behavior on an irrational interpretation of the message. Message decay rate Message decay rate describes the rate of continuous decline for a message effect, whether the message receiver is an individual offender or a system or group of individual offenders. Message decay is studied in terms of a message‘s broadcast strength and longevity. Decay of messages has been modeled employing artificial neural networks to predict radio field strength (Leros, Alexandridis, Dangakis, & Kostarakis, 1998). Message decay has been studied extensively in biological science as information loss, usually involving RNA transcription or message loss due to mRNA mutations (Nicholson, 2003). Message decay is also studied in computer science regarding the rate at which a message quality is lost due to the dropping of bits in a data packet (Borade, Nakiboglu, & Zheng, 2008), or when nonparametric data is converted into parametric data using genetic algorithms (Xiao, Goebel, & Eklund, 2006). 48

One of the more interesting features of message decay is that such decay might function to increase system fitness. Message decay, the modeling of which can sometimes involve use of genetic or decentralized algorithms, is thought to improve resilience and fitness of a system by replacing old information with newer information to increase accuracy and environmental sensitivity. In one such case, decay has been found to be useful in traffic simulations where the feedback from the environmental system to the individual is enhanced by message decay as discussed by Kelly and Di Marzo Serugendo (2007). The Persistence parameter controls the period of inﬂuence of the message, once the time is past the message is removed entirely. Also older messages exert lesser inﬂuence than newer messages (a 28 second old message is inferior to a brand new arrival as the new arrival more correctly reﬂects prevailing conditions). (p. 9) Message decay in communication research from 1949 to 1989 was limited to short time frames ranging from hours to seven weeks. Five weeks was the mean length of the research conducted on message decay during this period (Allen & Stiff, 1989). Since 1989 there has been extensive research on message decay by Pfau and colleagues (Pfau, 1991, 1992, 1997; Pfau & Burgoon, 1988; Pfau et al., 2001; Pfau et al., 2003; Pfau et al., 2004; Pfau et al., 2005; Pfau et al., 2006; Pfau, Kenski, Nitz, & Sorenson, 1990; Pfau & Van Bockern, 1994). Pfau and colleagues have studied message decay ranging from one to three weeks, at times in conjunction with the inoculation effect. Message inoculation works like a medical inoculation with a weakened virus. In message inoculation, a weak undesirable message is used to inoculate the receiver against being persuaded by subsequent stronger arguments. After inoculation against undesirable messages, the 49

intended message is more likely to persist and not decay as quickly. In fact, Pfau and colleagues found that weak refutational messages could inoculate the receiver against multiple attacks against the desired message. Inoculation with a weak undesirable refutational message has been found to protect a desired message from decay more effectively than messages that support and reinforce the desired message (Ivanov, Pfau, & Parker, 2009). As an intervention message becomes less relevant to the receiver, it becomes old. Old information is allowed to decay, replaced by new, more relevant, information. The new information fits the message receiver‘s current interpretation of the environment. If the message receiver‘s interpretation of the environment is limited to sensation seeking, then the salient message is ―dr inking is pleasurable and driving is expedient.‖ This new message is privileged, while the older MADD VIP message, ― I should not drive drunk‖ is allowed to decay. Thus the rational MADD VIP message decays, and the reckless drinking and driving message in effect comes to replace it. As discussed earlier, a decision to drive drunk, attractive because if its short-term expedience, is an example of alcohol myopia (Steele & Josephs, 1990). The drunk driver‘s cognitive field of vision, is constrained by alcohol myopia. The drunk driver chooses a short-term solution instead of considering the long-range evolutionary advantage of not driving drunk. Reactance Theory Variables Involving Message Senders Reactance theory defines four antecedents to the state of reactance that are related to the message sender. These antecedents to reactance are ― strong intent to persuade,‖

50

― forewarning,‖ ― confrontation,‖ and ― public censure‖ (Brehm, 1966). These four antecedents are described in this section. Strong intent to persuade Festinger and Carlsmith (1959) found that the less inducement or strength used to achieve compliance, the more effective the message. Similarly, Brehm (1966) states that the stronger the assertion of a message used to achieve compliance the less effective the message. These two views support each other. They agree with the results of a complex systems study of strength of persuasive messages during a group persuasion process (Medina et al., 2005). The study found that a weak (i.e., low-inducement) persuasive message was more effective than a strong (i.e., high-inducement) message. Persuasive message strength in the model, when it replicated the data signature of the in vivo group, was found to be 1/37th the strength of the group member variance from the mean (Medina, et al., 2005). A strong message is not necessarily qualified in the literature as a high-reactance high-threat message. In the literature, a strong message is more often referred to as a strong argument. The contradiction in definition of what constitutes a strong message has perplexed researchers and caused confusion in message effects literature. The definition of what constitutes a strong message has varied. Not only do researchers differ in the definition of what constitutes a strong message, but it also has been found through manipulation checks that study participants can respond differently to the same ― strong message.‖ Thus, message effects studies tend to lack external validity, due to inconsistency with each other in the definition and operationalization of the theoretical construct ―stron g message.‖ When researchers fail to conduce a manipulation check to 51

determine whether all of their participants define ― strong message‖ similarly, then internal validity suffers in those studies. This internal and external validity problem in message effects research has produced confusion in the literature. For example, a strong message has been found to have no effect in some studies, while a weak message has demonstrated a strong message effect in other studies. The source of perplexity is in the inconsistent definition of what constitutes a strong message across studies, and inconsistency in what constitutes a strong message across study participants within the same study. As will be discussed next, what constitutes a strong message can vary according to participant demographics, participant personality traits (McMillen, Adams, Wells-Parker, Pang, & Anderson; 1992), and in the case of the present study, participant prior conditions. Updegraff et al. (2007) found that a weak rather than strong message had greater influence on health behavior, although in the discussion of results the authors downplayed their perplexity about the counterintuitive results, explaining that results did not confirm their theoretical perspective. In puzzling over their outcome they made a call for more future research. One possible reason for their counter-attitudinal finding may be that researchers assumed their participants would regard a journal article as a stronger argument than anecdotal evidence. However, this assumption was not tested in a manipulation check. In a related study, Freedman (1965) studied the strength of message (high/low threat) and the resulting compliance level among boys from the second, third, and fourth grades. He found that those who complied had heard a mild message, consistent with a

52

weak intent to persuade. Those who did not comply were more likely to have heard a strong message, consistent with a message source‘s strong intent to persuade. Brehm and Cohen (1962) researched the variable ― strong/weak intent to persuade‖ with an essay-writing experiment. They used weak monetary incentives to impel undergraduates to write counter-attitudinal essays.13 Brehm and Cohen then gave an attitudinal evaluation questionnaire. They found that participants who received the lower (weaker) monetary incentives were more favorably disposed to changing their opinions. This finding supported Festinger and Carlsmith‘s (1959) research on cognitive dissonance: a person is likely to change their attitude when the reward for making a counter-attitudinal statement is low, especially if the topic is of low import to them. This phenomenon is termed insufficient justification. Insufficient justification explains how actors cannot justify changing their mind for a low reward. A low-value reward causes a state of dissonance. Their beliefs are disconfirmed by their own actions. Insufficient justification describes their reasoning that they must have wanted to change their beliefs anyway, and so they do. The current literature (Ziegler, von Schwichow & Diehl, 2005; Updegraff, 2007) often equates a strong message with a strong argument and operationalizes a strong argument as an expert argument. Investigators who are academics often use a journal article as an exemplar of a strong argument. For instance, experiments are often conducted using college students as participants, and researchers typically expect that student subjects will privilege a journal article as expert and as making a stronger argument than anecdotal evidence. In fact, Ziegler, von Schwichow, and Diehl (2005, p. 13

A counter-attitudinal message disconfirms a person‘s current attitudes. It opposes the beliefs upon which those attitudes are based.

53

648) found in a manipulation check that ― the majority of students were estimated to rate the product more positively given the attractive source (M = 4.83; SD = 1.37) as compared to the expert source (M = 4.25; SD = 1.42).‖ This finding contradicts the assumption that the college students find the journal article as an expert source and a stronger argument, and points to a threat to internal validity. Thus, when evaluating results of experiments on strong/weak messages a researcher might consider whether operationalization assumptions could be incorrect. Such threats to internal validity in the experimental design may be avoided by incorporating a manipulation check if it is an experimental design, or a post-hoc message analysis in the case of an in vivo field study. Affect is a term used in psychology that is synonymous with emotion. In a study where a strong message was claimed to have a significant and positive effect, the message that was tested was not a persuasive message designed to elicit attitude change. Ziegler, von Schwichow, and Diehl (2005, p. 649) tested opinions on shower lotion in an experiment in which the strong argument for a low self-monitor was a journal article. The strong argument for a high self-monitor was an attractive person. The researchers found ― a significant effect of the strong–weak message contrast, b = .35 (SE = .12), t (137) = 2.91, p < .01. Strong arguments (M = 4.82; SD = 1.06) led to more agreement than weak arguments (M = 4.12; SD = 1.35). The fact that no other effect involving this predictor variable was found indicates that this was the case regardless of whether these arguments were presented by the expert source or the attractive source and regardless of recipients‘ level of self-monitoring.‖ It must be noted, however, that the strong arguments, regardless of source and function for the message receiver, were not counter attitudinal as is usually the case in health behavior change messages. It might be argued 54

that an opinion elicited on shower lotion is not representative in face value to a counter attitudinal attitude or behavior change, and this is a flaw of the shower lotion study design. The effect of message strength may depend on the context of the message. Mitchell, Brown, Morris-Villagran, and Villagran (2001) found that ― message strength was positively correlated with attitude, intention and behavior, but was negatively correlated with negative thoughts, and counter arguments.‖ This study may explain why researchers find that high message strength increases likelihood of attitude and behavior change in some cases, while in other cases involving counter attitudinal behavior change a strong message decreases compliance. A drawback to the current literature on message strength is that the messages were created and manipulated in the artificial environment of a laboratory setting. The authors equivocated in their discussion sections, proposing that results (which had not achieved significance) could have been caused by researchers‘ choice of message content and how the message was manipulated in the experimental design. Their nonsignificant results could also be a function of use of self-reports to obtain behavioral outcome data. Further, the experimenters had created messages that they themselves had judged should be strong or weak. Whether researchers correctly judged what constituted a strong message was not tested through an experimental manipulation check. Nor did the researchers conduct a content analysis using independent coders. Independent coders are coders who code the same texts independent of consulting each other; they must consistently agree on operationalizations of what constitutes a strong/weak message.

55

Coding, or content analysis of messages, is a common method of analysis in communication message effects studies. Another common method of categorizing messages in message effects studies is the operationalization of theoretical constructs where a message is constructed by the researcher to represent theoretical traits at the message face value. The present study hybridizes these two approaches in such a way as to create an approach that is neither one nor the other. The advancement is something new, with qualities, limitations, and benefits of both qualitative (where continuity and dependence are salient) and quantitative epistemologies (where discretion and independence of elements are salient). The resulting method belongs to neither paradigm because it is paradoxically both continuous (wavelike) and discrete (particular). In its wave form, data are analyzed as continuous. Yet proximal continuity threatens alpha inflation. Alpha is the probability of perceiving a relationship between data that extreme or greater, given the data, and given one flip of the coin. If the data are viewed (analyzed) more than once, then the coin is flipped more than once. Data are viewed from more than one perspective and chances of perceiving relationships increase. Alpha is the probability that a singular analysis, a singular flip of the coin, will observe relationships between data. Threat of alpha inflation occurs because the greater the number of times that the same data points are perceived from different analytical perspectives, the higher the probability of interdependence of analytical observations. This logic, taken farther, suggests that when data are continuous, as number of analyses increase, the probability of unrelated observations decrease and probability of cross-observational or dependent relationships increase. As the probability of seeing interdependent relationships increases, the discretion of the alpha, the singular 56

view, is compromised. The integrity of the alpha is dependent upon independence of observations and a singular roll of the die to find a relationship between those observations. Paradoxically, accurate analysis of continuous data is threatened by a continuous view. The view must be singular, discrete. Paradoxically at the other extreme, accurate analysis of discrete data requires continual repetitive views of the units of analysis. These repetitive reanalysis‘ are encouraged. The process is to flip the coin many times and increase the probability of seeing interdependent relationships. This continuous process of analyzing discrete units is referred to as constant comparison analysis (Straus, 1987; Straus & Corbin, 1990). The integrity of constant comparison analyses of discrete data is threatened by a singular view. The view must be constant and continuous. It is interesting to consider how each paradigm‘s identity is strengthened by adopting paradigm of its opposite for its form of analysis. The correct approach to analysis of quantitative continuous data analysis is discrete. The correct approach to analysis of qualitative discrete units of data is continuous. Each methodological paradigm embraces its opposite as a means to reflect upon and make meaning out of itself. Each epistemology employs its opposite for reflection. It is therefore no great leap to understand that a viable approach to analysis of data can be simultaneously both continuous (quantitative) and discrete (qualitative). This study considers the fruit of a cross-epistemological confluence of qualitative and quantitative analyses. The delta region of confluence between paradigms, where boundaries fruitfully mix and overlap has been termed a heterogenous zone (Rogers, Medina, Rivera & Wiley, 2004). Historically, economically, linguistically, and scientifically (such as the dawning 57

observation that light is paradoxically both a particle and a wave) where differences conflate, scientific innovation is likely to occur and the frontiers of science are pushed outwards into new and previously unconsidered perspectives (Kuhn, 1996). In Kuhn‘s spirit of scientific evolution (1996), the researcher conducted a qualitative analysis informed by quantitative analysis. Quantitative researchers had preciously operationalized reactance theory constructs as message types with face values tested in terms of continuous dependent values. The present researcher employed those previously-established theoretically defined reactance-inducing message types to unitize the continous VIP transcripts into units of analysis. The researcher constantly reread and compared the VIP transcripts to identify a consistent definition of a unit of analysis, qualify definitions, and inductively abstracted exemplars of VIP presenter statements that became operationalizations of reactance theory constructs. The researcher organized VIP reactance theory message exemplars into a hierarchical order of increasing levels of reactance inducing statements per previous laboratory research, creating a code book. Independent coders self-trained, using the code book, and separately conducted a content analysis. Coders independently assigned VIP messages to code book categories. The coders independently and consistently agreed on operationalizations of what constituted a strong/weak reactance-inducing VIP message. The researcher This study advances message effects science by analyzing message strength in vivo, in a real life context. The advantage of in vivo message effects research is that the messages are created and studied within a naturally occurring context. Message effects research based on naturally occurring messages is more generalizeable and useful to message designers than messages that have been created and manipulated in an artificial 58

laboratory environment. Another advantage offered by the present study is the certainty under which the message effect has been evaluated. An outcome behavior is a more certain measure of message effectiveness than a self-report, which can be biased by a participant‘s desire to please the researcher or represent themselves in a more positive light than was actually the case. In other words, subjective and unverifiable self-reports are less likely to be accurate than objective measures that accurately document a behavior. The present study employs an objective measure of outcome behavior, documented subsequent DWI arrest records, as the source for outcome data. Such an objective behavioral outcome measure is called hard end-point data. The designation of message strength, whether a particular type of message was strong or weak, was not a speculation by the researcher, but was derived from two independent and successive phases of content analysis by independent coders. The following section anticipates the methods section. It discusses how results of experiments on message strength may be more reliable when the operationalizations of levels of message strength have been based on ratings produced by multiple independent coders. In this dissertation, measurement of message strength began with a qualitative analysis that was directly tied to reactance theory, followed by multiple types of quantitative analysis The methods section is foreshadowed here in order to contrast the present study with previous message effects research. In the present study, data were captured in a real life situation where recurring reactance-inducing statement themes and their message strength were recorded and transcribed. During qualitative constant comparison analysis, 59

these statements were compared with archetypical statements that had been empirically tested and results reported in the literature on reactance theory. The outcomes were a system of archetypical message themes that were hierarchically ordered by level of reactance-inducing intensity as reported in the literature. Qualitative analysis outcomes were next synthesized and parsimoniously refined to narrow the themes into eight hierarchical reactance codes, ordered by their level of reactance-inducing intensity. Definitions and archetypical exemplars were developed for each code. The next methodological step consisted of content analysis, a quantitative method. During this step, eight phase 1 coders would have their agreements measured by a probabilistic interrater reliability measure. The coders redundantly analyzed overlapping text samples from VIPs. They then matched archetypical message types to statements occurring in the VIP text samples. Because the archetypical message types were arranged in order of level of reactance-inducing intensity, as reported by previous research, the result of this operation was to code VIP statements by level of reactance-inducing intensity. The intensity level of a message, as determined by coders‘ placement of the message on the hierarchical reactance-inducing intensity scale, enabled assignment of a numerical ordinal value to each VIP message. See a detailed exposition of the reactanceinducing intensity scale in Table 3-4, with coding examples illustrated in Table 3-2. As a result, each message was coded by a code from 1-8. A code of 1 signified a nonreactance-inducing statement. A code of 8 signified a highly reactance-inducing statement. [insert paragraph break here]

60

The frequency of occurrences of different intensity levels for reactance-inducing messages within a VIP were summed for each VIP. For example, the level of reactanceinducement for a VIP with 8 instances of level 1 (happy) message types and 20 instances of level 8 (angry) message types would be divided by number of statements in order to calculate the average reactance-inducing intensity for that VIP. This would be calculated as follows: ((8 * 1) + (20 * 8))/ 28 = (8 + 160)/28 = 168/28 = 6. The average reactanceinducing intensity of this VIP would be 6, corresponding to a high-reactance-inducing intensity, on the average, for that VIP. The following would be an example of a low-reactance-inducing VIP. A VIP with 20 instances of level 1 (happy) message types and 8 instances of level 8 (angry) message types would be divided by the number of statements in that VIP. This would be calculated as follows: ((20 * 1) + (8 * 8))/28 = (20 + 64)/28 = 84/28 = 3. The average reactance-inducing intensity of this VIP would be 3, corresponding to a low reactanceinducing intensity, on the average, for that VIP. The interrater reliability for phase 1 coders was calculated and, though adequate, the variance and the marginals indicated that more reliable measurement of average levels of reactance-inducing statements for the VIPs could be obtained by increasing the attentiveness and focus of the coders. In phase 2 of the content analysis, two coders from phase 1, the two most attentive, focused, and consistent coders, the two coders who scored the highest interrater reliability during phase 1, were asked to each recode the entire data set of 2,021 VIP messages. These two coders consented to perform the task. They each recoded the entire data set for reactance-inducing message frequency and intensity, with a high level of interrater reliability. Thus, a measure of VIP message strength was independently 61

quantified through coder agreement on intensity of VIP reactance-inducing statements. This quantification of message strength was more objective than would have been the case should the researcher have guessed at whether an average message in a VIP was strong or weak, of strong or weak reactance-inducing intensity. Forewarning is an alert that another or different message is coming. Forewarning can be a positively valenced communication behavior, such as when a speaker outlines a speech in advance to help the audience better prepare to receive the content that is coming. This is called ― sign posting.‖ Sign posting for a receptive audience is positively valenced. However, in the context of reactance theory, forewarning is a type of sign posting that is negatively valenced. Petty and Cacioppo (1977) found participants‘ reactance increased if researchers forewarned them they were going to receive a counterattitudinal message. Brehm‘s reactance theory (1966) accords with Petty and Cacioppo‘s finding. Forewarning is a transitional statement, indicating a change in the direction of the conversation is about to occur. Sparks (1991) controlled for forewarning by measuring arousal and the emotional affects of distress and delight. He found no interaction effect between forewarning and gender of the participants. In another study, Sparks (1989) studied the interaction between high/low levels of forewarning (his manipulated independent variable), and preferred coping style (a preexisting trait). He assessed preferred coping style using the Monitor Blunter Style Scale, a scale that assesses information-seeking and avoidancecoping styles. Sparks found that high self-monitors preferred forewarning at high levels while low self-monitors preferred forewarning at low levels. Low self-monitors reacted negatively if forewarned. 62

In the MADD context, a message sender‘s forewarning is a cue to an audience they are going to receive a negative message intended to ―r educe behavioral freedom‖ (Brehm, 1966) to drink and drive. Forewarning, that an undesirable message is coming, is operationalized with statements such as, ― I am going to tell you about how a drunk driver ruined my life.‖ Another typical MADD VIP forewarning statement that signals a transition from opening pleasantries to a down-to-business confrontational tone is, ― That was before a drunk driver ran into her and stopped her life.‖ In the present study, forewarning is defined as a transitional statement that indicates that the next message is going to be sad, unwelcome, or confrontational. Confrontation Confrontation is a type of counter-attitudinal message. A speaker uses confrontation with the intention of reversing the listener‘s beliefs and attitudes. However, strong confrontation is bound to invoke a negative response. Miller (2000) and Miller, Benefield and Tonigan (1993) found that reactance increased when the persuader‘s language was confrontational. This finding suggests that an explanation for increased drunk driving among MADD VIP audiences might be found in the confrontational nature of the MADD message. Strong confrontational messages invoke a defensive response as a listener‘s countermeasure. Drunk drivers have been known to be defensive, demonstrate avoidance, and they have been known to minimize the value of the counter-attitudinal VIP message. Evidence of drinkers‘ minimalization and ridicule of MADD is presented in the discussion section of this dissertation. These reactions to MADD may operate as a

63

countermeasure, serving to validate the drinking culture and insulate drinkers from the discomfort they experience from MADD‘s confrontational message. Is MADD‘s message confrontational? MADD spokesperson Lord (1990) advocates the following tactics for VIPS: (1) Exposing offenders to the consequences of drinking and driving (2) Helping offenders move beyond focusing on their own ― bad luck‖ (3) Serving as a first step in breaking down the denial of alcoholics/drug addicts Miller, Benefield, and Tonigan (1993, p. 455), define MADD‘s tactics as confrontational because MADD advocates ― a hard-hitting, directive, exhortational style intended to overwhelm robust defensive mechanisms‖ of the DWI offender. As just suggested, it is often the case that confrontation, of which there are many levels, is an antecedent to reactance (Brehm, 1966). Confrontation may take many forms. A confrontational message may threaten a loss or invoke fear. Messages that frame a behavioral consequence in terms of loss are called loss frame messages. A loss frame message may point out that the message receiver will lose a cherished freedom, such as the freedom to drive legally. A loss frame message may point out that the message receiver will suffer incarceration, which is again associated with loss of freedom, obviously a negative outcome. Fears of loss and negative outcomes are inflamed by messages that appeal to, or target, those very fears. Messages that appeal to a listener‘s fears are called fear appeals. A fear appeal may target any fear. During MADD VIPs, presenters target audience members‘ fears that they might lose their freedom to drive legally, or their freedom to move about freely, by virtue of incarceration. MADD‘s loss frame messages and fear 64

appeals threaten loss of freedom, and in this instance loss of freedom is a reactance antecedent (Brehm, 1966). Both loss frame messages and fear appeals have been studied extensively by Witte and colleagues (Witte & Allen, 2000). Witte and colleagues have found in general that fear messages are denied, ignored, or rationalized as irrelevant when the message receiver does not possess the ability to avoid the feared event. If a message receiver does not feel efficacious14 in controlling their circumstances to avoid the feared loss, then they rationalize that the message is unimportant or does not pertain to them. The rationalization ― this message does not apply to me,‖ is a form of fear control. While in a state of fear control, the message receiver may dismiss or deny the probability that the feared event will occur. Fear is uncomfortable. Fear control may take the form of seeking comfort, especially the comfort provided by the associated undesirable behavior. Therefore, undesirable behaviors may be unwittingly reinforced by fear appeals. For example, a DWI offender, fearing incarceration, and feeling unable to control the drinking behavior, may seek comfort by increased drinking. Drinking more, increasing likelihood of increased drunk driving, may be the only effective way a drinker knows to control fear. Fear control may be an intervening state between the fear appeal or loss frame message and the reactance behavior. The literature on fear control, fear appeals, and their interaction with loss frame messages is noted here, but will not be further considered because the body of fear appeal and loss frame literature overlaps with only one of seven reactance antecedents discussed in the present study, confrontation. Further, the literature 14

Efficacy is a person‘s believe that he or she has the ability to avoid or control an event.

65

on fear appeals and loss frame messages offer a narrower in scope of explanation of the MADD VIP message effect than does reactance theory. The fear appeal literature explains why a message receiver might feel defeated and make no change in behavior. Reactance theory explains why a message receiver would increase practice of the undesired behavior following confrontation, loss frame messages, and fear appeals. Public censure Public censure is a kind of public scolding. It is a face threat that often includes the possibility of continued public shunning, penalties, and sanctions. Public censure is experienced when one‘s public image is assailed in a public forum. A face threat is experienced when one feels intimidated or embarrassed in front of others. The confrontational nature of public censure is likely to invoke the typical reaction to confrontational messages: denial, rationalization, and a desire to seek comfort and avoid compliance. Face threats are confrontational. High face threat, or a high degree of public censure such as occurs in MADD VIPs, makes for low compliance and increases the likelihood of reactance (Brehm, 1966). Threat of public censure may be a confounding variable to ― strong/weak intent to persuade.‖ If an authority (representing public norms) is involved in the experiment, a strong intent to persuade demonstrated by the authority offering more money ($20) to tell a lie is more effective than a weak intent ($1) from the same person. Perhaps the subject complies more under strong inducement to please the authority and avoid public censure. It may be that an interaction between authority (invoking a subject‘s desire to avoid public censure by that authority) and level of inducement (high or low) depends

66

upon the level of subject‘s self esteem and whether, or by what degree, they feel vulnerable to public censure. In a replication of Brehm and Cohen‘s (1962) essay-writing experiment, Worchel and Brehm (1970) found the greatest degree of attitude change occurred for college student participants who were paid the largest amount of money. An authority, a college professor, paying large amounts of money, may be construed by the students as indicating a strong intent to persuade, and they may comply simply to avoid censure. In an interesting convolution of circumstances, Worchel and Brehm (1970) found that larger payments of money invoked greater degrees of attitude change. This contradicted Brehm and Cohen‘s (1962) earlier findings that lower payments evoked the greater attitude change. What factors explain the difference in these results, obtained by the same researcher (Brehm) after an eight-year lag? Contradiction between the two sets of findings must first be considered by examining the differences in the researchers‘ experimental designs. Experimental bias, in this case the influence of the researcher upon participants, was controlled in Worchel and Brehm‘s (1970) design. The researcher who evaluated participants did not know whether the person being evaluated had received a large or small payment. Because the evaluator had no way of identifying which participants would confirm the evaluator‘s personal expectations, the evaluator could not unwittingly influence evaluation results. An evaluator who does not know to which group a participant has been assigned and cannot unwittingly bias evaluations to confirm their own subjective expectations conducts a ― blind evaluation.‖ This type of study design is referred to as a blind study.

67

In Brehm and Cohen‘s eight-year earlier study (1962), the evaluation process was not blind. The evaluator knew which participants had received the smaller sums of money, and the evaluator expected that smaller sums (weak inducement) would evoke the most change. It cannot be ruled out that the researcher‘s bias toward expecting that smaller rewards would invoke bigger change,15 resulted in evaluations that the lower-paid participants changed the most. The highest payment evoked the most attitude change when the attitudinal evaluator did not know which payment (large or small) the participant had received. Compliance with an authority figure (researcher/persuader/high-ranking professor/evaluator) may be a result of the subject‘s wish to avoid public censure, coupled with with the amount of money ($20 versus $1) that the experimenter offered as an inducement. Worchel and Brehm (1970) explained that the reason their results conflicted with the Brehm and Cohen (1962) results could be due to the confounding effect of ― communicator influence‖ upon ― message content‖ (Hollander, 1971). Communicator influence might be interpreted as authoritative influence, or fear of public censure from that individual. In other words, Worchel and Brehm believed that Brehm and Cohen‘s study suffered from evaluator bias and was thus an inappropriately designed experiment. 15

Festinger and Carlsmith‘s (1959) well-known experiment from three years earlier supported the argument that insufficient justification for a behavior change would cause dissonance result in greatest compliance. Festinger and Carlsmith interpreted their low-inducement/greater-compliance results solely in terms of their manipulated variable, "level of monetary inducement $1 vs. $20." However, a confounding influence, unknown at the time, was that the inducement came from an authority figure. Later research, particularly Stanley Milgram's (1961, 1963, 1969, 1970, 1974, 1976, 1977) series of studies on the greater compliance obtained from a combination of high authority and low inducement, clarified the cause of the effect. Low monetary inducement, lacking a social element in the Brehm and Cohen experiment, is a different type of inducement than a highly-influential authority/social norm interacting with a low-level monetary inducement.

68

There is further explanation supporting the argument that fear of public censure or face threat accounts for Brehm and Cohen‘s (1962) results. Rosenberg (1965) argued that participants might have changed their opinions in the Brehm and Cohen experiment because they experienced evaluation apprehension. They may have feared negative evaluation of their choice of a weak reward more than they desired the high reward (Crano & Messe, 1970). Thus, fear of negative evaluation by an authoritative public figure, may be a version of fear of public censure. Public censure may be investigated as a mediating variable in the effect of a strong/weak message (O‘Keefe, 2003). Authoritativeness can have different effects in different situations on different types of people. The aforementioned studies were conducted on college students, usually freshmen, who may be reasonably expected to be less depressed and more resilient than DWI offenders in the MADD VIP audience. In the case of the present study, when one considers the context of loss frame messages, fear appeals, and confrontation during the VIP, it may be more apparent why DWI offenders reacted negatively and, as it were perversely, to the MADD VIP speakers. The discomfort felt during actual VIP public censure (not the hypothetical public censure of Worchel and Brehm‘s college freshmen), coupled with a high-reactance high-threat VIP message, may have invoked a state of reactance in message receivers. Reactance Variables Involving Message Receivers Message receivers display reactance by behaving contrarily to a request. Reactance can produce a ― boomerang effect‖ (Hollander, 1971)16 where the negative 16

A boomerang is a curved aboriginal weapon that, when thrown upwards into the sky, returns to the thrower.

69

force of the message returns and impacts the speaker negatively. MADD presenters who induce increased drunk driving among their audience could be considered to be invoking a boomerang effect. Empirical evidence points to the cause of reactance as a receiver‘s view that another person is threatening his freedom (Brehm & Cole, 1966; Goranson & Berkowitz, 1966). Reactance, induced after exposure to a loss frame message, a message that threatens incarceration and loss of freedom, offers one explanation of why a loss frame message would wreak an effect opposite from the desired upon the message receiver. Reactance has a higher likelihood of occurring if certain aggravators are present. These aggravators co-vary with reactance in a positive correlation. Higher the levels of these covariates, are correlated with higher the reactance outcomes (Brehm, 1966). The following covariates increase reactance: 1.

High levels of confidence that one has a right to freedoms being censured/threatened correlate with a high level of reactance.

2.

High levels of import of behavioral freedoms being threatened increase reactance.

3.

High levels of threat to an important freedom increase reactance.

4.

If a speaker states that the listener‘s other related freedoms are threatened, this statement increases reactance.

A measure is that is available to test reactance is the Hong Psychological Reactance Scale (Hong & Faedda, 1996). These covariates of reactance are now discussed from the perspective of the MADD VIP participant. The original study Rogers, Woodall, Rao, Polacsek, & Milan (1994) did not consider reactance as a variable. 70

Therefore, the original researchers did not collect data regarding reactance antecedents from the message receiver‘s point-of-view. However, for the sake of completeness, message-receiver reactance antecedents are included in the present study, as delineated in the sections that follow. Message receiver’s confidence he possesses freedom to comply or not Confidence in a freedom arises from previous expression of that freedom. In order for a threat to freedom to induce reaction, a participant must believe, with some degree of confidence, that he or she possesses that freedom. A person is confident he or she has a freedom if either of the two following propositions is true: (a) He or she has experienced a prior personal expression of that freedom, or (b) he or she has witnessed similar others expressing that freedom and therefore expects to be able to behave in the same way. Past exercise of the freedom to drink and drive reinforces a DWI offender‘s belief that he or she continues to possess the freedom to drink and drive. If the participant has observed peers exercise their freedom to drink and drive with impunity, then these observations reinforce a drinking and driving behavioral norm and reinforces the drunk driving behavior. Brock (1968) studied the correlation between levels of choice (freedom) and levels of dissonance (reactance). He found the best way to measure the variable ― volition‖ or freedom of choice, was on a continuum, rather than as a discrete variable that measures presence or absence. Brock found that as belief in rights to a freedom increased, levels of dissonance increased if that freedom was threatened.

71

Message receiver’s perceived import of freedom A freedom gains more importance when it is reinforced by one‘s repeated behavior, the repeated behavior of one‘s significant others, and the repeated behavior of generalized others in one‘s social group. The belief in the import of the freedom to drink and drive is influential on the behavior of drinking and driving (Fishbein & Ajzen, 1975). If the message receivers of the 1994-1996 study perceived the MADD VIP as a threat to their freedom to drink and drive, then according to Brehm (1966) they would react by increasing the exercise of that freedom. They would therefore incur more DWIs following the MADD VIP intervention. One reason for reactance behavior is the perceived attractiveness of risk-taking and rebellious behavior. Researchers have found that young drinking-driving populations are likely to engage in high-risk driving (McMillen, Pang, Wells-Parker, & Anderson, 1992). Pechmann, Zhao, Goldberg, & Reibling (2003) found that: Among youths who felt immune to health risks, higher perceived health risk severity was associated with stronger intentions to smoke. In other words, in the context of low perceived vulnerability, stressing health risks could increase smoking‘s symbolic value as a risk-seeking, rebellious, and thus attractive behavior. (p. 11). This type of risk-taking profile has been associated with disenfranchised youth who revel in the role of anti-hero. They are susceptible to aspirations of destructive heroism, called the Herostratos Syndrome after the Greek iconoclastic youth who sought immortality through destruction (Borowitz, 2005; Cooper, 1977; Harmon, 2000; Stern, cited in Greenberg, 2005; Stohl, 1988; Wright, 1985). Such youth are easily aroused by 72

risk-taking and antisocial behavior (Burke, 2003; Harmon, 2000; Hoffman cited in Greenberg, 2005; Stern cited in Greenberg; 2005). Sign of perceived threat Perceived threat is the message receiver‘s recognition that a loss is imminent; perceived threat to freedom can induce reactance. Festinger (1957) operationalized the magnitude of dissonance accompanying forced perceived threat to freedom as ― compliance‖ (p. 92). He found that, as an attitude object increased in importance, the dissonance and psychological reactance increased when possession of that attitude object was threatened. A threat to possession of the attitude object was perceived in a request for counter-attitudinal compliance. The more important the attitude object, such as drinking and driving, the greater the message receiver‘s perceived threat to freedom on hearing a counter-attitudinal message pertaining to the devastation caused by drunk driving. Brehm (1966) found that when a participant realized the message source had a strong intent to persuade, he or she felt cornered. As the perceived threat to freedom increased, the likelihood of reactance increased. As discussed regarding the previous variable strong intent to persuade, a highthreat correlates with low compliance (Freedman, 1965). Similarly, Brock (1968) found that a higher perceived threat to freedom to perform an action (such as driving while drunk), coincided with higher reactance (increased drunk driving). Brock measured reactance as dissonance and found that when high levels of perceived freedom were present, there were also present higher degrees of dissonance and reactance upon perception of a counter-attitudinal message.

73

Message receiver’s belief that MADD threatens other freedoms The MADD VIP message receiver, upon hearing that MADD is threatening his freedom to drink and drive, may deduce that MADD is also threatening related freedoms. For example, if MADD says one cannot drink and drive, and one drives to favorite drinking places, then MADD is reducing one‘s freedom to drive in general. Typically, drinkers drive to several drinking locations during one drinking episode. The MADD VIP anti-drunk-driving message threatens the freedoms to drive, to drink, and to both drink and drive. According to Brehm (1966) an offender‘s belief that a MADD speaker threatens related freedoms would increase that offender‘s reactance. For example, MADD VIP audiences would have a high degree of reactance if they were confident that they possessed the freedom to drink and drive without consequence. They would have a high degree of reactance if they had been arrested (freedom threatened) when they exercised their freedom to drink and drive in the past. They would have a high degree of reactance when they had observed others in their social network also exercising freedom to drink and drive without confidence. It is reasonable to propose that DWI offenders could have a high degree of reactance if they think a MADD presenter threatens their freedom to drink and drive through warnings of car impounds, sanctions, incarceration, loss-framed messages, and fear appeals. Such would be the case when, as often occurs during the VIP, a MADD presenter warns that drunk drivers will lose their drivers‘ licenses. They will respond with a high degree of reactance if reactance aggravators are present: (a) A MADD presenter forewarns that a negative, counter-attitudinal message directed to dissuading offenders 74

from drinking and driving is imminent; (b) listeners sense that the MADD presenters have a strong intent to dissuade them from drinking and driving; (c) the MADD presentation takes place in a public context where MADD presenters portray DWI offenders as villains in public presentations—invoking face threat; and (d), the MADD presenters exhibit a confrontational approach to alcohol use and abuse. DWI offenders may present reactance if Brehm‘s four reactance antecedents are present: (a) DWI offenders have a high degree of confidence in their freedom to drink and drive. (b) DWI offenders place an import on the freedom to drink and drive. (c DWI offenders believe MADD presenters threaten other freedoms. (d) DWI offenders believe that MADD presenters imply that courts may reduce related freedoms, such as the legal right to drive. Considering VIPs from the viewpoint of DWI offenders helps explain how VIPs can introduce and inflame reactance that increases drunk driving. Reactance theory explains why VIP audiences engaged in more drinking and driving following and in spite of the MADD VIP anti-DWI messages (Woodall, Delaney, Rogers, Wheeler, Rao, Polascek & May, 2006). This trend, though nonsignificant, pointed toward MADD VIP intervention failure. Reactance theory may explain the failure of the intervention from the perspective of a communication message effect, as discussed below. Researchers use reactance theory to explain the negative outcomes of intervention campaigns. Dillard and Shen (2005) discussed how reactance explained the failures of health campaigns. They also discussed how to measure reactance. Dillard and Shen found that ― reactance can be operationalized as a composite of self-report indices of anger and negative cognitions‖ (p. 144). A message receiver who is reacting negatively to an 75

intervention message will report significantly lowered mood state. Reactance causes lowered moods (Hong & Faedda, 1996). Rogers, Woodall, Rao, Polascek, and Milan (1994) and Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, and May (2006) measured a negative mood change among MADD VIP participants following receipt of a strong MADD message to stop drinking and driving. The present study examines whether the level of reactance-inducing statements and proportion of reactance-inducing statements predict a pre/post direction of mood change. In other words, will reactance antecedents, as present in VIPs, presage a change in a negative direction on an emotional change score, where the emotions measured are anger and negative cognitions? Negative cognitions, such as thoughts of negative outcomes that can result from antisocial behavior, can exacerbate a sociopath‘s pathology. When threatened with loss of freedoms, sociopaths have been known to increase socially undesirable behaviors. For example, if threatened with consequences of driving drunk, then they may drive drunk more often to assert their freedom to do so. If the VIP message receivers, in the 19941996 original study, believed that VIPs threatened their freedom to behave as they chose, then reactance theory (Brehm, 1966), would explain why they would drive drunk more often. According to reactance theory, offenders would drink and drive more to deny threat of loss of freedoms and to demonstrate their immunity to loss of driving and other cherished freedoms. The freedom to drive drunk is just one of several related freedoms that a drunk driver enjoys. Other related freedoms are the freedom to possess and use a driver‘s license, freedom from incarceration, freedom to enjoy job security, and freedom to enjoy societal approval. Reactance theory explains why threatening loss of driving and other related freedoms would increase a drunk driver‘s frequency of driving drunk. 76

Reactance Theory’s Usefulness in Explaining MADD Message Effects Burgoon, Alvaro, Grandpre, and Voulodakis (2002) employed reactance theory to explain message effect failures. They found resistance (reactance) to health communication messages was high in subpopulations, such as adolescents, who have a high need to express their freedom. Subpopulations who demonstrated high-reactance behaviors displayed the following characteristics: (a) Reactance-prone subpopulations have a high need for self-determination; (b) reactance-prone subpopulations believe that interveners are attacking their behaviors; (c) reactance-prone subpopulations have a high degree of confidence in their right and freedom to perform the contrary behavior; and (d) reactance-prone subpopulations have a high degree of subjective certainty that they know more on the topic than the message source. Reactance theory can account for the failure of health communication interventions conducted upon subpopulations that have resistance characteristics. As suggested previously, the present study similarly finds resistance characteristics among MADD message receivers. Reactance theory, a theory that offers a message effects perspective on persuasive communications, can help to explain intervention failures. According to reactance theory, when VIP presenters forewarn DWI offenders that they will confront them about the negative outcomes to drinking and driving, and warn them about threats to their freedoms, then this type of message would likely induce reactance and result in increasing drunk driving. If VIP presenters demonstrate a strong intent to persuade, then this also is a reactance antecedent than can result in increased drunk driving. Presence of reactance antecedents such as forewarning, confrontation, threats to freedoms, and strong intent to persuade may account for failure of the VIP intervention. 77

Anti-abuse Messages Induce Reactance Just as certain subpopulations can be more susceptible to reactance, certain types of messages can increase susceptibility to reactance. Several teams of researchers have used reactance theory to explain health communication campaign failures (Buller, Burgoon, Hall, Levine, Taylor, Beach, Buller, & Melcher, 2000; Campo & Cameron, 2006; Engs & Hanson, 1989; Miller, Burgoon, Grandpre, & Alvaro, 2006; Quick, 2003). They found that certain types of persuasion messages aroused high levels of reactance: 1. Anti-smoking messages (Miller et al., 2006) 2. Anti-alcohol abuse messages (Engs & Hanson, 1989) 3. Anti-drug abuse messages (Campo & Cameron, 2006; Quick, 2003) 4. Anti-high-risk behavior messages (Quick & Stephenson, 2004) 5. Anti-cancer behavior messages (Buller, et al., 2000) Results from these studies confirm that reactance theory may account for the failure of antismoking, antialcohol abuse, antidrug abuse, and anti-high-risk behavior messages in health communication interventions. The reason, according to Brehm (1966), Miller (2000), and Miller, Benfield, and Tonigan (1993), is that substance users sensed these ― anti-messages‖ as threatening their freedom to perform these acts. Users assert their own reactive and contrary high-risk behaviors. As discussed in the previous section, aggravators of reactance include confrontational interventions (Miller, 1995; Miller, Benefield, & Tonigan, 1993), a strong intent to persuade, forewarning (Brehm, 1966; Petty & Cacioppo, 1977), and public censure (Brehm 1966).

78

Confrontational Messages Induce Reactance Researchers credit increased alcohol consumption as a factor in reactance to confrontational style alcohol interventions (Miller, 1995; Miller, Benefield, & Tonigan, 1993). This research leads one to consider whether MADD VIP alcohol interventions are, in fact, confrontational and reactance-inducing. Confrontational interventions, similar in message content to MADD VIPs, demonstrate three previously described coincident aggravators that correlate with reactance: forewarning, strong intent to persuade, and public censure. In an earlier section of this chapter labeled ― Role of Message Intensity in Message Effects Research,‖ the effect of message intensity was discussed. Message intensity, as it is discussed here, is not a well-researched variable in message effect research. Message effects researchers characteristically discuss message strength but they view message strength as synonymous with logical argument strength. They usually define a strong message as a strong logical argument, a message from an expert, such as the author of a peer reviewed journal article. In contrast, the discussion of message intensity here is synonymous with high-reactance high-threat reactance-inducing messages. Message effect theories usually explain the relationships of theoretic elements concerned with fear, efficacy, or gain and loss frames. While focusing on these topics, message effects researchers have largely neglected to examine message intensity. The present study researches message intensity as a message effect. The present study does not make the same assumption about audience attentiveness and willingness to centrally process, cognitively elaborate (Petty & Cacioppo, 1986). Other message effects studies have assumed audience attentiveness and 79

cognitive elaboration of the message. For example, ― gain and loss frames assume that audiences are attentive to message content‖ (Capella, 2006, p. S270). On the contrary, according to extensive empirical research conducted by Petty and Cacioppo (1986), audiences may not be attentive to message content. In the transcripts analyzed in the present study, some MADD VIP speakers actually commented publicly during the VIP about the disinterest in their audience. MADD VIP presenters‘ observations on audience disinterest raise a doubt about the MADD assumption that VIP audiences are attentive. Receivers are not always attentive to message content, particularly substance abuse audiences such as DWI offenders. Offenders approach their perception of the VIP message through the colored lenses of certain predispositions (beliefs, attitudes, and behaviors) that favor offenders‘ rights to practice substance abuse. Substance abusers are not attentive to counter attitudinal messages that threaten their freedom to abuse substances. Threats concerning the negative outcomes of substance abuse can arouse reactance (Buller, Burgoon, Hall, Levine, Taylor, Beach, Buller, & Melcher, 2000; Campo & Cameron, 2006; Engs & Hanson, 1989; Miller, Burgoon, Grandpre, & Alvaro, 2006; Quick, 2003). Drug use has been known to impede cognitive processing of counter-attitudinal anti-drug-abuse messages (Fishbein & Ajzen, 1975). Pleasurable attachment to a drug can cloud an abuser‘s ability to consider the arguments of an intervention message. Reactance coincides with rejection of a message, such as an anti-abuse message. Reactance explains why DWI offenders may indulge in increased substance abuse after hearing an anti-abuse message.

80

Story forms of persuasive messages, such as those employed in MADD VIPs, are effective in capturing the attention of low-motivation audiences (Green, 2006). However, high-reactance high-threat stories, such as the VIP statement ―T he drunk driver who killed my daughter is in jail for the rest of his life,‖ may simultaneously capture attention, arouse a negative affect, and invoke a high level of reactance. Reactance to such a message is more likely than acceptance because of the message‘s strong negative valence and counter-attitudinal effect. Reactance explains why the number of prior arrests often predicts the number of recidivisms. Arrest is more likely for those DWI offenders with the greatest investment in their freedom to drink and drive. This study researches the main theoretical structure of reactance theory, with its nine reactance antecedent variables, in the context of MADD message effects. In a pilot observational study, these nine aggravators of reactance were observed as likely to be present in MADD VIP presentations. Employing reactance antecedents in message analysis offers a unique means to measure the probability of whether reactance may account for DWI behavior that is contrary to the intent of the MADD VIP message. Dillard and Shen (2005) found that ― reactance can be operationalized as a composite of self-report indices of anger and negative cognitions‖ (p. 144). A negative intervention message typically lowers subjects‘ mood and leads to reactance (Hong and Faedda, 1996). Rogers, Woodall, Rao, Polascek, and Milan (1994) and Woodall, Delaney, Rogers, Wheeler, Rao, Polascek, and May (2006) found a lowered mood among VIP participants following MADD VIPs‘ negative intervention that authoritatively demanded that they stop drinking and driving.

81

Field Notes from a MADD VIP Participant Observation Pilot Study The author watched from a balcony as an eyewitness to a MADD VIP presentation. Analysis of field notes suggested a relationship between content of the MADD VIP presentation and the following reactance theory antecedents: strong intent to persuade, forewarning, confrontation and public censure, and signs of perceived threats. Confrontational intensity increased as each victim impact story built upon its predecessor a stronger message about how drunk driving leads to misfortune and carnage. Presenters described drunk drivers as thoughtless and selfish for driving while drunk without a care for their victims. The style of delivery of the stories was confrontational. Confrontational alcohol interventions can irritate and increase reactance, leading to higher alcohol use (Miller, 1995; Miller, Benefield, & Tonigan, 1993). Strong intent to persuade was observed. Each presenter ended the presentation with a strong plea to the DWI offenders to stop drinking and driving. Some of these pleas suggested a low face threat (Dillard & Shen, 2005). A low face threat message was, ― All I am asking you to do is consider calling a taxi for a free ride home.‖ Many of the VIP speakers‘ pleas, however, presented a high face threat, such as when the presenter said, ― I am angry about what happened to me. People like you who drink should not get into the driver‘s seat.‖ A presenter‘s strong intent to persuade likely annoys and increases receivers‘ reactance and incites them to contrary behavior (Brehm, 1966). Forewarning occurred before the presentations. Courts forewarned DWI offenders that they would be listening to victim impact stories and viewing graphic photos of crashes resulting from drunk driving. The MADD host of ceremonies forewarned DWI offenders they would be hearing victim impact stories and that some 82

photos would be graphic. Forewarning is an aggravator that increases reactance (Petty & Cacioppo, 1977) Confrontation and public censure (face threat) was likely experienced by the audience in response to anti-offender bias in MADD VIP stories: DWI offenders were cast as the villains who caused preventable misfortunes to the presenters and their loved ones. The presenters publicly censured drunk drivers. Public censure (face threat) is an aggravator that increases reactance (Brehm, 1966). Signs of perceived threat to drunk driving behavior may explain why, as the presentations continued, an increasing number of DWI offenders leaned far backwards, away from the VIP speakers. Their angle of repose was roughly ten degrees backwards from an upright position. Offenders increasingly crossed their arms. It must be noted that crossed arms alone do not signal a sign of defense against threat. An observer of a crossed-arm posture may interpret that body language as defensive. Observers rated people with closed arms as rejecting a speaker‘s message (Machotka, 1965). However, the reason a person crosses his or her arms can vary on a continuum from positive affect ― this is comfortable‖ to negative affect, ― I disagree.‖ While arm crossing can signify different meanings to the observer versus the single motivation of the arm-crossing person, researchers have found that leaning away from the speaker signals disagreement, and discomfort with the message content or the speaker (Bukhari, 2006). The farther an audience leans backwards, the less they like the speaker. According to Mehrabian and Friar (1969) ― The mean angle of backward lean with liked addressees (1.4°) is less than the mean angle with disliked addressees (9.3°).‖ It is argued here that a ten-degree backward lean suggests negative reaction and implies a 83

sign of perceived threat to DWI offenders‘ identity or beliefs. This observation, correlated with the arms-crossing observation, increases likelihood that VIP audience members‘ increasing arms-crossing behavior signified increasing defensiveness to perceived threat. Perceived threat increases reactance (Brehm, 1966). Field notes suggest that as the MADD VIP presentations continued, the DWI offender audience became increasingly provoked or threatened, as measured by the degree of backwards lean in the audience. This assessment is corroborated by observations from VIP presenters. One of the MADD VIP presenters remarked to the audience that he sensed their hostility. Hostility is present before and during reactance. MADD VIP message content contained antecedents that have been known to invoke reactance: presenters‘ strong intent to persuade, confrontational presentations, and public censure of drunk drivers. Reactance may account for the failure of the MADD VIP intervention for some types of DWI offender demographics. Chapter 2 Summary Need to test message effects of the MADD VIP Due to the high societal and personal costs of DWIs, there is a need for courts to evaluate what is the best intervention message design that will deter future DWIs among known offenders. In order to properly evaluate intervention message designs, specific messages need to be tested against each other (O‘Keefe, 1988). MADD VIPs offer an opportunity to study naturally occurring variations of the different message types and their effects. This design offers an important contribution to message effects research that has been called for by Jackson, O'Keefe, Jacobs, and Brashers (1989).

84

In this study, two interventions were evaluated against each other: the MADD VIP intervention was tested against the standard and customary DWI School. There is a need to evaluate the emotionally message-driven MADD VIP presentations for their possible influence on DWI recidivism. This is accomplished in the present study through employment of five general message effects variables in evaluation of the MADD VIP presentation. These five message variables are context, content, function, intensity and pathos or emotional appeal. The frequency and levels of intensity of these five message factors and quantification of their strength of association with degree of recidivism, via a regression test, would support the argument that message reactance influenced offenders‘ recidivism behaviors. Need to test reactance theory constructs and assumptions in context of MADD Reactance theory constructs There is a body of literature on the role of reactance as a confound to intervention compliance and as an explanation for why health behavior change interventions fail. However, there is no literature that focuses on evaluating MADD VIP presentations for presence of reactance theory constructs. The acronym MADD suggests anger toward drunk drivers. Angry behavior change messages have been found to induce reactance and negative message effects. Thus, there is a need to evaluate MADD messages for level of reactance-inducing statements, coinciding with an empirical evaluation of MADD message effects outcomes. If a message sender conveys a strong intent to persuade then he or she may irritate the message receiver. If the message sender forewarns the receiver of an imminent message of threat, confronts the message receiver, and censures the message receiver, 85

then according to reactance theory the message receiver would react oppositely from the desired behavior. If the message itself is stressful, unremitting, and delivered in a punishment context, it is likely to invoke defensiveness in the message receiver. If the message sender censures, threatens, or publicly ridicules a behavior that a listener believes he has the freedom to perform, then the receiver is likely to respond with reactance. The listener will likely reassert his freedom by increasing his practice of the censured behavior. Reactors, such as drunk drivers reacting negatively to MADD VIPs, may go so far as to band together to perform their contempt of those who are censuring them. As reactance theory explains, message reactors are likely to increase and celebrate the practice the censured behavior to assert their freedom. This reactance behavior, as well as a collective reactance behavior, featuring the banding together of reactance practitioners, will be discussed in the chapter 5. The present study considers five message-sender and message-related theoretical antecedents of reactance. This study evaluates whether message-sender and messagerelated reactance antecedents are present in the MADD VIP message. These antecedents are confrontation, public censure, forewarning, and strong intent to persuade. These constructs are operationalized as level of reactance-inducing statements in VIP presentations. They are also operationalized as proportion of reactance-inducing statements. If there are strong ― doses‖ of these five theoretical reactance antecedents in the MADD VIP intervention message, and if the level of reactance dosages predicts DWI recidivism (a shorter survival time until rearrest) (Delaney, Kunitz, Zhao, Woodall, 86

Westerberg, Rogers, & Wheeler, 2005), then reactance theory would offer an explanation for increased recidivism observed among MADD VIP participants. Reactance theory constructs are tested here as an explanation for specific message effects. Research Questions The present study asks four types of research questions. First, are there reactance antecedents present in MADD VIP presentations? Second, if present, do MADD VIP reactance antecedents predict significant differences in DWI recidivism outcomes? Third, what are the demographic covariates, besides MADD VIP message effects, that account for effects in DWI recidivism among MADD VIP participants? Fourth, are MADD VIPs effective at reducing DWI recidivism? 1. At what levels are reactance antecedents present in MADD VIP presentations? 2. Do the 15 different MADD VIP presentations have different reactance message dosages? If so, this difference will become a covariate that will be controlled for by nested regression, known as hierarchical linear modeling. 3. Does the reactance message dosage (level of reactance-inducing statements and proportion of reactance-inducing statements) predict direction of emotional change score in the MADD VIP plus DWI School intervention group? 4. Does the reactance message dosage predict survival time to first recidivism within the MADD VIP plus DWI School intervention group, while controlling for covariates age, gender, and number of priors?

87

5. Does the reactance message dosage predict number of subsequent arrests within group for the MADD VIP plus DWI School intervention group, while controlling for covariates age, gender, and number of priors? 6. Are there different predictor variables for recidivism for those study participants with DWI arrests before the study (who arguably believe they have the freedom, a reactance theory assumption, to drink and drive) versus those participants with no prior arrests? 7. What are the demographic covariates that predict positive or negative message effects of MADD VIPs? 8. Are MADD VIP messages effective in terms of lengthening time to recidivism and reducing number of subsequent arrests?

88

CHAPTER 3: METHODOLOGY Researchers have identified a need for longitudinal17 research that explores MADD VIP intervention message effects upon DWI recidivism over multiple years (Kunitz, Woodall, Zhao, Wheeler, Lillis, and Rogers, 2002). The present study researches the long-term impact of a MADD VIP intervention using a 12-year randomized research study designed by Woodall, Delaney, Rogers, and Wheeler (2007). The original research design was defined as a 2x5 mixed factorial design, an incremental design with two group conditions (VIP, no VIP) and five times of assessment including the current study assessment. Factorial designs, which study multiple effects (main effects and interaction effects simultaneously), have been considered more efficient than studying one factor at a time since 1926 when Sir Ronald A. Fisher introduced the term factor in his article titled ― The Arrangement of Field Experiments‖ (Box, Hunter, & Hunter, 2005; Fisher, 1926, p. 511). In the original study, the first group condition was a DWI School Only (no VIP) comparison group. The second group condition was a DWI School group that also received the MADD VIP intervention. Although the words rehabilitation and intervention have been used together to describe the VIP, the MADD VIP intervention is not a rehabilitation treatment in the medical sense. It does not treat or attempt to cure a pscyo-pathology such as an addictive substance abuse behavior. The VIP program does not purport to rehabilitate the offender from drinking alcohol or using illegal drugs. It is a psycho-social intervention designed to increase deterrence of driving while intoxicated. The intervention is also referred to as a study condition that may be tested to determine 17

Longitudinal research consists of a study whose time period for data collection stretches over a long time frame. Historically researchers have referred to a one or two year study as a longitudinal study.

89

the level to which the intervention deters an unwanted behavior. This is a fine distinction, but one that is necessary in order to avoid the implication that the MADD VIP is a medically prescribed treatment to rehabilitate addicted substance abusers. The original factorial study was designed to empirically investigate only two group conditions, MADD VIP plus DWI School compared to DWI School Only. The present study, an extension of the original study designed by Rogers, Woodall, Rao, Polacsek, & Milan (1994), observationally analyzed the original study‘s VIP presenter transcripts to determine whether reactance antecedents were present. Two levels of reactance antecedents (low versus high) were observed in post hoc analysis of the original study‘s VIP transcript data. Thus in the present study there are three levels of intervention condition, (1) DWI School Only, (2) DWI School plus low reactanceinducing VIP, and (3) DWI School plus high-reactance VIP. Kelly and Di Marzo Serugendo (2009) found that categorization of levels of traits, and inclusion of a zero trait level for those individuals for whom the trait/treatment/antecedent were missing, increased the robustness of the regression analysis to noise or variance. Each missing observation for a trait is assigned a unique level of a dummy factor associated with that trait. Each missing observation can be assigned the same dummy value. The program accepts that value as a real observation. The result is that the program handles the analysis as if all traits were observed for each animal with records. In essence, what happens is that the residual element associated with the dummy observation is estimated to be zero. The algorithm to compute the asymptotic average information matrix uses those zeroes which add nothing to the AI matrix so that it is the same as if it was computed with a more complex 90

algorithm. Similarly, L|y also is the same as if it was computed with missing observations ignored. (p. 1) The present study adapts the above approach and finds that categorization of the variables into levels in the case of message dosage levels serves to increase the sensitivity of the tests. The significant differences between low and high VIP message dosages, substantiated via ANOVA contrasts, are tested against absence of the VIP message dosage, where the non-VIP dosage is set as a zero level. Thus, as in the Kellly and Di Marzo Serugendo approach, the missing VIP dosage group members are all assigned the same zero value of VIP dosage. The loglinear logit regression handles the non-VIP group as if they were observed as receiving a VIP dosage level. This approach allows the intervention group, at two levels, to be compared with a third level of no intervention, and offers a tri-level quantification of the variable of interest, level of VIP message dosage. Three levels of VIP message dosage ― observed‖ as such increases the sensitivity of the regression, allows for an increase in power through inclusion of more cases, and increases ability to detect a VIP message effect. The design for the current study remains defined as a 2x5 mixed factorial design, however it may be thought of as a 2x5 mixed factorial design that evolved into a 3x5 mixed factorial observational design at the last time of assessment, the assessment conducted in the present study extension. The use of three groups in two different contexts (VIP, no VIP) helps to overcome the effect of context upon the message effects study. In single-context studies it cannot be known whether the message effect is confounded by the context. An advantage of the original researchers‘ experimental design

91

(Rogers, Woodall, Rao, Polacsek, & Milan, 1994) is that there has been a control for confounds such as context. This investigation is the first randomized design that tests communication reactance theory constructs by operationalizing them as specific message types (see Quantifying reactance message constructs). Message types, as associated with corresponding theoretical reactance antecedents, were hierarchically ordered during a qualitative analysis, according to their reactance-inducing intensity (see Table 4-2). This ordered scale was used to code 2,021 statements made during 15 sampled VIPs. After 2,021 individual VIP statements were coded according to their reactance-inducing intensity, the mean reactance-inducing intensity for each VIP was calculated. In this manner, the level of reactance-inducing intensity for each of 15 VIPs was obtained (see Figure 4-2 and Table 4-3). It was found via Fishers LSD analysis that VIPs were not all of the same level of reactance-inducing intensity. VIPs could be categorized into two levels, high versus low reactance-inducing intensity. VIPs were subsequently categorized into low reactanceinducing VIPs versus high reactance-inducing VIPs. Thus, the VIPs were bifurcated into two different levels of reactance-inducing intensity. Recidivism outcomes were regressed upon two levels of VIP categories according to their low/high reactance dosage, which was the frequency of occurrence of reactanceinducing statements weighted by their intensity (see Figure 4-4: Effect of MADD VIP reactance-inducing levels upon categories of offenders). In the above manner, reactance theory constructs were operationalized as specific messages types and tested for message effects. 92

This methods chapter begins with a traditional discussion of general population and study design characteristics, followed by a discussion of parametric methods assumptions. Next, the chapter describes the calculation and testing of variables of interest for violations of the assumptions of parametric statistics. The methods section ends with an explication of qualitative and quantitative methods employed in the present study, the data-structure rationales for why loglinear logit regression and Cox Proportional Hazards Regression were chosen in favor of other tests, their respective units of analysis, procedures, validity, and reliability. Population and Sampling Participants There were 833 DWI offenders in Bernalillo County, New Mexico, who participated in the original Woodall et al. (2006) study. There were 426 participants in the MADD VIP plus DWI School intervention group (6% had prior DWI arrests). There were 406 participants in the DWI School comparison group (5% had prior DWI arrests). Seventy five percent of the participants were male. Three cases had missing values for gender. Males were more likely to be rearrested than females t (828) = -2.9, p = .004. The mean ages of the intervention groups were relatively equal (DWI School comparison group: 31.2 years, MADD VIP plus DWI School intervention group: 31.4 years). The MADD VIP plus DWI School intervention group had 3% more young participants under age 30. The under age 30 group was more likely to be rearrested, which resulted in a necessity to stratify the no prior group by age.

93

The ethnic composition of the population sample was balanced between group conditions. No ethnic group was observed to be more frequent in one intervention condition or another. The study sample was 46% Hispanic, 36% Anglo, 12% Native American, 2% African-American, and 4% other ethnicity. The present study under-identified those with prior DWI arrests. The original researchers (Rogers, Woodall, Rao, Polacsek, & Milan, 1994) enrolled the first participants in 1994. In 2007, Woodall, Delaney, Rogers, & Wheeler recorded the earliest prior arrest as occurring in 1972. With earliest prior arrest dating in 1972, they found 165 of the 833 participants had prior DWI offenses. The current study used an earliest prior arrest date of July 1995 and identified only 47 offenders with prior DWI offenses in the most recent weeks and months prior to the arrest that triggered enrollment among participants enrolled in the second year of the study. Thus, the current study is limited in its generalizability to those with prior arrests. The present study has worked with data that under-identified prior offenders and only included recent prior offenders in the prior offender category. Over 100 prior offenders, identified by Woodall, Delaney, Rogers, & Wheeler (2007) as prior offenders were categorized as no priors in the present study. A later analysis of the same participant pool with a start date for prior arrests at July 1984 found ten more, 175, prior offenders in the no priors group of the original study. Depending on how far in the past a researcher sets the search for prior arrests, a participant with one very distant prior arrest may fall into the no prior category. This is a consideration for future designs. The prior offender start date limitation of the current study means that among one third of participants, those enrolled prior to July 1995, there could have been prior 94

offenders with DWI arrests in recent weeks and months. The discrepancy between the original study and the present study data sets, the under-identification of participants with prior DWIs, had the effect of reducing likelihood the present study would produce significant results because over 100 prior offenders included in the no prior group would likely bias results against finding significant differences between prior and no prior groups. The methods employed in the present study concerning prior offenders will necessitate replication with an earlier prior arrests date to validate result concerning the effects of MADD VIPs upon those with priors versus no prior DWI arrests. Because an earlier prior arrest index date will identify more prior offenders within the current sample population, the future analysis may find higher levels of significance and effect sizes between priors versus no priors than in the present study. However, no conclusions may be drawn regarding priors versus no priors until data are reanalyzed with an earlier prior arrest date. It may be fruitful to investigate three groups: no priors, no recent priors, and recent priors. Recruitment, consent, and non-adherence to condition In the original Woodall et al. study (2007), of which this study is an extension, participants were recruited with the help and support of the Metropolitan Court. After the court assigned offenders to DWI School but before the DWI School date, researchers used the court‘s DWI School roster to pre-randomize participants to one of two group conditions: DWI School Only versus DWI School plus MADD VIP. It should be noted that the time lag between assignment to condition and attendance at condition increased the chance that offenders would not attend their prerandomized condition. Participant freedom to not attend their condition increased the bias 95

against finding significant results in the analysis more so than would have been the case if the participants had been residents in a treatment center and attendance were under control of the researchers. This non-attendance occurred both in the intervention and control groups equally. The randomization of participants to group may have controlled for the unequal study attrition between groups but could not control for the non-inclusion of less-functional drunk drivers, those who did not function well enough to attend court mandated interventions despite the heavy penalty of having an arrest warrant issued as a result. Only 80% of those mandated to DWI School arrived at class for their designated intervention. Non-attendance bore the penalty of an arrest warrant. Because of this heavy penalty for non-attendance, it may be speculated that those who were not present, 20% of the offenders during this period, might have been less functional than those who attended their court-assigned class. Because the participant sample excluded the less functional drunk drivers, the sample was biased toward obtaining treatment efficacy but this was equally true of both the intervention and control groups. Thus, the differences between intervention and control groups found in this study would only be representative of the more functional drunk drivers. In any case, the intervention and control groups would have been equally affected by this more-functional drunk driver bias. For those participants who attended the DWI School, participant consent and enrollment was enacted at the end of the DWI School class. At the conclusion of DWI School, the study enrollment personnel read the consent form aloud to participants as they silently followed along, reading their personal copies. The consent form listed benefits of participation in the study. Benefits of participation in the study were specified 96

as (1) possible benefits to society from increased knowledge about DWI intervention efficacy, and (2) possible benefits related to improved DWI interventions for future DWI offenders. DWI School attendees who accepted the invitation to participate in the study signed consent forms. Because the recruitment process made an appeal to prospective participant altruism, participants who signed the consent demonstrated a willingness to engage in an activity that would not benefit them personally but would benefit society and future offenders. The consent form specified that study participation was voluntary. Accordingly, not all offenders elected to altruistically participate in the study. Study recruitment ratio was 70% of those present at DWI school. Thus, only 70% of offenders were recruited out of the 80% who attended DWI school. These numbers further suggest that the DWI school recruitment pool may not have been 100% representative of all DWI first offenders. Those who had been pre-randomized to DWI School Only condition were excused after signing their consent form as they had already met their condition by DWI School attendance. Researchers send to the Metropolitan Court a list of participants who had been randomized to DWI School Only. The court then sent excusal letters to DWI School Only group members to excuse them from attending a MADD VIP. At the conclusion of DWI School and after signing consent forms, those who had been prior-randomized to the VIP plus DWI School condition were assigned to a VIP. The VIPs occurred at 30-90 days later after DWI School. Due to this time lag there was additional participant attrition.

97

Not all VIP-assigned participants attended VIP. The researchers elected to respond to this setback by employing the conservative intent to treat approach for data analysis. Intent to treat approach means that those who were assigned to VIP were treated as if they had attended, even if they did not show up. Intent to treat is a controversial approach, as some researchers have pointed out that including nonadherent participants increases probability of a Type II error, that is finding no effect when there was indeed an intervention effect (Fergusson, Aaron, Guyatt, & Hébert, 2002; Gross & Fogg, 2004; Michalak et al., 2002; Wert, 1995). Thus, the intent to treat approach is a most conservative approach to study design and analysis. Because it is more difficult to detect an effect using the intent to treat approach, if an effect is detected, then that effect must be strong to have overcome the bias against finding an effect. DWI school is a first offender intervention and the original MADD VIP study recruited participants from DWI school because the original study designers (Rogers, Woodall, Rao, Polacsek, & Milan, 1994) intended to study only first offenders. However, study designers knew it was possible that prior offenders would be recruited at DWI school because of loopholes in the court system. Due to loopholes, some multiple DWI offenders are allowed to have their sentences reduced to first offense status. This is referred to in the court system as being ― pled down.‖ A multiple offender who has been assigned first offender status is processed through the court system as if he or she is actually a first offender and is assigned to first offender remediation such as DWI School and MADD VIP. Knowing this misallocation of offender status was possible, the original investigators obtained court arrest records for all participants. They identified participants who were prior DWI offenders. Prior offenders were identified in the study database as 98

prior offenders but retained in the study and otherwise were treated the same as first offender participants. Random assignment Prior to DWI School, the original Woodall et al. researchers used the court DWI School roster to randomize participants to one of two study conditions. 1) The intervention condition consisted of a MADD VIP and DWI school intervention. Intervention participants took part in a state-mandated DWI school designed for first-time offenders, and attended a Victim Impact Panel organized by MADD for Bernalillo County residents. 2) The comparison condition consisted of DWI School Only. State of New Mexico law requires all first-time DWI offenders are to attend DWI school. The DWI school condition thus represents a usual and customary treatment for first-time DWI offenders. Design, Methods, and Procedures 2x5 Mixed Factorial Design The study, including the original Woodall et al. studies, was a randomized 2 x 5 mixed factorial design. There were two groups, intervention and comparison. Both intervention and comparison groups received a normal and customary DWI school. The only difference between the two group conditions was whether or not the participants received a MADD VIP intervention. There were five times of assessment for the MADD VIP plus DWI School intervention group and four assessments of the DWI School Only comparison group. The design was a mixed design employing both within-subjects and between-subjects factors.

99

A within-subjects factor means that the design tested each group both pre and post intervention. All subjects were given a pre-test and a post-test, and these two withingroup tests serve as a within-subjects factor. Participants were also divided into two groups. One group was the experimental group of interest (the VIP plus DWI school group) and the other group was a comparison group. The two-group design serves as the between-subjects factor. Table 3-1 lists the sequence of assessments for the 2x5 mixed factorial study design. Table 3-1: 2x5 Mixed Factorial Design Time of Assessment

MADD VIP + DWI School Intervention Group

DWI School Only Comparison Group

Pre-MADD VIP

Pretest 1 Questionnaire on enrollment day at DWI School

Pretest 1 Questionnaire on enrollment day at DWI School

MADD VIP

Yes

No

MADD VIP Post (same day)

MADD VIP Pre and Posttest 2 Questionnaire

No

1-Year Follow-up

Post-test 3 Questionnaire

Post-test 2 Questionnaire

2-Year Follow-up

Post-test 4 Questionnaire, Traffic Safety Data

Post-test 3 Questionnaire, Traffic Safety Data

12-Year Follow-up (this study)

Post-test 5 Traffic Safety Data

Post-test 4 Traffic Safety Data

100

Operationalization of reactance theory constructs into variables Reactance theory, in order to test whether it explains the MADD VIP outcomes, must be operationalized. Its abstract theoretical constructs must be translated into variables whose definitions describe instances of the theoretical constructs in the observable world, the study data. Whether or not reactance theory explains MADD VIP outcomes depends not only on whether theoretical constructs are operationalized but upon the accuracy of that operationalization. A scientist, when testing theory, must therefore take care to explain the process of the operationalization and how the operationalization(s) were tested for accuracy. The process for testing accuracy of operationalization of the reactance theory constructs in the present study is complex. The process is necessarily complex due to the need to establish the causal progression from statements made by VIP presenters, their reactance dosage, the mean reactance dosage of each VIP sampled, whether the sampled VIPs were equal or different intervention treatments. If VIPs are not uniform in treatment dosages, then they must be treated carefully—different levels of dosages must be considered as different levels in the treatment dosage variable. Only when dosages are accurately measured and levels established could the effects of reactance antecedent levels in VIP interventions be accurately measured against DWI recidivism, the behavioral outcome of interest. A series of operationalization processes, each a successive layer of scale, are described here. Tests of the data against theoretical constructs were conducted at three different levels of scale. At each level of scale, the data are tested to insure that the only 101

difference between variables in the final test, at the final level of scale, is indeed the level of presence and level of intensity of reactance constructs. Thus, the accuracy at which the theoretical constructs were operationalized into VIP message variables is determined in an articulated three-step process, each step executed at a different level of scale for the data. This multiscale operationalization process is described in chapter 3: Methodology. The analysis multiscale analysis begins with a set of raw data, the transcribed text of 15 VIP presentations. The raw data consists of words, sentences, and ideas conveyed by a varied combination of 57 VIP presenters at 15 different VIP presentation occasions. It is considered that each VIP occasion delivered a ― dosage‖ of theoretical reactance constructs, where the dosage could range from a reactance theory dosage of zero (no reactance theory constructs present, such as a happy statement) to a reactance theory dosage of level 8 (the highest-intensity reactance-inducing antecedent, ―a nger directed at the listener‖). One would like to obtain the theoretical reactance dosage for each VIP. However, to obtain the theoretical reactance dosage of a individual VIP occasion, the theoretical reactance dosage of each statement made during that VIP must be measured. After the theoretical reactance dosage of each statement within a VIP is measured, then the theoretical reactance dosage of all statements within a VIP may be summed and averaged. First, analysis of the raw data is conducted at the scale of the individual message act‖ is unitized as a unique statement. Each statement is a unique where each message ― instance of an expression of an idea, whether that idea is expressed in one or many sentences. Contents of one unit begin with statement of a new idea and end before a new

102

idea is introduced. The researcher thus unitized all VIP statements into 1,021 idea units, or message acts. Independent coders assigned each of these message units to a message type ranging from happy to angry. The coders assigned message types while using a coding system of eight message types that both discriminated between message types and ordered message types according to a continuum of increasing confrontational intensity. The ordering according to confrontational intensity was determined in two ways: (1) qualitative verification of VIP message types by agreement between independent coders and (2) quantitative verification via matching VIP message types with laboratory-created message types whose location on a continuum of increasing reactance-inducing intensity had been empirically established. A VIP‘s mean level of reactance-inducing statements was determined by averaging each VIP‘s frequency of reactance-inducing message types, weighted by the intensity at which these message types had been known to induce reactance. This was the level of analysis carried out by coders and verified by previous research on the frequency and intensity of VIP messages, on the individual message level of scale. Coders‘ decisions were based on text of the VIP statements. They did not hear audio or see video of the statements being presented by VIP presenters. Coders were blind to the demographics, the time and place of the statements, and all factors that would identify types of persons making the VIP statements that could possibly bias the coders in their assignment of message types on a continuum of intensity. Second, analysis is conducted at the scale of the VIP where each VIP presentation, presented to a unique group of DWI offenders, is a unit of analysis, an 103

independent variable category in ANOVA. In ANOVA, the value of the dependent variable was set at the VIP‘s mean level of reactance-inducing statements. Variance of VIP‘s mean level of reactance-inducing statements is analyzed to discover whether all VIPs in the sample offered the same type of intervention, or whether VIP interventions varied in type. VIP interventions were found to vary in type. Two distinct types of VIPs emerged from the analysis: high-reactance-inducing VIPs versus low-reactance-inducing VIPs. High-reactance-inducing VIPs dosed their participants with a significantly different mean level of reactance-inducing statements than low-reactance-inducing VIPs. The sampled 15 VIP interventions were bifurcated into their two constituent intervention types: low reactance-inducing VIPs and high reactance-inducing VIPs such that VIP message effects could be analyzed separately for each type of VIP. Third, analysis is conducted at the scale of type of intervention. The type of VIP was the unit of analysis. The message effect of the high-reactance VIP intervention type was tested as one level of treatment versus the message effect of the low-reactance VIPs. Both are regressed upon DWI offender recidivism. How literature informed the study design; Study’s contribution to literature The following discussion is presented in outline form to clarify the arguments and their supporting sub points. This discussion articulates in which ways the bodies of literature, from thematic analysis, message effects and reactance message effects research, inform the present study and in which ways the present study contributes to message effects research. It further discusses the interaction between qualitative and quantitative analysis, how they informed each other in this methodological symbiosis.

104

How the literature informs methods chosen for the design Thematic analysis research informs methods chosen for the design of the present study. Thematic analysis is a means to induce meaning by organizing observations by frequency or strength of themes that occur within observations. Observations, in the present study, were obtained through the reading of texts. These texts had been transcribed from VIP presentations that had occurred in vivo. In vivo is a Latin term that signifies in real life. In vivo analysis is a qualitative method particularly suited for analyzing observations derived from real life. Real life observations are different from observations derived from artificial laboratory-created environments and so require different methods of analysis. The different sources of these data, in vivo versus artificial sources, produce data with different qualities and data structures. Qualities of laboratory data are a function of the empirical design paradigm. Empirical design requires structures that pre-assign variables, the data structures or containers of the data, the buckets in which one contains the data. The data, even before they are collected, are predestined to fit into these pre-designed variable categories. The pigeonholes into which they will be relegated are by necessity unique and isolated. Isolation between discrete data categories is a convention for studies where statistics will be used to analyze differences between categorical data. Each category must serve as an individual with independent probability of being observed, unaffected by neighboring statements. Independence of categories is a necessary consideration because the probabilistic foundations of classical statistics formulas depend upon independent probability of occurrences of each data point that is observed, and the normal distribution of those occurrences (Kelton, Sandowski, & Sturrock, 2004). The 105

identification and import of independent probabilities is often explained with the familiar example of a flip of a coin. Data that are collected and stored within non-overlapping variables are not allowed to merge across variables, morph, grow, or change in any way. As independence of observations is a necessity, in the present study the data are assigned predetermined and non-overlapping qualities low, medium and high levels of the variable of interest, the level of VIP message dosage. Without categorization, the noise or variance of the data within each category would contribute to a case of overlapping or probabilistic dependence of observations upon each other. For example, one statement is related to the next statement in a conversation or VIP presentation. These relationships between statement types would undermine independence of observation assumptions of classical statistics, were such statements to be presented on a continuum of real number values. Any one message‘s meaning and effect could probabilistically overlap or depend upon another. In other words, if the data were regressed in its continuous form then data points would be assumed to reside on a continuum with qualities ranging from least to most. Their variance or elasticity due to their relationships to each other could result in dependence noise. In such a case of dependent data neighborhoods, their proximity would increase probability of their relational states. Relationships, or interdependence, would be salient characteristics of the data set. Interactive relationships between message sequences would serve to increase data volatility and increase overlapping of states of interdependent probabilities, decreasing data point independence and decreasing data suitability for statistical tests. 106

Thus, categorization, when the argument for categorization is supported by an ANOVA that substantiates sufficient and significant difference between categories, reinterprets data point probabilities of observations as non-overlapping independent categories. Since all individuals within the same category are assigned the same value, and since those three categorical values are clearly discrete and independent of each other, the categorization of otherwise noisy data can enhance the power and sensitivity of a regression. For such a case where both the independent and dependent variables are categorical, a loglinear logit regression is appropriate. For these reasons the present study categorized message types to reduce data noise, increase independence of observations, and increase power to detect an effect. The more independent are the categories of the variable of interest, the more power is enabled in statistical tests. The present study defined of independent categorical variables that were operationalized from theory informed by previous empirical research. The present study employed findings of previous research to classify message types into an ordinal hierarchy. Each level in the ordinal hierarchy of VIP message types possessed an independent non-overlapping probability of observation, empirically supported by reactance-inducing message archetypes established in previous research. The procedure of identifying and defining independent observational categories of reactance-inducing message types, and the scientifically defined reactance message types that informed this hierarchy of reactance-inducing message categories, is described in detail in the procedures section of chapter 3: Methodology. Using the procedure that will be describe later in more detail, the preestablished variable categories were defined for coders, where each stepped categorical level in the ordinal hierarchy of reactance 107

message types possessed an independent and increased level of probability of being a reactance inducing statement type. Once preestablished message type levels of the reactance-inducing message variable levels were defined as independent, and once VIP messages were coded and binned with these, then tests of different message effects of the different message categories could begin. The data within categories were tested against each other to discover whether their relationships supported the theoretical relationships predicted by the theory being tested. Did increased levels of reactance-inducing dosages correspond with increased reactance behavior in the form of increased drunk driving? In the present study, the process of preorganizing data, procedures, and methods to test pre-determined theory is grounded in the foundation of traditional scientific research. This tradition is known as the quantitative scientific paradigm. The quantitative paradigm is used in the present study to test whether data collected from MAD VIP presentations, and their relationships, support the theoretical relationships predicted by reactance theory. In order to transform naturally occurring text, text that was not preassigned variables or categories by the researcher. This approach diverges from previous reactance theory research. In previous laboratory research, the researcher assigned reactanceinducing message archetypes as having different levels of reactance inducement, then tested those levels using most often a self-report measure such as a questionnaire. The present study began used in vivo real life analysis of date to observe weather messages that occurred in real life interventions could be categorized into independent (nonoverlapping) probabilistic categories. Those independent qualitative categories might 108

subsequently be employed in quantitative analysis to test applicability of reactance theory to MADD VIP interventions. In vivo analysis can thus be employed as a precursor and pre-processing procedure to prepare qualitative field data for statistical analysis. Strauss & Corbin are pioneers of in vivo analysis. Strauss and Corbin suggest that when quantifying the influence of a theme, influence should be measured by two variables: both frequency and intensity. Frequency is a numerical count whose face value is naturally quantifiable. However, the procedure for measuring theme intensity requires qualitative judgment because ― theme intensity‖ does not immediately present a quantifiable face value, that is, unless there is empirical evidence upon which to base those judgments. In the present study, intensity was judged based on empirical results from previous research on levels of reactance induced by different message types. While developing the themes from the VIP texts, the researcher continually considered, ― what are the recurring message types and how do they relate to empirical research on reactance antecedents and levels of reactance inducement, if at all?‖ Message effects research informs methods in the present study. Message effects researchers have applied various operationalizations to meet the challenge of quantifying ― message intensity‖ because they have recognized the role message intensity plays as a variable in message effects research. Message intensity has been operationalized and quantified by different message effects researchers in the following different ways: A. Message effects researchers consider the intensity of a message as a variable of interest that may influence a persuasive effect. Message effects researchers commonly refer to message intensity as ― message strength.‖ However, there is no standard definition for message strength in message effects research. 109

Because the lack of a standard definition for ― message strength‖ has not been previously addressed in the literature, there is a brief discussion here of different definitions and operationalizations for this variable ― message strength.‖ 1. Some researchers have operationalized message strength as level of source authoritativeness, a quality of the message sender. Other message effects researchers evaluate a message as stronger if it is delivered with more emotion, a quality of the mode of message delivery. 2. Some researchers evaluate a message as stronger if the message source is more authoritative. In most cases, the researchers assign the level of authoritativeness of a message. For example, some researchers designate a message as authoritative if the message source is an academic expert or a peer reviewed journal article. 3. Other researchers do not assign level of authoritativeness of a message. They do not assume the authoritativeness of a message source to be constant upon all types of message receivers. These researchers consider whether the same message source can represent different levels of authority to different types of message receivers. These researchers may conduct manipulation checks to determine which source a study participant considers more authoritative. These researchers may disaggregate the data, creating subpopulations based on different participant traits. They have found that different personality types, such as internal versus external monitors, attribute authoritativeness differently. 110

B. Reactance message effects research informs methods in the present study. Reactance theory researchers have typically operationalized message strength differently than ― level of message source authoritativeness.‖ Generally, reactance theory researchers consider message strength as associated with the emotional intensity with which the source delivers the message. 1. Reactance theory assigns more weight to more intense message themes. a. Higher intensity levels of reactance-inducing statements in message delivery increase reactance to that message, b. Higher intensity levels of threat in a message increase reactance to that message, c. Higher intensity levels of confrontation in a message increase reactance to that message, and d. Higher intensity levels of loss frames in a message increase reactance to that message. 2. Reactance researchers have noted that the level of reactance inducement increases with increased levels of these antecedents. For example, reactance increases with increase in message source anger or confrontation. 3. In consonance with reactance theory and reactance research, in the present study message strength was operationalized as level of emotional intensity with which a message is delivered. It organizes emotional intensity in an ordinal scale that is comprised of escalating intensity of theoretical reactance antecedents. For example, source anger has been found to 111

induce the strongest reactance. Therefore on the ordinal scale in the present study ― I am angry with you‖ is a message type that is assigned the highest level of reactance inducement on the ordinal scale of message types that induce reactance. The position of each message type on the ordinal scale of level of reactance-inducing statements is discussed in the context of Tables 3-2 and 3-4. The rationale for assignment of message types to their position on the ordinal scale is based upon: a. The definition and definition scope of reactance antecedents in reactance theory literature, b. The scope of reactance antecedents as they are operationalized in empirical research on reactance-inducing message types, c. The degree of matching of the in vivo message types to those messages created and manipulated in laboratory settings, and d. The empirical findings on which message types induce stronger reactance than others. How the present study extends message effects research methods Previous reactance researchers created reactance-inducing messages and then they manipulated these messages in an artificial laboratory environment. The present study is informed by these studies, but it neither creates nor manipulates reactanceinducing methods, rather it observes naturally occurring reactance-inducing messages in real life, in vivo, and observes different levels of reactance associated with exposure to different dosages of reactance-inducing messages.

112

This study‘s definitions, operationalizations, and quantification methods are informed by previous empirical findings on levels of emotional intensity. Previous findings are employed in organization of observed reactance-inducing statements into an ordinal scale. The present study methods are different from and contribute to the message effects literature in the following ways: A. Reactance-inducing messages in the present study were not created by the researchers, but rather observed in vivo, in life, in a natural environment, B. In the present study naturally occurring messages have been thematically coded for both reactance-inducing statement frequency and intensity, the designation of level of intensity guided by previous reactance theory empirical research, C. Naturally occurring reactance-inducing messages were categorized by archetypal qualities and ordered in an ordinal scale from lowest to highest level of reactance-inducing strength. D. Where one value for a reactance-inducing message represented both its frequency and intensity, E. Where the frequency and intensity of all reactance-inducing messages occurring within a dosage of an exposure were aggregated to sum a total reactance-inducing message dosage that occurred in one time and space. In the present study, multiple message sources 57 VIP presenters contributed to a total reactance-inducing message dosage. It may be argued that a message dosage derived from an aggregate of multiple sources, where each message source has different characteristics such as age, gender, appearance, is a more accurate representation of a 113

reactance message influence than a message dosage derived from one-message source. This argument has been made in message effects literature by Jackson, O'Keefe, Jacobs, and Brashers (1989). For example, many messages categorized into archetypal categories create a stronger and more representative data point than one message. And also, multiple messages collected from multiple sources and categorized into archetypal categories create a stronger and more representative data point than messages collected from just one person. If one source has certain age, gender, appearance, then it cannot be known whether those variables are influencing reactance inducement more or less than the level emotion with which the message is delivered, or whether age, gender, or appearance variables are interacting with the emotion level variable to affect the level of reactance inducement. However, if multiple sources representing different age, gender, and appearance are all exhibiting similar high levels of emotion, then a measure of their aggregate emotional intensity and frequency of reactance-inducing message is less susceptible to bias due to the age, gender, or appearance of any one of the sources. F. In the present study, independent coders, in this case communication scholars, blindly coded messages from an ordinal scale of increasing reactance inducement. Coders were not conversant in reactance theory. They coded statements based on category definitions and examples that did not refer to these statements‘ level of reactance inducement. The ordinal scale use by coders was represented to them as a scale of increasing emotional intensity in a message.

114

How the interaction between qualitative and quantitative analysis informed each other in this methodological symbiosis. In the present methodology, results from two complimentary forms of analysis, qualitative and quantitative were employed symbiotically in a beneficial methodological feedback loop. The outputs from empirical studies in the literature contributed to the inputs for the qualitative analysis. The outputs from the qualitative analysis informed inputs, in terms of independent variable values, for the quantitative analysis. Each form of analysis contributed to the depth and accuracy of the other. Identification of qualitative themes was informed by empirical reactance antecedent research. For example, the message type, ― I am angry‖ had been found in empirical studies to induce reactance. The analyst was sensitized to recognize anger messages during the constant comparison analysis. Because anger messages were frequent and intense they were qualitatively assigned their own category. Further, findings from empirical research on reactance message types and their intensity relative to each other influenced the qualitative arrangement of naturally occurring message types into hierarchical relationships. For example, researchers had found that ― please change‖ was not reactance inducing, and therefore neutral, while the slightly different message ― you should change‖ was reactance inducing. Further, an angry statement such as, ― I am angry about what you did‖ had been found to induce more reactance than the first two statements. Thus in the hierarchy of levels of reactanceinducing statements, the message type ― please change‖ was ordered as least reactanceinducing of these three examples. ― I am angry about what you did‖ was a message

115

category that was arranged hierarchically as most reactance inducing, relative to the other two examples. Quantification and assignment of high/low levels of intensity was facilitated by the hierarchical arrangement of increasingly influential levels of reactance-inducing statements. This hierarchical ordering yielded an ordinal scale of message types according to their expected effect. The findings from the statistical analysis supported the choice of the ordinal positions of messages on the scale. The qualitative analysis provided the quantitative analysis with operationalizations of the independent variable levels of reactance-inducing statements, offering a frequency + intensity quantification that deeply supported the face validity of the values assigned to levels of reactance-inducing statements. Due to the qualitative analysis the operationalizations and levels of the independent variable were richly supported. Methods The methods section discusses how both qualitative and quantitative methods were employed to answer research questions. The use of these two methodological approaches represents employment of two different investigative paradigms. These two methodological approaches provide different sources of knowledge, different lenses of examination, for this study. Each of these paradigms subscribes to a distinctive epistemology. Epistemology is a theory that defines the scope of what is included and excluded as knowledge. An epistemology defines what knowledge is, where it comes from, how it is acquired, and types of reasoning that can be applied to existing knowledge to create new knowledge.

116

These two epistemologies need not be at odds. The present study joins the growing body of literature that employs both qualitative and quantitative approaches to investigation. This dual-epistemological approach to scientific investigation is referred to as triangulation, a multifaceted approach that involves more than one method or data set. Proponents of triangulated mixed-methods consider this approach to result in a balanced and rich understanding of the data. Qualitative method In this section, the qualitative method used to investigate themes in the VIP transcripts is described. This section discusses considerations made in the qualitative analysis process due to unique factors in the transcripts and the rationale for the choice of unit of analysis. This section concludes with a general discussion about how validity and reliability apply to qualitative analyses. Constant-comparison analysis Constant-comparison analysis is also known as grounded theory method (Strauss & Corbin, 1990). Constant-comparison analysis is a systematic research methodology that operates in a reverse direction compared to the scientific method. Textual data are approached without a prior theoretical framework and themes are extracted, identified as codes, and used to classify sections of text according to the preconceived unit of analysis. Codes are developed, grouped into categories, modified, split, merged, and may be hierarchically ordered or nested. Relationships are observed between categories that may give rise to theory. Another name for this method is the grounded theory method. The constant-comparison method is a process of refinement and redefinition of key themes. In this manner, the structure and relationship among key concepts in the data emerges from 117

the data rather than the data being fit into preconceived categories. This avoidance of preconceived categories allows for generation of new knowledge from the researcher‘s interaction with the text. Constant comparison analysis method is useful when it is desirable to explore a textual data set to discover new knowledge. Unit of Analysis. The minimum unit of analysis used in the present study was one line of text in the QSR N6 software. QSR N6 software is described in the next subsection ― Software and its Use.‖ The one-line unit of analysis was used because it allowed for the most flexibility in analysis of the data. In cases where a coded theme ran beyond one line of text, all the relevant lines of text were selected. In such a case, the QSR N6 program counts that multi-line coded text as one instance or unit. In QSR N6 any one line of text can be coded with multiple codes. Codes can overlap yet still be counted individually. This flexibility reflects the rich and overlapping nuances that occur in natural language and ensures the highest fidelity in transfer from textual meanings to quantified instances. Validity and Reliability. In qualitative analysis, the reliability and validity of results are determined by how well the analysis fits the data. This fit depends upon the logical arguments of the analyst. One proof of fit is that independent coders found the eight reactance intensity codes sufficient to describe all 2,021 MADD VIP statements. Validity emerges from the cogency of the analyst‘s arguments derived from examples in the data. In qualitative analysis, the observer‘s standpoint is considered valid if it is explained clearly to the reader.

118

Quantitative methods This section discusses the quantitative methods employed in answering research questions: content analysis, Content analysis method. Content analysis is a quantitative method of textual analysis. This method controls for researcher bias by training coders to independently and objectively analyze the data in a text. Units of Analysis. The units of analysis and coding of the VIP presentation, being qualitative, is necessarily crude. However, previous reactance research findings and scholarly literature informed the choice of codes and units of analysis. The considerations for code identification are discussed later in this chapter. The consideration of choice of unit of analysis is discussed here. One consideration in choice of the unit of analysis As a practical fact, we cannot concerns sampling. In the words of Jackson (1992, p. 22), ― apply random sampling procedures to message classes, as we can to human populations.‖ The message classes in this content analysis were not sampled. The messages used in the analysis comprised 100% of each VIP presentation. The VIP presentations themselves represented 100% of those MADD VIPs conducted during the original study period of 1994-1996 (Rogers, Woodall, Rao, Polacsek, & Milan, 1994). The content in the MADD VIP presentations were analyzed with the unit of analysis being a complete message. The message itself was the unit, whether that message was in the form of a single sentence or paragraph. Paragraph breaks were made when the scene or perspective changed. Kenneth Burke (1945), in his dramatistic pentad, defines the scene as the context in which the message content occurs. Therefore, units of analysis for coding content were based on units of scenes. In the present study, text119

coding breaks occur between scenes. For example, Table 3-2 shows coding of five three different scenes that were observed to occur in the same MADD VIP narrative.

Table 3-2: Example of Units of Analysis Coding from Codebook ReactanceInducing Intensity 2

3

2

3

1

Scene Code

Text We know that you don‘t want to be here, and we don‘t want to be here either.

you & I are the same (pathos)

forewarned: a sad message is coming

It‘s difficult even to know where to start, when you talk about the loss of a child. Even a child that‘s grown. If you‘re as lucky as I, when your children are grown, they become your best friends, and you still work and play together and they‘re such an important part of your life.

you & I are the same (pathos)

forewarned: a sad message is coming

That you don‘t know how to get along without them, when they‘re jerked away so suddenly. On a Monday in August my son Kevin had a wonderful day, he uh, played hooky from work. I know because he worked for me. But we had a big project the next day, and he really wanted to get things cleaned up. Um, although he had his own apartment for eight years, he still loved coming back to the house. Um, that‘s where he kept his motorcycle, that‘s where he kept his drums. Anything that made too much noise for his apartment building was left at our house.

a happy and hopeful message

Omnibus ANOVA for Unequal n. Results from the content analysis were used to conduct an ANOVA. The independent variable was VIP group. The dependent variable 120

was VIP levels of reactance-inducing statements (first column in Table 3-2), a continuous variable whose individual values were derived from a mean of two values assigned to that statement by phase 2 coders who scored the highest interrater reliability. The means for each VIP group‘s level of reactance-inducing statements were used for the full model. The full model assumed the alternative hypothesis was true, that there was group difference. Assuming no difference between groups, the restricted model used weighted group means to calculate a weighted grand mean for unequal n. An omnibus F was calculated to test whether there was a significant difference between message dosage levels for fifteen MADD VIP plus DWI School intervention groups. For more detail on the procedure for calculating ANOVA for unequal n, see nonorthogonal designs (Maxwell & Delaney, 2004, p. 320). Unit of analysis. The unit of analysis was the individual statement score. An individual statement score was the reactance-inducing intensity number for that statement arrived at by averaging the scores assigned to that statement from two coders who scored the highest interrater reliability and who each scored the entire data set. For example, if a statement was coded at reactance-inducing level 3 by coder A and at level 2 by coder B, then the average score for that statement was 2.5. Hierarchical Linear Modeling Method. If the above omnibus ANOVA comparing the 15 MADD VIPs shows significantly different scores for the independent variables, VIP level of reactance-inducing statements and VIP proportion of reactanceinducing statements, then a hierarchical linear regression model will regress DWI arrests on individual participants‘ message dosages, nested within their 15 MADD VIP groups. Hierarchical linear modeling allows for an improved estimation of individual effects 121

when different groups are receiving similar but different interventions. Hierarchical linear modeling can evaluate how exposure to different messages can have different message effects, depending on the MADD VIP group. Further, hierarchical linear modeling can draw on ― the estimation of variance and covariance components with unbalanced, nested data‖ (Bryk & Raudenbush, 1992, p. 7). If messages in the 15 presentations demonstrate a difference between groups‘ reactance-inducing intensity effects, then group effects will be controlled for as covariates in a survival analysis. Nested Data Structures. HLM (hierarchical linear modeling) enables the analysis of nested, data structures. People in this study, as in similar studies, exist within nested organizational structures such as the individual, the VIP cohort group, and the condition group, that is, intervention or comparison group. Participants who exist within each nested group are more similar to one another than individuals randomly sampled from a larger population. Therefore they should be statistically analyzed within the context of their nested groups. Unit of Analysis. The unit of analysis for HLM would be specified in two levels of scale. The unit of analysis at the first level of scale would be mean level of VIP reactance-inducing intensity perceived by each participant in a VIP group. The unit of analysis at the second level of scale would be mean level of VIP reactance-inducing intensity for each group. Should IVs be analyzed as fixed or random? The sample of MADD VIPs that was used in the study was a sample taken, for the sake of argument randomly sampled, from the entire population of MADD VIP presentations. Because the sample is considered random, the MADD VIP message dosage data can be treated, not as a fixed 122

factor, but as a random factor. A random factor classification assumes the different levels of dosage within different MADD VIPs were representative of the population of dosage in all VIPs. In order to determine whether the MADD VIP message dosage should be treated as a random (varying) factor or fixed (same) factor an ANOVA should be conducted to test for difference in message dosage levels between MADD VIP groups. If the MADD VIP message dosages are different, then MADD VIP intervention should be considered as a random factor in a mixed (random and fixed) model design known as HLM (Hierarchical Linear Modeling). In the present study the low and high level of reactance-inducing statement dosage groups were treated in a fixed factor design because the randomness (variability) of the independent factors of level of reactance-inducing statements and proportion of reactance-inducing statements were converted into two fixed factor categories (low/high levels) that, it is argued, adequately represented VIP statement population variation. As continuous variables they were not normally distributed. In fact, they were strongly bimodal and were thus best converted to categorical variables that were regressed using loglinear logit regression. Loglinear logit regression. Loglinear logit regression (also known as the logistic model, the logit model, multinomial logit, and maximum-entropy classifier) is a type of loglinear or logistic regression analysis. It is a generalized linear model (Durbin & Watson, 1950; Durbin & Watson, 1951) used for binomial regression. The dichotomous dependent variable is known as the logit. The logit function is the inverse of the "sigmoid", or "logistic" function used in statistics. The logit of a number p between 0 and 1 is given by the formula: 123

Equation 3-1

The outcome, in loglinear logit regression, is the log odds ratio of a case experiencing one level or another of a dichotomous dependent variable. The dependent variable outcome value for each case is the logit or log of the odds (Tabachnick & Fidell, 2007, p. 438). The equation for logistic regression is: Y = β0 + β1x1 + β2x2 + β3x3 + … βkxk,

Equation 3-2

where β0 is called the intercept and β1, β2, β3… βk are the regression coefficients of x1, x2, x3… xk that represent scores for the predictor variables. The intercept is the value of y when the value of predictor variables is zero. Each of the regression coefficients describes the size of the contribution of that predictor. Table 3-3 lists interpretation guidelines for different values of the logistic regression coefficient. Table 3-3: Guidelines for Interpretation of Logistic Regression Coefficient If the coefficient is: Positive in value

Negative in value

Large in value (significantly different from zero)

Small in value (near zero)

Then the interpretation is: The covariate (predictor) increases the probability of the outcome The covariate (predictor) decreases the probability of the outcome The covariate (predictor) strongly influences the probability of the outcome. It contributes meaningfully to the regression model. The covariate (predictor) has little influence on the probability of the outcome. It does not contribute meaningfully to the regression model.

124

Loglinear Logit Regression Compared to Other Regression Alternatives. Loglinear logit regression evaluates the contribution of predictors, as does discriminant analysis, but unlike discriminant analysis logistic regression does not assume normal distributions of predictor variables. Loglinear logit regression is a type of multiway frequency analysis, which requires discrete predictors; predictors in loglinear logit regression may have more than two levels. In the present study, data were analyzed as in more than two categories and as dichotomous variables, depending on the question and level of analysis necessary to obtain the clearest result. Loglinear logit regression is more flexible than multiple regression analysis. ― In logistic regression, the predictors do not have to be normally distributed, linearly related (if they are not continuous), or of equal variance within each group [i.e., homoscedasticity]‖ (Tabachnick & Fidell, 2007, p. 437). Loglinear logit regression is a good choice when there are unequal n in two groups being compared and when the predictor variable influences the two groups unevenly. Bennett, Beaurepaire, Langeluddecke, Kellow, and Tennant (1991) found that while univariate analysis indicated two groups differed on 17 predictors, yet logistic analysis indicated that only one predictor, by itself, contributed to a best fit model. The reason for differences in results between univariate analysis and logistic regression was that logistic regression is robust to unequal n in predictor variables. In the present study, as in the Bennett et al. study, the sample sizes were uneven. The DWI School comparison group comprised only 2% of the sample, versus 98% sample comprised of the MADD VIP plus DWI School intervention group. Logistic regression was able to discern a predictor relationship despite unequal n between groups, making it appropriate for use in

125

analysis of the present study data. There are other reasons logistic regression was appropriate and these are discussed next. Loglinear Logit Regression Appropriateness for the Present Study. The loglinear logit regression method is the appropriate regression method for the present study due to qualities of the data. Given non-normal distributions, unequal n, homoscedasticity, and bimodal distribution of effects in terms of bifurcated effects outcomes in independent and dependent variables, it was necessary to transform continuous variables into categorical variables and loglinear logit regression is suited to regress categorical variables. The method outputs odds of an event occurring, given the data. For example, logistic regression can output log odds and probability measures that DWI offenders will be rearrested for DWI again in the sooner/later categories given their age (dichotomous), number of priors (dichotomous or categorical), and antecedent patterns of VIP message dosage (low/high levels of reactance-inducing statements, dichotomous, or no VIP versus low reactance-inducing VIPs versus high-reactance VIPs). Loglinear logit regression is suitable for evaluating two predictors, level of reactance-inducing statements and proportion of reactance-inducing statements, because these two predictors are not normally distributed, linearly related to the dependent variables, and are heteroscedastic. A bimodal distribution cannot be transformed into a normal distribution. They are not linear, and they are not homoscedastic. They are best categorized, being bimodal, as dichotomous variables. Loglinear logit regression can evaluate levels of reactance-inducing statements and proportion of reactance-inducing statements though they do not meet necessary assumptions for other forms of regression. 126

Loglinear logit regression identifies the most adequate regression model even if there are unequal split of cases (as extreme as a 2% / 98% split) in levels of the independent variables. Unequal n in levels of the independent variables is the case with reactance-inducing level (n low = 90, n high = 294), where the smaller group has 31% of the number of cases as the larger group, and priors (nnone = 786, none-or-more = 47), where the smaller group has 6% of the number of cases as the larger group. When the distribution of the outcome variable in relation to an independent variable is not linear, as is the case with level of reactance-inducing statements, loglinear logit regression can still identify an effect. In the present study, the probability of short time_to_recidivism was affected by level of reactance-inducing statements for those priors who were age 30 and older, but the relationship was not linear. A linear relationship was not possible due to the bimodality of both variables. Loglinear regression is sensitive to detecting and identifying these differences in non-linear relationships. According to one rule-of-thumb, there should be at least five cases expected in each cell. According to another rule-of-thumb the optimal situation is where all expected cell frequencies are greater than one for all cells created by pairs of discrete predictors paired with the dependent variable. If the minimum number-of-cases-per-cell criterion is not met, then the remedy for this situation is first, collapse predictor variables into fewer levels. If that does not resolve the problem, then eliminate weakly contributing predictor variables that are correlated with strongly contributing predictor variables. Another remedy, not always available, is to increase the sample size.

127

The dependent variable, time to recidivism, in the loglinear regression was dichotomized as the first four years after intervention versus after four years following the intervention. The division between two levels of time to recidivism may be a relatively crude division point because there are only three cases of prior offenders who survived beyond four years. However, the inflection points of the survival change curves (see Figure 4-4) for both priors and no priors determined the best estimate of a proper break point for the dichotomous dependent variable. Again, it may be noted that over 100 prior offenders who had not been recently arrested were included in the no prior category in the present analysis. Unit of analysis. Logistic regression uses the individual case as the unit of analysis. Survival analysis method The survival analysis method calculates the time DWI offenders ― survive‖ until they are rearrested for DWI offenses once again. Of the several types of survival analysis available to statisticians, the most suitable statistical tests for this study are the Cox Proportional Hazards Regression and Life Tables analysis. Cox Proportional Hazards Regression evaluates the predictor effects upon survival time until recidivism. It also allows for the use of covariates. Life Tables analysis does not allow for use of covariates, but it provides the Wald statistic, a test for group difference in survival analysis. Unit of analysis. The survival analysis method uses the individual case as the unit of analysis. Test for Time Dependence. Time-dependent Cox regression can be used if the variables of interest vary over time. All of the variables in the present study were tested 128

for time dependence and found to be constant over time and so a time-dependent Cox Regression was not required. Censored Data. Survival data usually includes some cases for which the event of interest has not happened. For example, by the end of the study on December 31, 2007, the event of interest, recidivism, had not occurred for a majority of offenders. At the end of the study they still have not been rearrested. Cases that have not experienced the event or recidivism, who have not been rearrested for whatever reason, are classified as censored cases. Censored, or missing, cases cause traditional techniques such as t-tests or linear regression to be inaccurate. A survival analysis excludes censored cases from the regression section of the calculations but reintroduces them and uses them in calculating the survival function, a hazard likelihood—because these non-event cases have an odds ratio, a probability of no event to contribute to the hazard model. Causes for Missing Event Data. Event data can be missing from a case record for other reasons other than the offender was not rearrested: for some cases recidivism for DWI happened after the study closed. In other cases the court system lost track of DWI status sometime before the end of the study, for example a case file was lost, data were entered incorrectly, or a DWI arrest was made but the offense was transmuted to a different type of offense. In this case, the recidivism DWI offense did not show up in that participant‘s court record. In other cases, some study participants may have ― dropped off the New Mexico traffic safety radar‖ because they were in prison, moved out of state, or because they were deceased. Prediction of Survival based on IVs. Claims of causality, the extent to which they can be made in the present study, are now considered. The original study designed 129

by Rogers, Woodall, Rao, Polacsek, and Milan (1994), of which this study is a message effects extension, was an empirical study that compared participant outcomes between two randomly assigned groups: VIP only versus VIP plus DWI School. If a significant difference between these groups was found, then due to the experimental design, causality might be inferred. However, the present study subdivides both groups into high and low reactance-inducing VIP groups. This subdivision introduces an observational element into the study because participants were not randomly assigned to high and low reactance-inducing VIP groups. At the time the original study was conducted, it was not known that the VIP intervention consisted of two levels of reactance-inducing statements. The observational nature of the VIP subdivisions in this study precludes an ability to draw causal inferences from the VIP subdivisions. However, causal inferences can be made at the VIP versus VIP plus DWI School levels of the independent variable that compares intervention types. These considerations are kept in mind when discussing predictability of independent variables influence upon the dependent variables. Because the independent variable is sometimes referred to as the predictor variable in the present study does not mean that causal inferences are being made. Rather the term predictor is used in a general sense as an independent variable in the context of a regression equation. Cox regression is a subtype of survival analysis that allows for the inclusion of predictor variables (covariates) in the model. Cox regression omits the censored cases (those who did not experience the outcome event, recidivism) from the stepwise regression but includes censored cases in the computation of the probability of survival. Cox regression, using stepwise regression analysis, provides regression coefficients for each of the predictor variables, enabling assessment of the impact of multiple covariates 130

in the same model. Cox Regression can be used to examine the effect of continuous or discrete covariates (Cox, 1972). The Cox regression assumes the time to event (in this case the event is recidivism) and the covariates are related using the hazard function in equation 3-4.

Where

hi(t) h0(t) p bj xij

hi(t) =[h0(t)] eb0+b1xi1+...+bpxip

(Equation 3-4)

is the hazard rate for the ith case at time t is the baseline hazard at time t is the number of covariates is the value of the jth regression coefficient is the value of the ith case of the jth covariate The hazard function is a measure of the potential for the event to occur at a particular time t, given the event did not yet occur. Larger values of the hazard function signal greater potential for the event to occur. Si(t) is the likelihood the ith case survives past time t. The value of the hazard is equal to the product of the baseline hazard and a covariate effect. While the baseline hazard is dependent upon time, the covariate effect is the same for all time points. Thus, the ratio of the hazards for any two cases at any time period is the ratio of their covariate effects. This is the proportional hazards assumption. Si(t) =e−∫0t[h0(t) ] eb0+b1xi1+...+bpxipdt where The concept of "hazard" may not be intuitive, but it is related to the survival function. The value of the survival function is the probability that the given event has not occurred by time t. Again; the baseline hazard determines the shape of the survival function.

The equation 3-4 denotes ― S (t),‖ which stands for the survival time until recidivism. S is the conventional denotation for the survival function; t is the

131

conventional denotation for time. Equation 3-5 describes the equation S(t) such that survival until recidivism is a function of time. S(t) is equal to the likelihood ―Pr‖ (probability) T is later than some time t. For example, arrest T is later than intervention date (for MADD VIP plus DWI School intervention group) or enrollment date (for DWI School comparison group). S(0) = 1, at the beginning of the study. (Equation 3-5) The Wald Statistic in Logistic Regression and Survival Analysis The Wald statistic is used in the Cox regression to test whether each covariate (including the independent variable) has a significant causal relationship with the dependent variable of time to event. SPSS compares the square of the difference to the chi-square distribution. For the one dependent variable, such as time to event in the case of the single event Cox regression, the Wald statistic for the univariate case is represented in equation 3-6. (Equation 3-6)

The Wald statistic is used in logistic regression to evaluate the statistical significance of each of the coefficients in an acceptable model. For this purpose, the Wald statistic is computed as represented in equation 3-7, where ― the squared regression coefficient is divided by its squared standard error‖ (Tabachnick & Fidell, 2007, p. 445).

132

Bj2 Wj = --------- (Equation 3-7) SE2Bj Procedures The present study employed a four-phased procedural approach. 1.

Quantifying reactance message dosages.

a. The qualitative constant comparison method was used to identify coding themes in transcripts of 15 MADD VIPs. Qualitatively identified themes were refined to create eight ordinal reactance intensity codes, and their definitions, compiled in a codebook. Table 3-2 presents the reactance intensity codebook. Each code was developed as unique and distinct, with definable differences from other codes. Eight reactance intensity codes were developed to be parsimonious but sufficiently rich to classify all of the 2,021 MADD VIP statements by 56 presenters in 15 VIPs.18 The zone of red reactance intensity codes is arranged in an ordinal order of increasingly strong confrontational messages to induce behavior change. Brock (1968) found the best way to characterize ― threats to freedom‖ is by an ordinal scale. Hong and Faeda (1996) used an ordinal reactance scale to predict reactance. The present study also used an ordinal scale of level of reactance-inducing statements. Researchers have identified the confrontational messages in the ordinal continuum of the red zone as increasing levels of reactance-inducing statements as message strength increases (Buller et al., 2000; Campo & 18

It was not fruitful to analyze VIP texts at the level of each of the 56 presenters due to the high withinpresenter message variance compared to the low between-presenter message variance. Thus a presenterbased analysis was not conducted; it was outside of the scope of possibility for these naturally occurring in vivo (real life) speaker data.

133

Cameron, 2006; Engs & Hanson, 1989; Miller et al., 2006; Quick, 2003; Quick & Stephenson, 2004). The last five themes ―f orewarned: sad message coming,‖ ― worried, depressed, confused,‖ ― irritated, hurt, devastated,‖ ― you should change,‖ and ― angry‖ were reactance-inducing statements according to the research of Dillard and Shen (2005) and others (Brehm, 1966, 1972; Brehm & Cohen, 1962; Brehm & Cole, 1966; Goranson & Berkowitz, 1966). Researchers characterize these confrontational messages as reactance aggravators, ― reactance can be operationalized as a composite of self-report indices of anger and negative cognitions‖ (Dillard & Shen, 2005, p. 144). Quick and Stephensen (2007) model reactance as ― a latent variable comprised of negative cognitions and state anger‖ (p. 255). Previous researchers have identified red-area codes four, five, six, seven, and eight as reactance inducing (Dillard & Shen, 2005). They produce a lowered mood (Hong & Faeda, 1996); they are strong inducements to induce contrary behavior (Festinger, 1957, Festinger & Carlsmith, 1959; Freedman, 1965); and they are likely to increase a ― boomerang effect‖ of increased noncompliance (Hollander, 1971). i. Reactance Constructs Quantified within an Ordinal Scale of Codes. The blue area of the codes in Table 3-4 signifies positive valence statements intended to set up rapport between the speaker and his or her audience. These low-numbered codes signify a message that is not reactance inducing. 1. Code 1:“I am happy, hopeful” is a positively valenced statement. This statement has not been found to be reactance 134

inducing in previous research (Brehm, 1966, 1972; Brehm & Cohen, 1962; Brehm & Cole, 1966; Buller et al., 2000; Campo & Cameron, 2006; Dillard & Shen, 2005; Engs & Hanson, 1989; Goranson & Berkowitz, 1966; Miller et al., 2006; Quick, 2003; Quick & Stephenson, 2004, 2007). 2. Code 2: Pathos is emotional identification between the speaker and the audience, also a positive state according to research cited in the previous item. These first two blue-area statements encouraged DWI offenders in the audience feel at ease and to like the speaker. 3. Code 3: “please change,‖ is not reactance inducing according to research by Dillard and Shen. However, no matter how politely it is spoken, according to other researchers, ― please change‖ may threaten or irritate audience members by degrees depending on the extent to which an offender believes he or she has the freedom to drink and drive. Behavior change requests are most likely to annoy repeat offenders (Brehm, 1966, 1972; Buller, Burgoon, Hall, Levine, Taylor, Beach, Buller, & Melchor, 2000; Camp & Cameron, 2006; Engs & Hanson, 1989; Miller, Benefield, & Tonigan, 1993; Miller, Burgoon, Grandpre, & Alvaro, 2006; Quick, 2003). 4. Codes 4-8: Reactance-inducing statements. Researchers identified ― forewarning‖ as reactance inducing (Petty & 135

Cacioppo, 1977). ― Worried, depressed, confused,‖ ― Irritated, hurt, devastated,‖ ― You should change,‖ and ― I am angry with your‖ are types of statements that also have been found to be reactance inducing in the research cited in the first item of this code list. For purposes of creating an ordinal scale that signified reactance-inducing intensity, the above statements were organized in an ordinal hierarchy, in increasing levels of emotional escalation. ii. Codebook Production. Each code was defined and structured as independent of the other codes. A codebook was produced that employed the above ordinal list of high/low threat codes. As the ordinal number increased in the code list there was a corresponding increase signified in reactance-inducing intensity. Reactance research and reactance theory attributes more reactance-inducing affect as message intensity increases. For example, the higher the level of reactance-inducing level, confrontation, and loss frame, the more reactance inducing the message. Each reactance intensity message code was specified with a definition and an example, as demonstrated in the coding example in Table 3-2. The full list of reactance intensity codes is displayed in Table 3-4. For ease of use and to increase coding accuracy, reactance intensity codes used by the coders were reordered to represent ordinal escalation of the strength with which they induced reactance. ― Please change‖ and ― you should change‖ were adjoining as codes six 136

and seven. However, ― please change‖ is not reactance inducing and ― you should change‖ is a reactance-inducing statement (Dillard & Shen, 2005). For purposes of creating an ordinal measure of reactance inducement, codes were re-ranked in an order representing least to most reactance-inducing statements. ―Pl ease change‖ became code number three and each successive code moved up by one code number. The messages in the upper scale are positive and nonthreatening or not reactance inducing. These messages are ― Happy, hopeful‖ and ― You and I are same.‖ ― You and I are same‖ is also a pathosproducing (Aristotle, 2006) statement that rhetoricians use to increase consubstantiality (Burke, 1965) and thus compliance from their audience.

Table 3-4: Set of eight ordinal reactance intensity codes used to code the 2,021

Adjusted Codes

statements by 56 presenters in 15 MADD VIPs. 1 happy, hopeful 2 you & I are same 3 please change 4 forwarning: a sad message is coming 5 worried, depressed, confused 6 irritated, hurt, devastated 7 you should change 8 angry

Chg from 6 Chg from 3 Chg from 4 Chg from 5

An ordinal range of levels of reactance constructs were identified that ranged from non-reactance inducing, to mild reactance inducing, and to strong reactance inducing. Its number in the ordinal scale represented the ordinal position of a code. These

137

ordinal numbers were then used to calculate two independent variables that measured reactance message dosage: level of reactance-inducing statements in the VIP intervention (average reactance-inducing level for each VIP based on an mean frequency of occurrence combined with the intensity code levels) and proportion of reactanceinducing statements (proportion of statements in a VIP that were on the reactanceinducing levels of the scale). The zone of red codes contained an ordinal order of increasingly strong, confrontational, reactance-inducing messages. An ordinal scale was found to be best way to characterize ― threats to freedom‖ (Brock, 1968). Hong and Faeda (1996) used an ordinal reactance scale to predict reactance. The present study used an ordinal scale to differentiate between increasing severity of reactance-inducing statements. The ordinal scaling of reactance-inducing message severity was informed by research on these five reactance constructs (Buller et al., 2000; Campo & Cameron, 2006; Engs & Hanson, 1989; Miller et al., 2006; Quick, 2003; Quick & Stephenson, 2004). The reactance-inducing themes ―f orewarned: sad message coming,‖ ― worried, depressed, confused,‖ ― irritated, hurt, devastated,‖ ― you should change,‖ and ― angry‖ have also been categorized as reactance-inducing statements by Dillard and Shen (2005) and others (Brehm, 1966, 1972; Brehm & Cohen, 1962; Brehm & Cole, 1966; Goranson & Berkowitz, 1966). Researchers characterize these confrontational messages as reactance aggravators, aggravating anger and negative and contrary behaviors. (Dillard & Shen, 2005; Quick & Stephenson, 2007). Previous researchers have identified that red-area codes four, five, six, seven, and eight are reactance inducing (Dillard & Shen, 2005). They produce a lowered mood (Hong & Faeda, 1996). They are strong inducements to 138

induce contrary behavior (Festinger, 1957; Festinger & Carlsmith, 1959; Freedman, 1965), and they are likely to increase a ― boomerang effect‖ of increased noncompliance (Hollander, 1971) rather than the compliance expected by the sources of the messages. iii. Separate spreadsheets were created for each VIP presenter transcript. Each spreadsheet contained (1) a cell for each MADD VIP presenter statement and (2) a drop-down menu for each cell. Each drop-down menu contained a choice of the eight ordinal codes in increasing reactance-inducing intensity that were to be used in the content analysis. Thus a coder would be able to read a presenter statement then choose from a drop-down menu next to that statement which of the eight ordinal reactance-inducing codes would best classify that presenter statement. Figure 3-1 contains a screen shot of a portion of a coder‘s Excel coding spreadsheet. iv. Quantitative content analysis method was used to quantify the frequency of occurrence of coding themes. v.

Training of Coders. Eight coders were trained to code the MADD VIP transcripts using the qualitatively generated reactance-inducing codes. The independent coders each coded text using Excel spreadsheets with drop-down code menus for each VIP unit of analysis. Each unit of analysis (one scene, complete thought, or narrative) had its own cell with an adjacent drop-down menu. The coders made a choice from the eight reactance-inducing codes for each

139

unit of analysis. Figure 3-1 is a screen shot of a portion of a coder‘s Excel coding spreadsheet.

Figure 3-1: Screen shot of Excel spreadsheet from which coders coded MADD VIP transcripts. Each unit of analysis was contained in one cell, with adjacent drop-down menu from which coders chose one of eight reactance-inducing codes.

After coding was completed, the interrater reliability was computed using the Fleiss‘ kappa described below. If reliability is less than 80% then the coders must be retrained. The investigator and coders discuss where the differences are found. They redefine the problematic codes with refined definitions and examples. Then the coders

140

recode. If reliability is less than 80% then it is necessary to redefine codes and/or retrain coders until reliability reaches 80%. b. Phase 1 Content Analysis Validity and Reliability. In quantitative analysis, the epistemological assumption is: there is one conclusion that is probably true that can be arrived at by agreement of different parties. That level of truth in content analysis is measured by interrater reliability, a measure of coder agreement. The coded data was collected from the eight coders and interrater reliability was computed using Fleiss’ kappa. Fleiss' kappa is a generalization of Scott's pi statistic (1955), a statistical measure of interrater reliability. It is also related to Cohen's kappa statistic (1960). Whereas Scott's pi and Cohen's kappa work for only two raters, Fleiss' kappa works for any number of raters who are determining categorical ratings for a fixed number of items. Fleiss' kappa can be interpreted as expressing the extent to which the observed amount of agreement among raters exceeds what would be expected if all raters made their ratings completely randomly. If a fixed number of people assign numerical ratings to a fixed number of items then the kappa will give a measure for how consistent are the ratings. To measure coder reliability, half of each coder‘s documents were assigned to two other coders. Overlapping the coding provided a means to measure interrater reliability and a basis for arguing for generalizability of the content analysis. This standardized content analysis yielded generalizeable and replicable results with kappa as a coefficient of agreement for nominal scales = 0.68 (Cohen, 1960). Phase 1 kappas between pairs of coders ranged from 0.42 - 0.90, with a standard deviation 0.12. Statisticians consider a kappa of 0.61 - .80 substantial agreement (Everitt, 1996; Landis & Koch, 1977). 141

According to Cohen, an average kappa of 0.68 is interpreted as follows. On the average 68% of the coders‘ joint judgments were agreements (with chance excluded). The kappa marginals for the eight coders in ten overlapping pairs were 0.78, 0.96, 0.74, 0.90, 0.92, 0.78, 0.89, 0.96, 0.97, and 0.84. The average marginal marks the maximum value that kappa could take for this data as a function of the expertise of coders, the level of focused attention of the coders, and the quality of the data. The marginal was 0.87. Therefore, nearly 20% of the disagreement was a result of marginal inconsistencies, and this number signaled a degree of coder inattention. The marginal of .87 was not due to quality of the data or ambiguity of the coding set because the highest kappa was .90, pointing out the data set and coding set were unambiguous (Cohen, 1960). Figure 3-1 shows the formula used to calculate coder kappa. This kappa uses a simpler and more conservative calculation for nominal scales than that for ordinal scales. The calculations of the ordinal scale kappa are presented in the next discussion.

(Equation 3-8) A coefficient of interjudge agreement for nominal scales is described as follows: ― P is the likelihood of observed data, the proportion of units in which the judges agreed. Pe is the likelihood of chance, the proportion of units for which agreement is expected by chance. Kappa equals the proportion of joint judgments in which there is agreement, after chance is excluded. Its upper limit is +1.00, and its lower limit falls between zero and negative 1.00, depending on the distribution of judgments by a pair of two judges. The

142

maximum value which kappa can take for any given set of data is ĸM, which is dependent on the marginal distributions‖ (Cohen, 1960, pp. 37-47). c. Phase 2 Content Analysis Validity and Reliability. The two highest-scoring coders obtained the highest interrater reliability score in overlapping partial data samples in phase 1, a nominal kappa of 0.90. In phase 2 of content analysis, these two highest-rating coders each recoded the entire data set of 56 presentations again, once again using the eight ordinal reactance-inducing codes. This phase 2 of content analysis yielded a second measure of generalizability and replicability of results yielding a kappa of 0.78 for nominal scales. Kappa only considers whether judges agreed exactly or did not agree exactly on category coding. Kappa weighted is a chance-corrected proportion of weighted agreement for ordinal reactance-inducing codes that adds credit for the degree of disagreement on ordinal scales (Cohen, 1968). In phase 2, the highest-scoring phase 1 partial-data coders each recoded the entire data set and scored a kappa weighted, adjusted for ordinal scales, of 0.83. Everitt (1996) and Landis and Koch (1977) classify kappa of 0.61 to 0.80 as substantial agreement and a kappa of .81 to 1.00 as strong agreement. Therefore phase 2 content analysis by the highest-scoring coders produced a strong agreement. Equation 3-9 presents the formula for weighted kappa for ordinal coding scales.

ĸ = 1- ∑ vij poij ∑vij pcij

143

(Equation 3-9)

The previously described unweighted kappa (Cohen, 1960) treats all judge disagreements equally. Cohen‘s (1968) generalization to weighed kappa ― provides for the incorporation of ratio-scaled degrees of disagreement (or agreement) for each of the cells of the k x k table of joint nominal scale assignments such that disagreements of varying gravity (or agreements of varying degree) are weighted accordingly‖ (p. 213). Vij is the ratio weight, which represents the degree of disagreement on an ordinal scale. Poij is the proportion of the joint judgments (N in number) observed in the ij cell. Pcij is the proportion in the cell expected by chance. Weighted kappa equals the proportion of joint judgments in which there is agreement, after chance is excluded. Its upper limit is +1.00, and its lower limit falls between zero and -1.00, depending on the distribution of judgments by a pair of two judges. The maximum value which kappa can take for any given set of data is ĸM, which is dependent on the marginal distributions. 2.

Omnibus ANOVA to determine if HLM was warranted. This test was conducted upon the VIP message dosage values for the 15 VIP groups. There was no significant difference between the 15 MADD VIP groups mean level of reactance-inducing statements, or reactance dosage values, thus step 3 was not conducted. It was not necessary to conduct a hierarchical linear model to regress the outcome variables onto independent variables within participants‘ nested groups. However, a contrast revealed that there was a significant difference between VIP Group 13 (low dosage) and the other 14 groups (high dosage). Group 13 was then assigned a level of the analysis as ― low dosage‖ and all the other groups were assigned as ―hi gh dosage‖ groups.

144

3.

Hierarchical Linear Modeling was not necessary. There was no need, based on the ANOVA in step 3, to conduct HLM for treatment of random factors. (An independent variable is referred to as a factor in the language of research design.) A basic assumption in introductory statistics is to assume a fixed factor design. A one-way fixed factor, in general, is defined as a design where participants are grouped into different groups, where each group experiences different levels of a factor. The problem with applying fixed-factor design is that the different MADD VIP meetings may randomly vary in levels of MADD VIP message dosage. This is because different presenters presented, somewhat randomly, at different VIPs. A random factor design takes the perspective that the independent variable is ―not so much ‗manipulated‘ as ‗sampled.‘‖ (Maxwell and Delaney, 2004, p. 479).

4.

Tests for outliers, violation of assumptions, transformations and recoding, tests for relationships between reactance antecedents and recidivism to answer research questions: Figure 3-2 provides a flow chart that describes the order of the four procedural steps of analysis used in the present study.

145

STEP 1. Qualitative & Quanti-tative Content Calculation of VIP reactance message dosages. Tests for IV and DV variables‘ violation of assumptions of parametric statistical tests.

STEP 2: ANOVA and Decision: Are 15 VIP Group Message Dosages Statistically the Same?

Yes

Actual Path

No STEP 3. Hierarchical Linear Model regression nested within their 15 VIP groups. Output from HLM could be used as a weighting covariate in subsequent Survival Analyses

STEP 4. Answering Research Questions:

Tests for violation of assumptions, transformations, recoding, Loglinear Logit regressions, Survival Analysis: Cox regression

Figure 3-2: Flow chart of order of procedures. Instruments and Data Sources Rogers, Woodall, Rao, Polacsek, & Milan (1994) assessed MADD participants four times during the original study using a adaptation, Form 90-DWI, from a standardized questionnaire Form 90 (Hettema, Miller, Tonigan, & Delaney, 2008), and DWI recidivism data, with the fifth assessment being the present study that employs previous assessments and a 12-year database of participant recidivism data.

146

Instrument: The questionnaire Self-report questionnaires and interviews were used to obtain data on pre/post change in emotional mood in the original study. Rogers, Woodall, Rao, Polacsek, & Milan (1994) administered Form-90 DWI, a variant of NIAAA Form 90 developed by Hettema, Miller, Tonigan, and Delaney (2008), at pretest, post-test, and at one and 2-year follows ups. Form 90 was developed as a standardized assessment in NIAAA (National Institute on Alcohol Abuse and Addiction) Project Match, a $27 million dollar, nationwide test of alcohol interventions, conducted over eight years. The questionnaire asked participants an array of questions related to drinking. It also asked participants how they felt about being assessed about drinking. It asked, for example, whether participants were interested, nervous, distressed and it asked other mood indicators at different points in the study. The questionnaire asked participants about their attitudes towards driving drunk or their attitudes about being a passenger of a drunk driver. Woodall et al. (2005) administered the above-described instrument at pre and post MADD VIP, post DWI school, and at one and 2-year follow-ups. Some additional questions were added to the questionnaire at the later follow-up dates. Appendix 2: Prepost MADD VIP Instrument contains a copy of the questionnaire used in the present study. Some of the follow-up interviews could not be conducted because the participant was not located, thus there were cases where data were missing. In the present study, the rule for inclusion in calculations of the pre-post differences in emotional change scores was as follows. If the pre test had less than 6 responses missing, empty responses were replaced with case mean. Pre-post difference scores were calculated for those participants 147

who could be located for the post interviews (n = 518). Because of not being located for follow-up interviews, 198 cases were missing from the treatment group and 117 from the DWI School comparison group for emotional change scores. Questionnaire reliability and validity Self-reports are often not a reliable source of data (Richard, van der Pligt, & de Vries, 1996). Perhaps this observation is most true when the report is self-incriminating. Due to their self-incriminating content and thus less reliable nature, questionnaire self reports were not used to measure drinking frequency. Other sections of the questionnaire, derived from Form-90, were used in the data analysis. According to Hettema, Miller, Tonigan, and Delaney (2008), their retested Form 90-DWI, of which this questionnaire was an adaptation (Hettema, Miller, Tonigan, & Delaney, 2008), may be the current ―mostreliable tool for assessing DWI behavior.‖ Our data indicate that the Form 90-DWI shows promise for providing a reliable estimate of drinking behavior and several important DWI behaviors, including frequency of DWI and associated DWI BACs... as the BAC level associated with behaviors increased, reliability decreased. It is possible that this phenomenon is the result of memory deficits that have been documented to accompany high BAC levels…Assessing levels of validity is an important next step. This task may be difficult, however, as there is currently no established ― gold standard‖ for assessment of DWI behavior…In sum, the Form 90-DWI appears to yield reliable indices of DWI behavior among the tested sample, a finding that provides impetus for further research with additional samples. The Form 90-DWI shows promise for providing a much-needed measure of DWI intervention outcome. Form 90148

DWI overcomes limitations of indicators in current use, such as arrests and injuries. These measures occur at low frequency. Their accuracy is influenced by a variety of confounding factors. The Form 90-DWI does not rely on the respondent‘s subjective judgment of intoxication but rather estimates BAC from reconstructed drinking data. The current investigation provides preliminary evidence that the Form 90-DWI may be a reliable tool for assessing DWI behavior itself rather than just its tragic consequences (p. 120). Secondary data source Public records of participants‘ subsequent arrests at twelve years post intervention were used as a secondary data source. Public arrest records The public records of DWI re-arrests, obtained through the Citation Tracking System (CTS) data file maintained by the Division of Government Research at the University of New Mexico, provided the source of outcome measures—time to recidivism and number of subsequent arrests. The Variables Covariate operationalizations and measure of constructs C‘de Baca, Lapham, Liang, and Skipper (2001) found there was no statistical association between MADD VIPs and first-time offender recidivism. However, ― female repeat offenders who were referred to VIPs were significantly more likely to be rearrested‖ (p. 615) compared to non-VIP comparisons. Wells-Parker, Pang, Anderson, McMillen and Miller (1991) also found difference in male and female recidivism rates. With the caveat that these studies were quasi-experimental and their findings not entirely 149

reliable, yet they suggest that gender and number of previous DWI arrests should be at least investigated as covariates in the present study. Age Age is constructed as the age of participants at the time they were enrolled in the study, obtained from New Mexico court records. Gender Gender is constructed as participants‘ gender at the time they were enrolled in the study, obtained from New Mexico court records. Number of prior arrests Number of prior arrests is obtained from New Mexico traffic violation records. Independent variable operationalizations and measures of theoretical constructs Values for the independent variable, level of reactance-inducing statements, was arrived at through an operationalizations process that quantified presence of theoretical reactance-inducing constructs in the form of reactance-inducing message types. There is an advantage to aggregating multiple messages into archetypal message types in order to measure theoretical constructs and to further the science of message design. Jackson, O'Keefe, Jacobs, and Brashers (1989) compared single laboratory-controlled message research to research involving exemplars of multiple message types and found the multiple-message study design to be superior. multiple-message designs provide greater reliability in estimation of treatment effects, equivalent power for detection of variability in treatment effects, and easier identification of moderator variables (p. 364).

150

In the present study, MADD VIP transcripts containing multiple exemplars of reactance-inducing message types were analyzed. Six archetypal conventional message effects constructs plus eight constructs from reactance theory (Brehm, 1966) were held in mind as the researcher developed definitions and exemplars for message types from the MADD VIP transcripts. Employing constant comparison analysis, these fourteen constructs were translated to a hierarchical arrangement of increasing reactance-inducing message codes that coders used to measure level of presence or absence of reactance antecedents in 2,021 statements of 56 presenters at the 15 MADD VIP presentations. Variables that measure the levels and proportions of reactance antecedents in MADD VIPs comprise the independent measures in this study. A psi contrast was conducted to compare those VIP groups who scored relatively low on the reactance scale to those who scored relatively higher. This contrast revealed that there was a statistical justification for categorizing VIP groups into low and highreactance groups. The DWI School Only group functioned as a third comparison group, representing a group that had not been exposed to any level of the reactance-inducing VIP statements. Categorical comparison of the three groups enabled exploration of whether a change in reactance levels was consistent with a change in DWI recidivism. Identification of reactance constructs in VIP transcripts Theoretical constructs that guided the qualitative analysis were message context, content, function, intensity, and pathos as discussed above and in the review of literature. Force or pressure has been found to reduce compliance (Festinger & Carlsmith, 1959). Forceful language, language in the imperative, subjunctive, or conjunctive mood that expresses wishes or commands was classified as high-threat language (Duda, Hart, & 151

Stork, 2001). An example of high-threat language is ―Res ponsible drinking: you have to do it‖ (Dillard & Shen, 2005). Informational declarative language or language in the indicative mood (for example, questions) was coded as low-threat language (Dillard & Shen, 2005). Verb mood and inductive/deductive reasoning was classified as either highthreat or low threat according to the guidelines set down by Dillard and Shen (2005). Dependent variable operationalization and measure of reactance outcomes Reactance is an intervening state between reactance antecedents (Brehm, 1966) and the contrary behavior that results from the state of reactance. Reactance is an attitude that must precede acting out the reactant behavior. Reactance is not the reactant behavior itself but an intervening state between reactance antecedents and the contrary outcome behavior, in this case increased DWI recidivism. Evidence for reactance is found when an intervention condition that evidences high levels of reactance antecedents is associated with increased recidivism such as shorter time to recidivism and greater number of subsequent arrests. These two dependent measures are used in the present study. As in the C‘de Baca et al. study (2001), the present study uses the Citation Tracking System (CTS) data file maintained by the Division of Government Research at the University of New Mexico to provide outcome data. The database holds all New Mexico DWI records from July 1984 to present. Two dependent measures were derived from the CTS: time until recidivism and number of subsequent arrests. Time until recidivism The dependent measure time until recidivism was calculated from the CTS database. It was calculated as the time until the first re-arrest after enrollment in the study. This date was used instead of the date of the index arrest that landed the participant 152

in DWI School because in some cases the participant incurred a second DWI after the index event and before DWI School. Time until recidivism is a dependent variable used in survival analysis (Cox regression) and logistic regression. For logistic regression, time until recidivism is dichotomized into two categories: four years post intervention versus five years or more post intervention. The decision to break the continuous form of the variable at the end of four years is due to the decay curve for the outcome. There is a marked decay inflection point at the end of four years in the survival outcomes (see Figure 4-4). In this figure, the survival decay rate for those with no prior DWI arrests is shallower than the decay rate for those priors. However, both appear to have a decay rate inflection point around the end of the fourth year or beginning of the fifth. The survival curve decay rate thus provided the rationale for where to break the continuous variable time until recidivism into a two-level categorical variable. Number of subsequent arrests The dependent measure, number of subsequent arrests, was calculated from the CTS database. It was calculated as the number of subsequent arrests after enrollment in the study. The study enrollment date was used instead of the date of the index arrest that landed the participant in DWI School because in some cases the participant incurred a second DWI after the index event and before DWI School. The number of subsequent arrests is a dependent variable used in survival analysis (Cox regression) and logistic regression.

153

Emotional Change Score The dependent measure emotional change score was calculated from the study questionnaires administered at DWI School and one year following DWI School. Pre scores were subtracted from post scores to arrive at an emotional change score. A positive change score indicated a higher mood following intervention than before the intervention. A negative change score indicated lowered mood following intervention than before the intervention. Emotional change scores were found to drop as low as -37 units after an intervention in the present study. However, a negative value cannot be used in logistic regression for the dependent variable. In order to perform logistic regression with the emotional change scores as dependent variable, 40 points were added to all scores. The transformation resulted in emotional change scores that ranged in value from three to ninety five. A score of 40 indicated no emotional change from pretest to one-year posttest. A score of below 40 indicated a drop in mood following the intervention. A score of above 40 indicated an elevation in mood following the intervention. Wells-Parker et al. discuss using DWI recidivism data, as it is used to calculate the dependent variable in 99.5% of DWI intervention studies. They estimate that DWI arrest data causes an underestimate in effect size, but that there are ― no clearly superior measures‖ (p. 922) to DWI recidivism as a dependent variable. In the present study, the efficacy of the MADD VIPs‘ message (not to drink and drive) was judged by three main outcome variables: (a) Time to recidivism (Cox Regression, a type of survival analysis) (b) The number of MADD VIP participants‘ re-arrests over time versus their category comparisons (Logit Loglinear Analysis) 154

(c) Participants‘ pre/post test change in emotional variables as a measure of message effect. The Datasets Priors separate from no priors Participants with differing ― Number of Prior Arrests‖ were tested in separate datasets from those with no recidivisms. Levels of priors variable have been found to produce significantly different effects in a previous study (Woodall et al., 2008) of which this study is an extension. Numbers of prior arrests were related to DWI recidivism in quasi-experimental studies discussed in the review of literature. Although there is evidence to suggest that no priors (first offenders) as a group exhibit a high rate of alcohol dependence (Pristach, Nochajski, Wieczorek, Miller, & Greene, 1991), as does the prior offender group, yet differences between the two groups in recidivism rates supports analyzing them separately. Censored cases separate from non-censored cases Those participants who were not rearrested were not included in the database used to conduct regressions (they were censored data). The Cox Proportional Hazards Regression was an exception to this rule. Cox PH Regression does not use censored cases in the stepwise regression, but it does employ them to compute the hazard of recidivism in the calculation of the survival function. Software and its Use QSR N6 QSR N6 is user-friendly qualitative analysis software with capability to adjust and refine coding categories, merge categories. QSR N6 allows for coding in vivo or coding 155

text in real life contexts without artificiality of a laboratory manipulation or preconceived themes. It is flexible and allows the evolution of reactance-inducing codes. It supports the electronic merging or bifurcation of codes, automatically updating all units that carry those codes. It supports filtering, and segmenting of coding themes. QSR N6 also allows for organizing themes in a hierarchical relationship. Thematic relationships can be specified in a hierarchical organization of categories, then re-evaluated and adjusted as the picture of trends in the data comes into focus. All adjustments, merges, and splits in code definitions and the coded data are recorded automatically on the history of each code category. The software also allows for export of frequency counts of units in each category into a file format that is readable in SPSS. Each of the fifteen MADD VIP presentation transcripts were converted into ASCII text files and imported separately into QSR N6. The advantages of these separate entries was that once message themes were identified within MADD VIP group sessions, their frequency counts could be exported to SPSS for each separate MADD VIP group. The transcript texts were reread iteratively in QSR N6. During successive iterations, themes were identified. These themes were then reorganized iteratively in a hierarchical structure to represent relationships among them. Notes were iteratively redefined for each category as the coding process evolved. Notes for each categorical theme described what was included and why, and what was excluded and why, and the evolution of the different reactance-inducing codes. The coding scheme evolved because in qualitative analysis the coding scheme is not preset. The advantage of the constantcomparison approach is that the categories evolve from the data. Patterns in the data that may not have been expected are allowed to emerge. Variables are not excluded because 156

of a preexisting schema. Because they arise from the data, they describe and fit the data better than if they had been preconceived. This method of developing reactance-inducing codes for content analysis offers the advantage of being sensitive to nuances in the data that would not be evident if a codebook were used that was developed solely from the literature. SPSS SPSS (Statistical Package for the Social Sciences), a user-friendly statistics software with a graphical user interface. Statistical analyses, both parametric and nonparametric, were run in SPSS. Most of the analysis and all of the charts in the present study were products of SPSS. Data from SPSS, in order to create better charts, were imported into Excel. Microsoft Office Excel Microsoft Office Excel was used to create charts of higher clarity than available in SPSS. Excel was used to perform complex vertical lookup functions, create pivot tables, and conduct other similar data manipulations in order to prepare raw data and create new variables. Such data was then imported into SPSS for analysis. Methods Limitations Under-identification of prior offenders Due to analyst error in construction of the current data set, up to 125 prior offenders, those whose offenses older than months or weeks prior to their study enrollment were analyzed as having no prior offenses. Only those offenders with very recent prior offences were included in the prior category of the present study. This underidentification of participants with prior DWIs had the effect of reducing likelihood the 157

present study would produce significant results. Regardless of the direction of the error, the findings in the present study concerning prior offenders will need to be replicated to validate the present study‘s reliability concerning effects of MADD VIPs upon those with prior DWIs. Attrition due to deaths Random assignment to group condition likely results in an equal distribution of deaths, and so deaths in study populations who have been randomly assigned to group are usually similar for all groups. Random assignment to group equally distributes the probability of attrition due to deaths among all groups. In such a random situation, deaths occur equally in all groups and do not impact the study. As such, the reasons for death are not usually of interest. The assumption that deaths are not of interest holds true if the attrition due to deaths is equally probable for all groups. But it would not be true if group condition changed the probability for death in groups. In this case death becomes an effect of group condition and is of interest. Deaths and reasons for death might be worthwhile to investigate when group condition may increase or decrease risky behavior such as drunk driving and affect study outcomes. In the present study, attrition due to participants‘ deaths was not recorded. The reasons for death, such as drunk-driving-crash-related, or not, were not known. An investigation into number of participant deaths and causes of death would require use of participant identifiers. This research did not have access to participant identifiers. The data used in this study had been stripped of those identifiers in accordance with IRB (Institutional Review Board for Human Subjects Protection) recommendations. A future study might use participant identifiers to search public 158

obituary records for possible deaths and cause of deaths for participants. There are two causes of deaths that would be of interest in a future study: deaths from natural causes and non DWI crashes, and deaths from DWI crashes. Participants’ deaths from natural causes and non-alcohol-related crashes Because 16 of participants were over age 60 at the time of entering the study, there is some likelihood that some participants may have deceased before 12-31-07, the date the 12-year post safety data was collected. Non-alcohol-related crashes are of interest as a cause of death in order to distinguish from these types of accidental deaths versus deaths from DWI crashes. Participants’ deaths from DWI crashes Since 125 participants reported drinking and driving more than ten days per month, some of the participants who are listed as not having recidivisms may have been put in this category in error. They could have died following an alcohol-related crash, in which case they died before the end of the study but would have been not cited and arrested if they had lived. DWI offenders have a higher risk for death and death from accident and violence (Mann et al., 1994). Nonrepresentative sample Only 80% of those mandated to DWI school did attend, and within that group only 70% of the DWI school attendees elected to participate in the study. These numbers suggest that the population sample obtained for the study may not have been 100% representative of all DWI first offenders. Of those who had been arrested for DWIs during the study enrollment period only those offenders who were functional enough to

159

attend DWI school and altruistic enough to volunteer for the study were enrolled as study participants. Bimodal distribution of independent variables indicate conversion to dichotomous variables The data for level of reactance-inducing statements and proportion of reactanceinducing statements, the two independent variables of interest for research questions one through five, demonstrated bimodality in their distribution. This means that they are best represented as categorical variables in an analysis. There is no transform to normalize bimodal data. Reactance-inducing level and proportion of reactance-inducing statements were banded into high/low dichotomous categories (category membership was determined through a psi contrast discussed in the methods section) and compared to a third category, the DWI School no-VIP-reactance exposure group. These three levels of the two independent variables could be employed in survival analysis, where the stepwise regression does accept categorical predictor variables, in nonparametric chi-square, and as grouping variables in ANOVA and loglinear regression. Transformations, recoding to change score values, dichotomization of variables, and their rationale are, by convention, reported in the Results chapter (Tabachnick & Fidell, 2007, p. 77). Variable categorization increased power Given the preceding limitations of the independent variables‘ data structures, values for low-reactance and high-reactance VIPs, low-reactance and high-reactance VIPs, age were best dichotomized. The three dependent variables time to recidivism, emotional change, and number of subsequent arrests required recoding as dichotomous 160

variables in order to be used in loglinear logit regression, which requires discrete dependent variables. Although categorization is considered undesirable because it reduces the sensitivity of the data, yet in this case the predictor variables, due to bimodal distribution of their values, could not be described otherwise. Dichotomization is most often associated with loss of sensitivity in statistical tests, yet the loglinear logit method of regression yielded clearer results and more power, given data structure, for these data (Cohen, 1988). The non-normally distributed data were best parsed for categorical loglinear regression that employs a multinomial distribution. The multinomial logit regression, another name for loglinear logit regression, was more robust than other forms of regression that depend upon a normal distribution. Chinn's d (2000) was used to calculate effect size from odds ratio and Exp(B) parameters. Dependent variables did not appear to be compromised as a result their categorization. In fact, dichotomization of dependent variables for loglinear logit regression proved to be a benefit. Loglinear logit regression was the most efficacious regression because the logit model offers a conservation of power. It has fewer parameters that other regression models because ― the constant and all of the parameters that involve only the independent variables cancel‖ (Norusis, 2004, p. 27). Additionally, when a custom loglinear logit model is specified, the constant for the dependent variable may be omitted from the model, which also reduces the number of parameters and thus increases power. The dichotomous form for VIP data was chosen over multiple categories because two categories described the VIP data adequately, conserved power in the analysis, and allowed for inclusion of the DWI School Only group as a comparison group in the analyses. Without categorization of VIP independent variables, the DWI School Only 161

group could not have been included because this group had not experienced the VIP condition and thus had no values for low-reactance and high-reactance VIPs, or lowreactance and high-reactance VIPs. Chapter 3 Summary There are disadvantages inherent in conducting research on problem drinkers. Participant recruitment inefficiencies resulted in a population sample that may be nonrepresentative. This inefficiency was a function of the nature of the population where 44% of the pool of drunk drivers at the time of the study enrollment was either: (a) Unable to function normally enough to make it to DWI School and avoid a bench warrant, or (b) Not being altruistic, that is, not interested in benefiting society or other who may come after them through the court system. Thus, the study results are not generalizeable because those who were the most problematic drinkers are probably not represented in the sample. This handicap must be accepted as part of the nature of the population being studied. Heavy drinkers are characteristically low functioning and they are not characteristically altruistic or other centered. A benefit of researching this population of problem drinkers is that their outcome data, number and frequency of DWI recidivisms, is available on public record. The methods employed in the present study range from qualitative (constant comparison analysis) to quantitative (content analysis, chi-square, ANOVA for unequal n, Fisher‘s LSD, Pearson‘s r, odds ratios, loglinear logit regression, and Cox Proportional Hazards Regression). The choice of regression methods was limited by the non-normal distributions of the independent variables. This limitation, however, was overcome by 162

using loglinear logit regression, which assumes a multinomial distribution and does not rely upon continuous variables or a normally distributed data set.

163

CHAPTER 4: RESULTS This chapter begins with a preview of which tests were used to obtain results for the eight study research questions. It describes the calculations of the dependent variables, data structure considerations regarding outliers, bimodality of independent variables and how these conditions limited and informed the choice of statistical tests. Benefits and tradeoffs are discussed regarding the dichotomization of independent and dependent variables to meet data structure requirements of the appropriate tests, to reduce degrees of freedom, and to increase power. Next, the research questions are answered with tables and figures that organize and describe the output and results. At the end of the chapter a synthesized summary of the results is presented. Statistical Tests Conducted for Each Research Question The following forms of statistical analyses were employed to answer research questions one through eight. 1. Levels of reactance antecedents present in MADD VIPs were quantified via mean and proportions. The significance of the strength of their message dosages were determined by chi-square goodness of fit tests. 2. VIP levels of reactance-inducing statements were tested for difference among different VIPs using ANOVA for unequal n, Fisher‘s LSD, Pearson‘s r. 3. VIP levels of reactance-inducing statements were tested as predictors of lowered mood (lower emotional change scores following intervention) using loglinear logit regression. These results were validated through computation of odds ratios.

164

4. Levels of reactance-inducing statements were tested as predictors of survival time until recidivism using Cox Proportional Hazards regression, loglinear logit regression. These results were validated through computation of odds ratios. 5. Levels of reactance-inducing statements were tested as predictors of number of subsequent arrests using loglinear logit regression. These results were validated through computation of odds ratios. 6. Demographic predictors of number of subsequent arrests were explored using Cox Proportional Hazards Regression. 7. Demographic predictors that exacerbate negative message effects of MADD VIPs were explored using Cox Proportional Hazards Regression. 8. Whether VIPs are effective was summarized by reviewing significant results for research questions 1-7. Necessity of splitting data into levels of prior arrests The number of prior arrests, though only the most recent priors for the last twothirds of the sample, had the greatest effect in predicting survival for all cases. Thus, in order to answer the research questions, within the modification influence of levels of prior arrests, data were split into levels of recent prior arrests (hereinafter priors) versus no recent priors (hereinafter no priors). These two levels were analyzed separately. Results of these analyses are limited in application, as noted in the limitations of the study. Rationale An independent samples t-test was conducted to compare the number of subsequent arrests for offenders with one prior DWI versus offenders with no priors. 165

There was significant difference in number of subsequent arrests for those with one prior arrest (M = 2.31, SD = .62) versus those with no priors, M = .51, SD = .88; t (812) = 10.35, p < .0001. The addition of just one recent prior arrest made a significant difference in likelihood of recidivism. The influence of just one prior arrest upon recidivism was the same no matter which group condition, DWI School Only or DWI School plus MADD VIP, to which offenders had been assigned and no matter whether they had priors or no priors. The rationale for segmenting the data set into those priors and no prior offences was supported by findings from previous studies that differentiated these groups‘ outcomes. Wells-Parker et al. (1995) reported that those with no prior arrests were at low risk for recidivism. Those with multiple prior arrests were at high risk for recidivism. Number of priors was also found to have a contributing effect in a quasi-experimental study by C‘de Baca et al. (2005). A breakout of cases with no prior DWI arrests versus those priors indicated that these two classes of participants appeared to demonstrate different survival patterns. The intervention impacted the two groups differently. Those last two-thirds enrollees with recent prior arrests had a significantly greater hazard of recidivism, evidenced by lower cumulative probability of survival, than others in the study designated as no priors, but which included over 100 non-recent prior offenders. Note that for brevity, this distinction that over 100 non-recent prior offenders are analyzed as no priors will be assumed in further discussion of the prior versus no prior results. Figure 4-1 illustrates survival curves for priors and no priors, intervention and DWI School comparison groups combined, stratified by number of priors. Only 5.6% of 166

the sample, 47 participants, had DWI arrests prior to the study as calculated by the faulty CTS index date, and as has been discussed in the methods chapter. This means that the results reported here were biased in favor of not finding significant results because many offenders with prior DWI offenses have been erroneously included in the no prior category. The fact that significant results were obtained in the present study points to the possibility that when the data are reanalyzed with the 100 non-recent prior offenders moved to the prior offender category, the significance levels and effect sizes may be higher or lower than effects observed here.

Survival Stratified by Number of Prior DWI Arrests: All Cases NUMBER OFFENSES PRIOR TO STUDY (COURT ARREST RECORDS)

1.0

0 (N = 786) 1 (N = 26)

0.8

2 (N = 17)

Cum Survival

3 (N = 4) 0.6 Pairwise Comparison to 0 (Exact) Number Wilcoxon Sig Priors

0.4

1 2 3

54.05 36.80 11.81

.000 .000 .001

0.2

0.0 0

2

4

6

8

10

12

YEARS SURVIVED TO RECIDIVISM FROM ENRL DATE

Figure 4-1: No priors versus priors: Survival function. Those with no prior DWI arrests survived significantly longer than offenders with recent priors (Wilcoxon Gehan statistic, p < .0001). Many no priors survived to end of study without recidivism. No one

167

with one or two recent prior DWI arrests survived longer than10 years, no one with three recent priors survived longer than six years. Since the number of subsequent arrests was highly correlated with number of prior arrests (r = .40 , n = 833, p < .0001), it follows that those who were arrested more frequently before the intervention were likely to be arrested more frequently after the intervention regardless of what type of intervention they received. The correlations between these same two variables for intervention participants (r = .43, n = 426, p < .0001) versus comparisons (r = .37, n = 406, p < .0001) support this explanation and support the rationale to test prior offenders separately from those with no priors. Gender and age did not significantly affect recidivism for those priors, although age trended toward significance (p = .065) (Table 4-1). Therefore, gender and age were not considered as covariates in the analysis of the treatment effects for those with prior DWI arrests. Table 4-1: Priors: Covariates in the Cox Regression Equation B GENDER2

SE

Wald

df

Sig.

Exp(B)

-.141

.216

.426

1

.514

.869

AGE_REVERSED

.018

.010

3.415

1

.065

1.018

NUM_PRIORS

.281

.104

7.235

1

.007

1.324

Gender and age did not significantly affect recidivism for those with prior DWI arrests, although age trended toward significance. Number of priors did affect DWI recidivism. For every one-unit increase in number of priors the hazard for recidivism increased 1.3 times. 168

Calculation of Dependent Variables Time to recidivism Time to recidivism was a measure of how long participants survived from the beginning of their enrollment in the study until being rearrested for DWI. This variable was calculated by subtracting the study enrollment date from the date of the first recidivism. If the participant had survived the length of the 12-year study without recidivism, then time date of first recidivism was set to the date the data were collected, December 31, 2007, and the study enrollment date was subtracted from end of study date. In order to designate survivors who were never rearrested, a separate variable, status_event, indicated whether the participant had a recidivism status or not. Survivors who were not rearrested were reported as ―0 ‖ for a status of zero occurrence of the recidivism event. The status_event variable was used in survival analysis to exclude censored cases, those who did not experience DWI recidivism, from the stepwise regression portion of the analysis. For the survival analysis computation of hazard of recidivism, the censored cases were included. The status_event variable was used in case selection for logistic regression analysis. Only those cases that experienced the event, DWI recidivism, were included in regression analysis. The dates of participant DWI arrests were obtained from publicly available traffic safety records. Number of subsequent arrests Numbers of subsequent arrests were determined by summing the number of times a participant had been rearrested for DWI following their enrollment in the study. The

169

dates of participant DWI arrests were obtained from publicly available traffic safety records. Emotional change scores To aid in interpretation of results for research question 3, it is useful to discuss the source of the data used to calculate the variable emotional change scores. The pre-score was obtained from the emotional scale (see Appendix 1) administered to the MADD VIP plus DWI School intervention group at DWI school. The scale measured the level of emotional mood at the point when the participant took the test. Thus the pre-test was administered at DWI School to obtain a baseline mood before intervention. The post score employed in this study was obtained from the emotional scale administered to the MADD VIP plus DWI School intervention group only at their one-year follow-up interview. The one-year follow-up post score was used in this study because the immediate post VIP score was not included in the database available for this study. The emotional change score values were calculated by subtracting pre from post scores for each case where there were both scores. A high pre score subtracted from a low post score yielded a negative number, indicating a lowered mood19 following intervention. A positive valence change score indicated a more elevated mood following intervention. Raw emotional change scores ranged from a negative 38 to a positive 38. In order to prepare data for logistic regression, which requires positive values in the

19

A lowered mood is distinguished here from a depressed mood. The scale that was used to evaluate participant mood was not a diagnostic scale for clinical depression. Further, a mood change to a lower level did not necessarily mean that a participant was depressed. They could have been extremely happy prior to the intervention and then just less happy, but still happy, after the intervention. A lower mood does not necessarily mean a sad or depressed mood. Similarly, a higher mood score following intervention does not mean that a participant was happy. They could have been extremely depressed before the intervention and an improvement in mood might have still left them depressed and sad, only less depressed.

170

dependent variable, a value of 40 was added to all raw change scores to produce the emotional change score. Identification and Removal of Outliers General considerations regarding outliers Outliers must be considered in use of continuous forms of variables. Outliers may be equally present in intervention and DWI School comparison groups due to random assignment. In any case, they compromise model fit, within the constraints of the given model. Outliers can occur in independent variables, covariates, and dependent variables. Outliers may be a result of inaccurate data entry where there are blank fields, read as zero values, or due to unspecified missing values. In the case of the present study, data entry was rigorous. Data were entered twice, independently by different data entry personnel, the two databases were compared, and errors corrected. Outliers, in the present study, were not likely due to data entry error. In the present dataset, all data have been analyzed for missing values. Those cases with missing values were not included in the analyses. The outlier may not be a member of the population that the research design intended to sample. This would not be likely in the present study because the intended population was DWI offenders who had been convicted of DWIs. Only convicted DWI offenders were present in DWI School where the present sample was obtained. Finally, and the most likely reason for an outlier in the present study is if the intended population sample contained more extreme cases than are expected in a normal distribution. This case is addressed in the following section. Outliers should be identified, corrected if due to incorrect data entry, and otherwise removed (Tabachnick & Fidell, 2007). 171

Identification of outliers in the present data Outliers were present in the data, but it was unnecessary to remove them because variables were categorized. Variables were categorized because of data distributions and effect categories. In some cases, the data were bimodal. In other cases, the effect was bifurcated between two levels of the variable. In all these cases and for the above stated reasons, outliers did not affect analysis because the data were recognized as bimodal and then bifurcated. Bimodal data It was not possible to use levels of reactance-inducing statements, and proportion of reactance-inducing statements, the two independent variables of interest, as other than categorical variables. These two variables were bimodal; they were not transformable into a normal distribution and therefore not suitable for use as continuous variables. These variables were coded categorically, which eliminated the outlier bias. An additional benefit of the categorical coding was inclusion of the no-VIP comparison group, which could not have been included if the independent variables were continuous. Where continuous independent variables are used, all cases must contain values that are used in the statistical calculations. However, categorical grouping allowed for inclusion of participants who had no values for level of reactance-inducing statements or proportion of reactance-inducing statements, those who had attended DWI School Only. Categorization allowed the DWI School Only participants to be included in analyses as a comparison group because the numerical value for all members within a categorical group is the same. For example, the value for all members in the no-VIP group might be

172

― 1,‖ the value for all members in the low reactance-inducing VIP group might be ― 2,‖ and the value for all members in the high-reactance VIP group might be ― 3.‖ Bifurcated effect of age Age was used as both a continuous and categorical variable. Age outliers did not influence its accuracy as a continuous variable in survival analysis to compute the decrease in hazard for each annual increment in age. This was the only case where age was useful as a continuous variable. Age was useful as a categorical variable for use in survival analysis to study the differences between those with no prior DWI arrests who were under 30 years old versus those who were age 30 and older. The age of 31 was the mean for offenders, but the survival time to recidivism was markedly different for those under 30 versus age 30 and older among those with no priors. This was the rationale for banding age into two levels for those with no priors. Among those with prior DWI arrests, age did not make a difference in recidivism. Thus, age was not used as a covariate in the analysis for prior offenders. Bifurcated effect of emotional change Emotional change score values were simplified into a dichotomous dependent variable, which was used in loglinear logit regression and in computing odds ratios. A value of ― 1‖ signified a depressed mood one year following intervention. A ― 2‖ signified no change or a positive mood one year following intervention. Bifurcated effect of number of subsequent arrests Number of subsequent arrests ranged from zero to six (M = .61, SD = .975). This variable, because the mode was zero subsequent arrests, was simplified into a dichotomous dependent variable. The number ― 1‖ indicated zero subsequent arrests. The 173

number ― 2‖ indicated one to six subsequent arrests. This variable was also categorized into three levels for analysis of differences between odds for one subsequent arrest versus two or more. The categorical versions of this variable were used in loglinear logit regression and in computing odds ratios. Bifurcated effect of time to recidivism Time to recidivism was used both as a continuous variable, for which there were no outliers, and as a dichotomous variable. Survival time to recidivism was markedly different before and after the fourth year. At the fourth year there was a marked and rapid decline rate for time to recidivism in all three intervention modalities (no VIP, low reactance-inducing VIP, high-reactance VIP). This was the rationale for banding time into two levels: short time to recidivism (less than four years) and long time to recidivism (greater than four years). No case had a recidivism exactly at four years, thus the exact value of ― four‖ was not included in the short or long time to recidivism categories. Research Questions and Results 1. At what levels are reactance antecedents present in MADD VIP presentations? VIPs Contained High Levels of Reactance Antecedents. Reactance antecedents were present in MADD VIP presentations in significantly higher levels than would be due to 50/50 chance. Two variables measured the presence of reactance antecedents in VIP presentations: VIP level of reactance-inducing statements (high levels are reactanceinducing) and VIP proportion of reactance-inducing statements. The variable VIP reactance-inducing level reported the mean reactance-inducing level coded by coders who rated VIP presenters‘ statements. VIP level of reactance-inducing statements was measured on an eight-point scale where statements coded above level 3 indicated an 174

increasingly severe reactance-inducing statements. A frequency count of number of statements categorized within each of the codes indicated that 1,397 out of 2,021, or 69% of VIP presenter statements were above level 3 on the eight-point ordinal scale of reactance-inducing intensity and could be considered to be reactance inducing. The variable VIP proportion of reactance-inducing statements reported the number of presenters‘ reactance-inducing statements in each VIP proportional to total number of statements. A statement was reactance-inducing if it scored above level 3 on an eight-point scale. As reported in the previous paragraph, a frequency count of statement codes indicated that 1,397 out of 2,021, or 69% of VIP presenter statements were above level 3, where a score of 1-2 is a supportive and positive message, a score of 3 is neutral, and any score above 3 is increasingly reactance inducing (red area of Table 4-2). Scores for average reactance-inducing intensity by VIPs are reported in Table 4-3. To determine whether VIP statements were higher in their level of reactanceinducing statements and proportion of reactance-inducing statements than would occur due to 50/50 chance, a chi-square goodness of fit test was conducted. The chi-square test indicated there was a significant difference in the level of reactance-inducing statements and proportion of reactance-inducing statements in all of the VIPs as compared with a 50/50 probability that a statement would be emotional or reactance-inducing, χ2 (1, n = 2,021) = 295.66, p < .0001. Table 4-2 displays the set of eight ordinal reactance-inducing codes, listed in order of increasing intensity of reactance-inducing statements, which coders used to code the VIP statements.

175

Table 4-2: Set of eight ordinal codes used to code the 2,021 statements by 56 presenters

Adjusted Codes

in 15 MADD VIPs. 1 happy, hopeful 2 you & I are same 3 please change 4 forewarned: sad message coming 5 worried,depressed, confused 6 irritated,hurt, devastated 7 you should change 8 angry

chg from 6 chg from 3 chg from 4 chg from 5

Figure 4-2 demonstrates the comparative increasing levels of reactance-inducing messages for the 15 VIPs that were sampled in the present study. Mean message reactance levels for each VIP are reported by VIP intervention date. Most VIPs were rated between 4.4 to 4.8 on an eight-point scale of level of reactance-inducing statements. This indicates most VIP presentations contained more reactance-inducing statements than they did supportive or neutral valence statements. VIP 13 on June 29, 1996, was rated at the lowest level, 4.05. The lowest reactance score of 4.05 was still 1.05 levels of severity above the threshold, indicating even the lowest scoring VIP still consisted of a reactanceinducing presentation.

176

VIP EMOTIONALITY LEVEL

4.80000

4.60000

4.40000

4.20000

4.00000 03/30/95

04/27/95

05/25/95

06/29/95

07/27/95

09/28/95

10/26/95

11/16/95

12/14/95 01/25/96

02/29/96

03/28/96

04/25/96

VIP Tx DATE

Figure 4-2: VIP Level of Reactance-inducing Statements by VIP Intervention Date.

Any score above 3 on the y-axis represented a statement that was emotional and contained reactance antecedents. Most VIPs were rated between 4.4 to 4.8. VIP 13 on June 29, 1996, was rated at the lowest level, 4.05; it was still a reactance-inducing presentation overall; VIP 13 and VIP4 had lowest levels of reactance inducement. Reactance inducement is measured by the variable level of reactance-inducing statements, which includes all of the theoretical constructs associated with the reactanceinducing codes listed in Table 4-2. Note: there were 15 VIPs in the original study. All 15 are present in this study but in two instances there were two VIPs on the same date and they had to be merged because there was no other indication other than date to determine 177

which VIPs the participants had attended. This merging of four VIPs into two VIPs is discussed in the section below Merging. 2. Do the 15 different MADD VIP presentations have different reactance message dosages? If it is true that the 15 VIPs demonstrate different levels of reactance-inducing statement dosages, then this difference will become a covariate that will be controlled for by nested regression, known as hierarchical linear modeling. The 15 MADD VIP presentations, taken together in an omnibus ANOVA, were not found to demonstrate different reactance message dosages as a comparison of 15 groups. In order to determine whether there was a different message dosage for different VIP groups, and whether a hierarchical linear model should be tested, a one-way analysis of variance was conducted. The omnibus ANOVA explored whether there was a difference in level of reactance-inducing statements expressed by presenters of the fifteen different VIP groups. This one-way analysis of variance (ANOVA) for unequal "n" was calculated using coders' ratings of reactance-inducing level of VIP presenters' statements. The analysis was not significant, F (14, 2006) = 1.56, p = .075, (eta squared = .01). Because the VIP level of reactance-inducing statements variable is highly correlated with the VIP proportion of reactance-inducing statements variable (p< .0001), an ANOVA test of difference of group would not yield new information. Further, ANOVAs are calculated based on a mean value for a group. Because the Proportion of Reactance-inducing Statements was not a mean but a proportion for each group, an ANOVA was not conducted on levels of this second variable. Later in the analysis it became evident that proportion of reactance-inducing statements was an inferior and 178

redundant measure compared to level of reactance-inducing statements and it was no longer used. However, for completeness this variable continues to be discussed in this chapter. Given the generalized omnibus ANOVA findings of non-difference between VIP groups, it was not necessary to conduct a hierarchical linear regression model to regress DWI arrest data upon individual participants‘ message dosages, nested within their 15 MADD VIP groups. Message dosage values, coded by coders as discussed in the methods section, for both reactance antecedent variables VIP level of reactance-inducing statements and VIP proportion of reactance-inducing statements were listed in Table 4-2. Bifurcation of Reactance Antecedents into Dichotomous Variables. Although the 15 VIPs compared as 15 groups did not demonstrate significantly different reactanceinducing statement dosages, yet VIP Group 13 demonstrated lowest values for both variables level of reactance-inducing statements and proportion of reactance-inducing statements. The least significant difference test (Fisher's LSD with no adjustment for the post hoc nature of the test) was conducted contrasting VIP Group 13 (mean = 4.05) against the average of the other fourteen groups (weighted mean = 4.57) for a low/highreactance-inducing level contrast. F(1,2006) = 5.28, p < .01. The use of VIP 13 and VIP 4 as the ― low‖ level category enabled bifurcation of reactance-inducing level into low/high levels. This bifurcation was useful in answering the research question 4 about whether reactance-inducing level of message dosages at MADD VIPs influence DWI recidivism. If lower levels of reactance-inducing level produce significantly lower number of recidivisms and lengthen the time of being arrestfree, than do higher levels of the same variables, then there is support to argue that the reactance variable reactance-inducing level is indeed an antecedent whose categorical 179

levels predict a portion of DWI recidivism. Figure 4-3 displays the histograms for the lowest (VIP13) and highest (VIP15) reactance-inducing VIPs.

VIP15

20

40

15

30

Frequency

Frequency

VIP13

10

5

20

10

Mean = 4.0471 Std. Dev. = 2.05812 N = 85 0

0 0.00

2.00

4.00

6.00

8.00

Mean = 4.8401 Std. Dev. = 2.2024 N = 197 0.00

VIP13

2.00

4.00

6.00

8.00

VIP15

Figure 4-3: Histograms for VIP 13 (low reactance-inducing VIP) and VIP15 (highreactance VIP). Histograms display frequencies of reactance-inducing statements distributed by levels of reactance-inducing intensity. VIP 13 (n = 85, M = 4.05, SD = 2) compared to VIP 15 (n =197, M = 4.84, SD = 2). The heights of bars indicate frequency of statement occurrence for each level of reactance-inducing intensity. Locations of bars on the x-axis indicate level of reactance intensity. Red shaded areas identify highest levels of reactance-inducing intensities. Bars to the right of level 3 indicate highreactance statements. Intensity of reactance statements increases as bars move to the right. The most intense reactance category, anger, is level 8 on the far right. Enlarged histograms for all 15 VIPs are displayed in Appendix 2.

Merging of same-day VIP groups. Participant attendance at VIPs was reported by date, not differentiated by time of day. Thus if two VIPs occurred on the same date, their scores were averaged and participants were combined. This adjustment reduced number of VIP groups from 15 to 13. Table 4-3 displays these 13 MADD VIP groups. The colored rows indicate which presentations were combined

180

Table 4-3: Raw scores and transformed values for reactance-inducing level and reactance. Raw Score VIP proportion of reactanceinducing statements

VIP Date

n

VIP Number

Raw Score VIP mean reactanceinducing level

3/30/1995

11 46

VIP 1

4.4890

0.7473

VIP 2

4.7360

0.7528

VIP 3

4.6302

0.7292

VIP 4

4.3116

0.6575

VIP 5

4.1700

0.5900

4/27/1995 5/25/1995 5/25/1995 6/29/1995 6/29/1995

24 46

VIP 6

4.3796

0.6788

7/27/1995

15

VIP 7

4.7328

0.7302

9/28/1995

23

VIP 8

4.5637

0.7070

10/26/1995

24

VIP 9

4.3984

0.6429

11/16/1995

25

VIP 10

4.6757

0.7117

12/14/1995

22

VIP 11

4.6915

0.7518

1/25/1996

25

VIP 12

4.7913

0.7476

2/29/1996

44

VIP 13

4.0471

0.5765

3/28/1996

51

VIP 14

4.6389

0.7037

4/25/1996

27

VIP 15

4.8401

0.7259

Total n =

383

Mean =

4.5397

0.6969

Data structure imposed limitations. Both level of reactance-inducing statements and proportion of reactance-inducing statements were found to be present in significantly high levels of dosages (p < .0001) in VIP presentations and these two variables were highly correlated (p < .0001). Both levels of reactance-inducing statements and proportion of reactance-inducing statements, the two variables of interest in this study that measure VIP message dosage of reactance, were bimodal and not suitable for use as

181

continuous variables. They were banded into high/low dichotomous variables and used in survival analysis and logistic regression as categorical independent variables. 3. Does the reactance message dosage (level of reactance-inducing statements and proportion of reactance-inducing statements) predict direction of emotional change score in the MADD VIP plus DWI School intervention group? Neither VIP level of reactance-inducing statements nor VIP proportion of reactance-inducing statements predicted direction of emotional change score at year one post intervention. Emotional change direction was not dependent on group assignment or intervention category. However, those with no priors age 30 and older were happier one year post than they were during enrollment into the study at DWI School, no matter which intervention condition they attended (loglinear logit regression: DWI School Only β = -.457, p = .007; DWI School plus low reactance-inducing VIP β = -1.030, p = .048, DWI School plus high-reactance VIP β = -.503, p = .043). The negative direction of emotional change score immediately following VIP intervention, which Woodall, Delaney, Rogers, and Wheeler (2007) observed, did not linger one year later for any of the VIP intervention participants. 4. Does the reactance message dosage predict survival time to first recidivism within the MADD VIP plus DWI School intervention group, while controlling for covariates age, gender, and number of priors? Those with no prior DWI arrests: For those with no prior DWI arrests age was significant predictor of time to recidivism. Between two age groups of offenders with no priors, those who were under age 30 (n = 411) were 1.6 times more likely to be rearrested in the first four years, following intervention of any kind, compared to those age 30 and 182

older (n = 372) (odds ratio p = .001; Cox Proportional Hazards Exp(B) odds for age categorical variable p < .0001). Beginning at age 18, hazard for DWI recidivism decreased by 3% for each year an offender matured (Cox Proportional Hazards for age continuous variable effect size Exp(B) = .97, p = .009). Exp(B) is ― e‖ to the power of the regression coefficient. The natural log (ln) of Exp(B) is the regression coefficient. This finding compares with Marowitz‘s (1996b) finding of a 2.1% decrease in odds of recidivism for each year an offender matured. Within the group of those with no priors who were over age 30, level of intervention of any kind did not make a significant difference in time to recidivism. Those with no priors age thirty and older demonstrated a higher cumulative survival rate that those no priors under age thirty and they were positively affected by low reactanceinducing VIPs, which decreased odds of subsequent recidivisms by 1.6 times. For this no priors over-30 group (n = 372), the low reactance-inducing VIP (n = 44) was significantly associated with 1.6 times odds of fewer subsequent arrests than DWI School (p

madd message effects: a twelve-year ... - UNM Digital Repository [PDF]

Recommend Stories

Idea Transcript

Helpful Links

Smile Life

Get in touch