Criterion Measures (criterion + measure)

Distribution by Scientific Domains


Selected Abstracts


SELF- VERSUS OTHERS' RATINGS AS PREDICTORS OF ASSESSMENT CENTER RATINGS: VALIDATION EVIDENCE FOR 360-DEGREE FEEDBACK PROGRAMS

PERSONNEL PSYCHOLOGY, Issue 4 2002
Paul W. B. Atkins
Although 360-degree feedback programs are rapidly increasing in popularity, few studies have examined how well ratings from these programs predict an independent criterion. This study had 2 main aims: First, to examine the validity of ratings from a 360-degree feedback program using assessment center ratings as an independent criterion and to determine which source (i.e., self, supervisor, peers, or subordinates) provided the most valid predictor of the criterion measure of competency. Second, to better understand the relationship between self-observer discrepancies and an independent criterion. The average of supervisor, peer, and subordinate ratings predicted performance on the assessment center, as did the supervisor ratings alone. The self-ratings were negatively and nonlinearly related to performance with some of those who gave themselves the highest ratings having the lowest performance on the assessment center. Supervisor ratings successfully discriminated between overestimators but were not as successful at discriminating underestimators, suggesting that more modest feedback recipients might be underrated by their supervisors. Peers overestimated performance for poor performers. Explanations of the results and the implications for the use of self-ratings in evaluations, the design of feedback reports, and the use of 360-degree feedback programs for involving and empowering staff are discussed. [source]


Advances in the assessment of social competence: Findings from a preliminary investigation of a general outcome measure for social behavior

PSYCHOLOGY IN THE SCHOOLS, Issue 10 2008
Kelli D. Cummings
This study describes the initial validation of an innovative social--behavioral observational assessment tool that is designed to be used on a repeated basis to assess growth and development of social competence over time to: (a) identify the social functioning of all students, (b) assist in planning support for students at risk, and (c) evaluate the effectiveness of individual and system-wide interventions. Eighteen first-grade students were monitored over an 8-week period using the Initiation-Response Assessment (IRA) Code. The School Social Behavior Scales, a published teacher rating scale, was included as a criterion measure. Estimates of reliability and criterion-related validity were calculated for the IRA. The measure's sensitivity to growth over time and between-group variability were also assessed using hierarchical linear modeling procedures. Results indicate that scores on this measure are stable, and tap constructs similar to those assessed via teacher rating. © 2008 Wiley Periodicals, Inc. [source]


Validity of Three Measures of Health-related Quality of Life in Children with Intractable Epilepsy

EPILEPSIA, Issue 10 2002
Elisabeth M. S. Sherman
Summary: ,Purpose: Validity studies on health-related quality of life (HRQOL) scales for pediatric epilepsy are few, and cross-validation with other samples has not been reported. This study was designed to assess the validity of three parent-rated measures of HRQOL in pediatric epilepsy: (a) the Impact of Childhood Illness Scale (ICI), (b) the Impact of Child Neurologic Handicap Scale (ICNH), and (c) the Hague Restrictions in Epilepsy Scale (HARCES). Methods: Retrospective data were examined for 44 children with intractable epilepsy. Validity was assessed by evaluating differences across epilepsy severity groups as well as correlations between HRQOL scales and neurologic variables (seizure severity, epilepsy duration, current/prior antiepileptic medications) and psychosocial measures (emotional functioning, IQ, social skills, adaptive behavior). Scale overlap with a global QOL rating also was assessed. Results: The HRQOL measures were moderately to highly intercorrelated. The scales differed in terms of their associations with criterion measures. The HARCES was related to the highest number of neurologic variables and the ICNH to the fewest. All three scales were related to psychosocial functioning and to global quality of life. Conclusions: The results of this study suggest that the three measures are likely adequate measures of HRQOL for use in intractable childhood epilepsy. The measures were highly intercorrelated, and they were all broadly related to criterion measures reflecting specific domains of HRQOL as well as global QOL. Some differences between scales emerged, however, that suggest care in choosing HRQOL instruments for children with epilepsy. [source]


Development and Psychometric Properties of an Assessment for Persons With Intellectual Disability,The interRAI ID

JOURNAL OF POLICY AND PRACTICE IN INTELLECTUAL DISABILITIES, Issue 1 2007
Lynn Martin
Abstract, This paper describes the development of the interRAI-Intellectual Disability (interRAI ID), a comprehensive instrument that assesses all key domains of interest to service providers relative to a person with an intellectual disability (ID). The authors report on the reliability and validity of embedded scales for cognition, self-care, aggression, and depression. Four provider agencies volunteered to participate and assessed a total of 160 community-dwelling adults with ID using the interRAI ID, Dementia Questionnaire for Persons with Mental Retardation, and Reiss Screen for Maladaptive Behavior. All scales had acceptable levels of internal consistency (Cronbach's alpha 0.74,0.93) and good relationships with the criterion measures (r = 0.50,0.93, p < 0.0001). The development of the interRAI ID represents an important and successful first step toward an integrated, comprehensive, and standardized assessment of adults with ID. Use of this instrument may lead to more appropriate support planning, enhanced communication between various professionals supporting persons with ID, and a more seamless approach to supports across the health and social service systems. [source]


ENTRY-LEVEL POLICE CANDIDATE ASSESSMENT CENTER: AN EFFICIENT TOOL OR A HAMMER TO KILL A FLY?

PERSONNEL PSYCHOLOGY, Issue 4 2002
KOBI DAYAN
The study examined the validity of the assessment center (AC) as a selection process for entry-level candidates to die police and its unique value beyond cognitive ability tests. The sample included 712 participants who responded to personality and cognitive ability testing (CAT), and underwent an AC procedure. AC results included the overall assessment rating (OAR) and peer evaluation (PE). Seven criterion measures were collected for 585 participants from a training stage and on-the-job performance. Results showed that the selection system was valid. Findings yielded significant unique validities of OAR and PE beyond CAT and of PE beyond OAR even after corrections for restriction of range. Results support the use of ACs for entry-level candidates. [source]


Screening for distress in cancer patients: is the distress thermometer a valid measure in the UK and does it measure change over time?

PSYCHO-ONCOLOGY, Issue 6 2008
A prospective validation study
Abstract A prospective validation study was conducted in 171 consenting patients from oncology and palliative care outpatient clinics to validate the Distress Thermometer (DT) against the Hospital Anxiety and Depression Scale (HADS), General Health Questionnaire-12 (GHQ-12) and Brief Symptom Inventory-18 (BSI-18) at baseline, four weeks and eight weeks. Receiver Operating Characteristic analysis was used to examine the sensitivity and specificity of the DT scores against the clinically significant cut-off scores of the criterion measures reporting 95% confidence intervals. Standardised response means were used to compare DT scores with criterion measures over time. For a cut-off of 4 vs 5, sensitivity against HADS was 79%, specificity 81%; against GHQ-12, sensitivity was 63%, specificity 83%; and against BSI-18, sensitivity was 88%, specificity 74%. At both four and eight weeks, DT scores tended to change significantly in the same direction as the criterion measures. Ninety-five percent of patients found completing the DT acceptable. The DT is valid and acceptable for use as a rapid screening instrument for patients in the UK with cancer. Our results indicate that it can be used to monitor change in psychological distress over time, but further work is needed to confirm this. Copyright © 2007 John Wiley & Sons, Ltd. [source]


Personality in nonhuman primates: a review and evaluation of past research

AMERICAN JOURNAL OF PRIMATOLOGY, Issue 8 2010
Hani D. Freeman
Abstract Scientific reports of personality in nonhuman primates are now appearing with increasing frequency across a wide range of disciplines, including psychology, anthropology, endocrinology, and zoo management. To identify general patterns of research and summarize the major findings to date, we present a comprehensive review of the literature, allowing us to pinpoint the major gaps in knowledge and determine what research challenges lay ahead. An exhaustive search of five scientific databases identified 210 relevant research reports. These articles began to appear in the 1930s, but it was not until the 1980s that research on primate personality began to gather pace, with more than 100 articles published in the last decade. Our analyses of the literature indicate that some domains (e.g., sex, age, rearing conditions) are more evenly represented in the literature than are others (e.g., species, research location). Studies examining personality structure (e.g., with factor analysis) have identified personality dimensions that can be divided into 14 broad categories, with Sociability, Confidence/Aggression, and Fearfulness receiving the most research attention. Analyses of the findings pertaining to inter-rater agreement, internal consistency, test,retest reliability, generally support not only the reliability of primate personality ratings scales but also point to the need for more psychometric studies and greater consistency in how the analyses are reported. When measured at the level of broad dimensions, Extraversion and Dominance generally demonstrated the highest levels of inter-rater reliability, with weaker findings for the dimensions of Agreeableness, Emotionality, and Conscientiousness. Few studies provided data with regard to convergent and discriminant validity; Excitability and Dominance demonstrated the strongest validity coefficients when validated against relevant behavioral criterion measures. Overall, the validity data present a somewhat mixed picture, suggesting that high levels of validity are attainable, but by no means guaranteed. Discussion focuses on delineating major theoretical and empirical questions facing research and practice in primate personality. Am. J. Primatol. 72:653,671, 2010. © 2010 Wiley-Liss, Inc. [source]


Assessment Center for Pilot Selection: Construct and Criterion Validity and the Impact of Assessor Type

APPLIED PSYCHOLOGY, Issue 2 2003
Marc Damitz
Cette recherche a examiné la validité d'un centre d'évaluation pour la sélection de pilotes. Les scores de N = 1,036 participants ont été utilisés pour étudier la validité de construit. Un sous-échantillon de participants performants a été suivi et les évaluations des pairs ont été retenus comme mesures du critère. Les résultats démontrent une première évidence de validité de construit et de critère pour cet outil d'évaluation des compétences interpersonnelles et liées à la performance. Par ailleurs, les résultats ont aussi montré que le type d'évaluateur (psychologue vs pilote) modère la validité prédictive des scores du centre d'évaluation. Cet effet "type d'évaluateur" dépend de la sorte de variables prédictives. Les résultats sont discutés et des implications pratiques sont suggérées. This study examined the validity of an assessment center in pilot selection as a new field of application. Assessment center ratings of N= 1,036 applicants were used to examine the construct validity. A subsample of successful applicants was followed up and peer ratings were chosen as criterion measures. The results provide first evidence of the construct and criterion validity of this assessment center approach for rating interpersonal and performance-related skills. Furthermore the type of assessor (psychologist versus pilot) moderates the predictive validity of the assessment center ratings. This type-of-assessor effect depends on the kind of predictor variables. The results are discussed and practical implications are suggested. [source]