首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
To learn if Protestant ethic endorsement predicted intelligence controlling for the big five personality factors, 364 college students from England and the United States completed a 65-item multifaceted work ethic endorsement measure, the 50-item Wonderlic Personnel Test, and a 60-item measure of the big five personality factors. A hierarchical multiple regression revealed that, after controlling for respondent's sex and the big five, work ethic facets accounted for 5.6% of the variability in Wonderlic scores. However, the direction of the relationship between PWE facets and Wonderlic scores depended on the PWE facet in question. Furthermore, the facets accounted for more variability than did a composite work ethic score. The discussion focuses on the multidimensional nature of PWE endorsement and future research with this construct.  相似文献   

2.
Due to variation in test difficulty, the use of pre-fixed cut-off scores in criterion-referenced standard setting methods may lead to variation in grades and pass rates. This paper aims to empirically investigate the strength of this relationship. To this end we examine a dataset of over 500 observations from an institution of higher education in The Netherlands over the period 2008–2013. We measure variation in test difficulty by using students’ perceptions of the validity of the examination and by recording personnel changes in the primary instructor. The latter measure is based on the considerable variation in teachers’ ability to assess test difficulty that is found in the literature. Other explanatory variables are course evaluations, instructor evaluations and self-reported study time. Variation in student quality is controlled for by measuring course results in deviation from the cohort average. We take a panel approach in estimating the effect of the explanatory variables on the variability in grades and pass rates. Our findings indicate that exam validity and instructor change are significantly related to variation in test results. The latter finding supports the hypothesis that instructors’ difficulty in assessing test difficulty may introduce subjectivity in criterion-referenced standard setting methods.  相似文献   

3.
This article discusses concepts of variability and suggests a possible alternative measure that appears to support some students’ intuition.  相似文献   

4.
Research Findings: The growing body of literature demonstrating the importance of quality interactions with caregivers to infant development coupled with the increasing number of infants spending time in classroom settings highlights the need for a measure of interpersonal relationships between infants and caregivers. This article introduces a new measure of quality in infant classrooms based on the Classroom Assessment Scoring System (CLASS) framework. This measure focuses on teacher–infant interactions with the goal of understanding how these proximal process features can be assessed in this environment. Results from a small pilot study of 30 infant classrooms indicated that the CLASS–Infant demonstrated adequate variability as well as expected convergent and divergent validity with the most commonly used infant child care quality measure. The dimensions of the measure composed a single construct of classroom quality based on teacher–infant interactions. Practice or Policy: Implications of using this measure as an assessment of center-based infant classroom quality and improving professional development are discussed.  相似文献   

5.
The question is whether or not we or any student are allowed to define a measure of variability in some other way that is not written in textbooks and officially approved. We compare the sum of squares of all deviations against the usual sum of squares of deviations from the mean.  相似文献   

6.
《教育实用测度》2013,26(4):323-342
This study provides empirical evidence about the sampling variability and generalizability (reliability) of a statewide science performance assessment. Results at both individual and school levels indicate that task-sampling variability was the major source of measurement error in the performance assessment; rater-sampling variability was negligible. Adding more tasks improves the generalizability of the measurement. For the school-level assessment, the variation of performance among students within a school was larger than the variation among schools. Increasing the number of students taking a test within a school thus increases the generalizability of the assessment. Finally, the allocation of students in a matrix-sampling design is compared to a studentscrossed-with-tasks design. The former would require fewer tasks per student than the latter to build a generalizable measure of school performance.  相似文献   

7.
This study describes the initial validation of an innovative social‐‐behavioral observational assessment tool that is designed to be used on a repeated basis to assess growth and development of social competence over time to: (a) identify the social functioning of all students, (b) assist in planning support for students at risk, and (c) evaluate the effectiveness of individual and system‐wide interventions. Eighteen first‐grade students were monitored over an 8‐week period using the Initiation‐Response Assessment (IRA) Code. The School Social Behavior Scales, a published teacher rating scale, was included as a criterion measure. Estimates of reliability and criterion‐related validity were calculated for the IRA. The measure's sensitivity to growth over time and between‐group variability were also assessed using hierarchical linear modeling procedures. Results indicate that scores on this measure are stable, and tap constructs similar to those assessed via teacher rating. © 2008 Wiley Periodicals, Inc.  相似文献   

8.
Many Hong Kong-Chinese parents are active in their support for their children’s music training. To better understand this support, the Parental Involvement in Music Training Questionnaire (PIMTQ) is designed to measure the variability in parental involvement in their children’s music training. This study begins by exploring the factor structure of the PIMTQ and then establishes its measurement properties using Rasch modelling. Two hundred and ninety-five Hong Kong-Chinese parents completed a Chinese version of the 42-item instrument with principal components analysis of the responses showing seven factors. However, Rasch modelling showed that two of the five factors (Family Music Background and Family Music Interest) are unable to reliably predict variability in parent responses. We conclude, however, that the remaining five factors (Parental Support Toward Music Training, Parental Expectations, Home Music Environment, Music Programme Support and Attitude Toward Music) of the PIMTQ can be used as subscales to measure the involvement of Hong Kong-Chinese parents in their children’s music training.  相似文献   

9.
The purpose of this study was to explore the influence of the number of targets specified on the quality of exploratory factor analysis solutions with a complex underlying structure and incomplete substantive measurement theory. Three Monte Carlo studies were performed based on the ratio of the number of observed variables to the number of underlying factors. Within each study, communality, sample size, and the number of targets were manipulated. Outcomes included a measure of congruence and a measure of variability with regard to the rotated pattern matrix. The magnitude of the main effect for the influence of the number of targets on congruence and variability ranged from moderate to large. The magnitude of the interaction between the number of targets and level of communality ranged from small to moderate with regard to congruence and variability. Consistent with theoretical expectations, the minimum number of targets to specify to be reasonably assured of obtaining an accurate solution varied across study conditions.  相似文献   

10.
在当今高度竞争、瞬息万变的市场环境中,传统营销的理念无论从方法还是手段上都不能满足新经济发展的要求,而现代营销在观念更新、电子商务、客户关系管理等方面的变化适应了21世纪经济发展的潮流。  相似文献   

11.
In this article, we outline the initial stages in development of an assessment instrument for Australian Sign Language and explore issues involved in the development of such a test. We first briefly describe the instruments currently available for assessing grammatical skills in Australian Sign Language and discuss the need for a more objective measure. We then describe our adaptation of an existing American Sign Language test, the Test Battery for American Sign Language Morphology and Syntax. Finally, this article presents some of the data collected from a group of deaf native signers. These data are used to demonstrate the range of variability in key grammatical features of Australian Sign Language and to raise methodological issues associated with signed language test design.  相似文献   

12.
In two experiments, the behavioral effects of different response-feedback contingencies were examined with a task requiring human subjects to repeatedly type three-key sequences on a computer keyboard. In Experiment 1, the subjects first received positive feedback for response variability, followed by no feedback, or vice versa. In Experiment 2, the subjects first received positive feedback for response variability, followed by response-independent positive feedback, or vice versa. Response stability and variability were examined using different measures, such as percentage of trials meeting the variability criteria, frequency of use of the different response alternatives, and autocorrelations as an index of response randomness. The subjects’ behavior in the first phase in each condition came to reflect the current feedback contingency. Depending on the measure examined, responding after each contingency change was characterized by both response stability and decreases or increases in response variability. The collective results are discussed in the framework of previous animal and human studies on behavioral stability and variability.  相似文献   

13.
No Child Left Behind (NCLB) performance mandates, embedded within state accountability systems, focus school AYP (adequate yearly progress) compliance squarely on the percentage of students at or above proficient. The singular importance of this quantity for decision-making purposes has initiated extensive research into percent proficient as a measure of school quality. In particular, technical discussions have scrutinized the impact of sampling, measurement, and other sources of error on percent proficient statistics. In this article, we challenge the received orthodoxy that measurement error associated with individual students' scores is inconsequential for aggregate percent proficient statistics. Synthesizing current classification accuracy research with techniques from randomized response designs, we establish results which specify the extent to which measurement error—manifest as performance level misclassifications—produces bias and increases error variability for percent at performance level statistics. The results have direct relevance for the design of coherent and fair accountability systems based upon assessment outcomes.  相似文献   

14.
In this article, performance assessments are cast within a sampling framework. More specifically, a performance assessment is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, we present evidence bearing on the generalizability and convergent validity of performance assessments sampled from a range of measurement facets and measurement methods. Results at both the individual and school level indicate that task-sampling variability is the major source ofmeasurment error. Large numbers of tasks are needed to get a reliable measure of mathematics and science achievement at the elementary level. With respect to convergent validity, results suggest that methods do not converge. Students' performance scores, then, are dependent on both the task and method sampled.  相似文献   

15.
The purposes of this study were to determine the degree to which two measures of phonological awareness/ability (Test of Phonological Awareness; Comprehensive Test of Phonological Processing) correlate with each other and with a measure of reading (Letter‐Word Identification), and to determine which of the individual measures of phonological ability best predict early reading skill in kindergarten children. With one exception, all correlations among measures of phonological awareness/ability were significant. In addition, all correlations among the phonological awareness measures and the reading measure were significant. Multiple regression analyses revealed that the combination of all predictor variables accounted for approximately 51% of the variability in scores on the Woodcock Letter‐Word Identification subtest. The Phonological Awareness and the Rapid Naming composites of the CTOPP were the best predictors of performance on the measures of word identification. © 2002 Wiley Periodicals, Inc.  相似文献   

16.
瞳孔直径大小检测驾驶员疲劳的实证效度分析   总被引:2,自引:0,他引:2  
驾驶员疲劳是指驾驶员在驾驶过程中,身体机能整体性的下降现象,它是导致交通事故的主要因素之一。因此,选取一种有效可靠的生理指标检测驾驶员的工作疲劳具有重要的实践意义。本研究采用驾驶员动态危险知觉视频量表加载工作负荷,诱发疲劳。使用自我报告法、眼动追踪与心电技术,测查39名驾驶员的疲劳状态。结果显示:采用瞳孔直径指标检测驾驶疲劳具有跨人群的一致性,随着驾驶员自我报告的困倦感增加,其瞳孔直径缩小,心率变异性减小,瞳孔直径与心率变异性呈显著正相关。说明随着工作时间的延长,驾驶员逐渐进入疲劳状态,瞳孔直径可以成为检测驾驶员心理疲劳的有效指标。  相似文献   

17.
During the past several years measurement and instructional specialists have distinguished between norm-referenced and criterion-referenced approaches to measurement. More traditional, a norm-referenced measure is used to identify an individual's performance in relation to the performance of others on the same measure. A criterion-referenced test is used to identify an individual's status with respect to an established standard of performance. This discussion examines the implications of these two approaches to measurement, particularly criterion-referenced measurement, with respect to variability, item construction, reliability, validity, item analysis, reporting, and interpretation.  相似文献   

18.
Abstract

In this meta-analysis, the authors attempted to develop a parsimonious model of factors that account for the significant variability in the findings on the effects of within-class grouping on student achievement. Two weighted least squares regression models were tested using 103 independent findings from 51 studies at elementary through postsecondary grades. Results indicate that the most important study features that accounted for 48% of the total variance include outcome measure source, teacher training equivalence, grouping basis, type of small-group instruction method, grade level, and relative ability of students. Goodness-of-fit statistics indicate that the model fits the data and that the remaining variance may be explained by sampling errors.  相似文献   

19.
Although there have been numerous studies investigating the predictive validity of early assessment, observed predictive validity coefficients across studies are not stable. A validity generalization study was conducted in order to answer the question of whether the relationship between early assessment of children and later achievement is generalizable or situation-specific. This study examined 716 predictive correlation coefficients from 44 studies using Hierarchical Linear Modeling (HLM). The findings of this study revealed that predictive validity of early assessment is not generalizable. Additional analyses indicated that predictive validity differ across assessments as a function of test type, specific construct being assessed, length of prediction, and administration procedures. The most impressive finding in this study was the variability of effect sizes across different test administration types. In particular, tests that were scored through ratings were found to be most effective. These findings suggest that instead of addressing a broad predictive validity between a test and a criterion measure, it is necessary to understand early assessment procedures as a whole system by including considerations of various variables related to testing conditions.  相似文献   

20.
Ability tests are used by teachers to provide additional context for interpreting student achievement and as a tool for differentiating instruction to the cognitive strengths and weaknesses of students. Tests that provide the most useful information for these purposes measure school-related content domains including verbal and quantitative reasoning. However, there is concern that verbal content affects validity for culturally and linguistically diverse students. In this study, the structure of a multidimensional ability test of verbal, quantitative, and nonverbal reasoning skills was explored in three groups of students who varied in language and cultural background. Configural invariance and invariant factor loadings were supported, but the verbal battery showed weaker relationships to the other batteries and reduced variability for English learners. Results indicate that battery-level scores are appropriate for all students, but that accounting for educational opportunity may be required for interpreting scores.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号