我们的网站为什么显示成这样?

可能因为您的浏览器不支持样式,您可以更新您的浏览器到最新版本,以获取对此功能的支持,访问下面的网站,获取关于浏览器的信息:

|本期目录/Table of Contents|

 融合反应时的多级评分IRT模型开发及其应用研究(PDF)

《心理学探新》[ISSN:1003-5184/CN:36-1228/B]

期数:
 2022年03期
页码:
 269
栏目:
 心理统计与测量
出版日期:
 2022-06-20

文章信息/Info

Title:
 Research on Development and Application of Polytomous IRT Model Incorporating Response Times
文章编号:
1003-5184(2022)03-0269-10
作者:
 汪大勋1郭莹莹2
 (1.江西师范大学,南昌 330022; 2.海亮教育科技集团,杭州 310052 )
Author(s):
 Wang Daxun1Guo Yingying2
 (1.Jiangxi Normal University,Nanchang 330022; 2.Hailiang Education Group Inc.,Hangzhou 310052)
关键词:
 项目反应理论 GPCM模型 JRT-GPCM模型 MCMC算法
Keywords:
 item response theory GPCM model JRT-GPCM model MCMC algorithm
分类号:
 B841.2
DOI:
 -
文献标识码:
 A
摘要:
 当前大多数融合反应时的IRT模型仅适用于0-1评分数据资料,极大的限制了IRT反应时模型在实际中的应用。本文在传统的二级计分反应时IRT模型基础上,拟开发一种多级评分反应时模型。在层次建模框架下,分别采用拓广分部评分模型(GPCM)和对数正态模型构建融合反应时的多级评分IRT模型(本文记为JRT-GPCM),并采用全息贝叶斯MCMC算法实现新模型的参数估计。为验证新开发的JRT-GPCM模型的可行性及其在实践中的应用,本文开展了两项研究:研究1为模拟实验研究,研究2为新模型在大五人格-神经质分量表中的应用。研究1结果表明,JRT-GPCM模型的估计精度较高,且具有较好的稳健性。研究2表明,被试的潜在特质与作答速度具有一定的正相关,且本研究结果支持Ferrando和Lorenzo-Seva(2007)提出的“距离-困难度假设”,即当被试的潜在特质与项目的难度阈限距离越远,那么被试会花费更多的时间对项目进行作答。总之,本研究为拓展反应时信息在心理测量及教育中的应用提供新的方法支持。
Abstract:
 With the development of computer testing technology,collecting reaction time has become a routine work of many large-scale tests.However,most current IRT models for fusion reaction time are only applicable to 0-1 score data,which greatly limits the application of IRT model in practice.Based on the traditional two-level scoring response time IRT model,this paper intends to develop a multilevel scoring response time model.Under the framework of hierarchical modeling,the extended partial scoring model(GPCM)and the log-normal model(jrt-gpcm)were used to construct the multi-stage scoring IRT model(jrt-gpcm)for fusion reaction,and the parameter estimation of the new model was realized by the holographic bayesian MCMC algorithm.In order to verify the feasibility of the newly developed jrt-gpcm model and its application in practice,this paper carried out two studies:Study 1 for simulation experiment research,the use of 2 x 2 double factor experiment design,one factor for the number of participants(1000 and 2000 respectively,the two level),another factor for the test number(20 and 30 respectively two levels),all items of 0,1,2,3 multistage grading,using holographic bay leaf,MCMC algorithm for parameter estimation,and validates the feasibility of MCMC algorithm and JRT-GPCM model to estimate accuracy; Study 2 for JRT-GPCM model in the application of the big five personality-neurotic subscales,testing group for college students,this paper USES the computer answer way,collected a total of 1030 data(including the answer in each available data that reaction time),by eliminating the invalid data(such as too many missing data/answer exception)on lie detection problem,the final valid data is 845.Study 1 results show that under the JRT-GPCM model,the estimated method of MCMC algorithm by fairly robustness,and the precision of the item and the person the parameters was preferably great,model has good robustness,and the topic,the more the higher estimation precision,It indicated that the number of subjects indicated that the rrt-gpcm model was reasonable and feasible.Study 2 shows that the parameter estimation indexes of all items are basically less than 1.1,indicating the convergence of parameter estimation of MCMC algorithm.The variance of each parameter and the standard deviation of the covariance are small,which indicates that the model has good robustness in empirical research.The 12 questions on the neurotic subscale ranged from 0.895 to 1.209,all of which were greater than 0.7(Fliege,2015),indicating that the 12 questions were of good quality.There was a positive correlation between the potential traits and the response speed of the subjects.The higher the neurotic tendency of the subjects,the higher the potential traits and the faster the response speed.Project step parameters(the location parameter)and its parameters is related to the intensity of time,the greater the absolute value of that project step parameters(off center value,then represents to the characteristics of extreme levels),so the participants answers in the less time needed for the project,namely the time intensity is small,the results support Ferrando and Lorenzo-Seva(2007)proposed “distance-difficult holiday”.In conclusion,this study provides a new method to expand the application of response time information in psychological measurement and education.

参考文献/References

 聂衍刚,林崇德,郑雪,丁莉,彭以松.(2008).青少年社会适应行为与大五人格的关系.心理科学,31(4),774-779.
周明元,甘登文,丁树良.(2005).GPCM模型项目参数估计程序的开发与研究.心理学探新,25(1),57-60.
Amelang,M.,Eisenhut,K.,& Rindermann,H.(1991).Responding to adjective check list items:A reaction time analysis.Personality and Individual Differences,12,523-533.
Andrich,D.(1978).A rating formulation for ordered response categories.Psychometrika,43,561-573.
Brooks,S.P.,& Gelman,A.(1998).General methods for monitoring convergence of iterative simulations.Journal of Computational and Graphical Statistics,7,434-455.
Bolsinova,M.,& Maris,G.(2016).A test for conditional independence between response time and accuracy.British Journal of Mathematical and Statistical Psychology,69,62-67.
Costa,P.T.Jr.,& McCrae,R.R.(1985).The NEO personality inventory manual.Odessa,FL:Psychological Assessment Resources.
Ferrando,P.J.,& Lorenzo-Seva,U.(2007).An item response theory model for incorporating response time data in binary personality items.Applied Psychology,31,525-543.
Fliege,H.,Becker,J.,Walter,O.B.,Bjorner,J.B.,Klapp,B.F.,& Rose,M.(2005).Development of a computer-adaptive test for depression(D-CAT).Quality of Life Research,14(10),2277-2291.
Fox,J.-P.(2010).Bayesian item response modeling:Theory and applications.New York,NY:Springer.
Fox,J.-P.,& Marianti,S.(2016).Joint modeling of ability and differential speed using responses and response times.Multivariate Behavioral Research,51,540-553.
Gelfand,A.E.,& Smith,A.F.M.(1990).Sampling-based approaches to calculating marginal densities.Journal of the American Statistical Association,85,398-409.
Jackson,D.N.(1986).The process of responding in personality assessment.In A.Angleitner & J.S.Wiggins(Eds.),Personality assessment via questionnaires(pp.123-143).Berlin,Germany:Springer-Verlag.
Jochen,R.(2013).Modeling responses and response times in personality tests with rating scales.Psychological Test and Assessment Modeling,55(4),361-382.
Klein Entink,R.H.,Fox,J.-P.,& van der Linden,W.J.(2009).A multivariate multilevel approach to the modeling of accuracy and speed of test takers.Psychometrika,74,21-48.
Klein Entink,R.H.,van der Linden,W.J.,& Fox,J.-P.(2009).A box-cox normal model for response times.British Journal of Mathematical and Statistical Psychology,62,621-640.
Lee,Y.-H.,& Chen,H.(2011).A review of recent response-time analyses in educational testing.Psychological Test and Assessment Modeling,53,359-379.
Locke,E.A.(1965).Interaction of ability and motivation in performance.Perceptual and Motor Skills,21,719-725.
Logan,S.,Medford,E.,& Hughes,N.(2011).The importance of intrinsic motivation for high and low ability readers' reading comprehension performance.Learning and Individual Differences,21,124-128.
Luce,R.D.(1986).Response times:Their role in inferring elementary mental organization.New York,NY:Oxford University Press.
Masters,G.(1982).A rasch model for partial credit scoring.Psychometrika,47(2),149-174.
Meng,X.-B.,Tao,J.,& Chang,H.-H.(2015).A conditional joint modeling approach for locally dependent item responses and response times.Journal of Educational Measurement,52,1-27.
Meyer,J.P.(2010).A mixture rasch model with item response time components.Applied Psychological Measurement,34,521-538.
Molenaar,D.,Tuerlinckx,F.,& van der Maas,H.L.J.(2015).A generalized linear factor model approach to the hierarchical framework for responses and response times.British Journal of Mathematical and Statistical Psychology,68,197-219.
Muraki,E.(1992).A generalized partial credit model:Application of an em algorithm.Applied Psychological,16(2),159-176.
Plummer,M.(2015).JAGS Version 4.0.0 user manual.Retrieved from http://sourceforge.net/ projects/mcmc-jags/.
Qian,H.,Staniewska,D.,Reckase,M.,& Woo,A.(2016).Using response time to detect item preknowledge in computer-based licensure examinations.Educational Measurement:Issues and Practice,35(1),38-47.
Su,Y.-S.,& Yajima,M.(2015).R2jags:Using R to run ‘JAGS'.R package version 0.5-7.Retrieved from http://CRAN.R-project.org/package=R2jags.
Suh,H.(2010).A study of bayesian estimation and comparison of response time models in item response theory.Doctoral dissertation,University of Kansas,Lawrence,KS.
van der Linden,W.J.(2006).A lognormal model for response times on test items.Journal of Educational and Behavioral Statistics,31,181-204.
van der Linden,W.J.(2007a).A hierarchical framework for modeling speed and accuracy on test items.Psychometrika,72,287-308.
van der Linden,W.J.,Breithaupt,K.,Chuah,S.C.,& Zhang,Y.(2007b).Detecting differential speededness in multistage testing.Journal of Educational Measurement,44,117-130.
van der Linden,W.J.(2007c).Conceptual issues in response-time modeling.Journal of Educational Measurement,46,247-272.
van der Linden,W.J.(2008).Using response times for item selection in adaptive testing.Journal of Educational and Behavioral Statistics,33,5-20.
van der Linden,W.J.,& Guo,F.(2008).Bayesian procedures for identifying aberrant response-time patterns in adaptive testing.Psychometrika,73,365-384.
van der Linden,W.J.(2009).Conceptual issues in response-time modeling.Journal of Educational Measurement,46,247-272.
van der Linden,W.J.,& Fox,J.-P.(2015).Joint hierarchical modeling of responses and response times.In W.J.van der Linden(Ed.),Handbook of item response theory:Vol.1.Models(pp.481-500).Boca Raton,FL:Chapman & Hall/CRC.
Vickers,D.(1980).Discrimination.In A.T.Welford(Ed.),Reaction times(pp.25-72).New York:Academic Press.
Wang,C.,Chang,H.,& Douglas,J.(2013).The linear transformation model with frailties for the analysis of item response times.Journal of Mathematical and Statistical Psychology,66,144-168.
Wang,C.,& Xu,G.(2015).A mixture hierarchical model for response times and response accuracy.British Journal of Mathematical and Statistical Psychology,68,456-477.
Wang,T.,& Hanson,B.A.(2005).Development and calibration of an item response model that incorporates response time.Applied Psychological Measurement,29,323-339.
Wise,S.L.,& DeMars,C.E.(2006).An application of item response time:The effort-moderated IRT model.Journal of Educational Measurement,43,19-38.
Wise,S.L.,& Kong,X.(2005).Response time effort:A new measure of examinee motivation in computer-based tests.Applied Measurement in Education,18,163-183.
Zhan,P.,Jiao,H.,& Liao,D.(2017).Cognitive diagnosis modelling incorporating item response times.British Journal of Mathematical and Statistical Psychology,71,262-286.

备注/Memo

备注/Memo:
 -
更新日期/Last Update:  2022-11-20