Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures
Differential item functioning (DIF) is a procedure to identify whether an item favours a particular group of respondents once they are matched on respective ability levels. There are numerous procedures reported in the literature to detect DIF, but the Mantel-Haenszel (MH), Standardized Proportion Difference (SPD), and BILOG-MG are frequently used to ensure the fairness of assessments. The aim of the present study was to compare procedural characteristics using empirical data. We found Mantel-Haenszel and standardized proportion difference provide comparable results while BILOG-MG has flagged a large number of items, but the magnitude of DIF was trivial from a test development perspective. The results also showed Mantel-Haenszel and standardized proportion difference index provide the effect size measure of DIF, which facilitates for further necessary actions, especially for item writers and practitioners.
-
Differential Item Functioning, Effect Size, Classification, MH, SPD, BILOG-MG
-
(1) Muhammad Naveed Khalid
Resource Person, Allama Iqbal Open University, Lahore, Punjab, Pakistan.
(2) Farah Shafiq
Assistant Professor, Department of Education, University of Education, Lahore, Punjab, Pakistan.
(3) Shehzad Ahmed
Assistant Professor, Faculty of Education, University of Okara, Punjab, Pakistan.
- Candell, G. L., & Drasgow, F. (1988). An iterative procedure for linking metrics and assessing item bias in item response theory. Applied Psychological Measurement, 12, 253-260.
- Clauser, B. E., Mazor, K. M., & Hambleton, R. K. (1993). The effects of purification of the matching criterion on the identification of DIF using the Mantel-Haenszel procedure. Applied Measurement in Education, 6, 269- 279.
- Dorans, N. J., & Holland, P. W. (1993). DIF Detection and Description: Mantel- Haenszel and Standardization. En PW Holland and H. Wainer (Eds.), Differential Item Functioning, New Jersey: Lawrence Erlbaum Associates, Inc.
- Dorans, N. J., & Kulick, E. (1986). Demonstrating the utility of the standardization approach to assessing the unexpected differential item functioning on the Scholastic Aptitude Test. Journal of Educational Measurement, 23, 355- 368.
- Donoghe, J. R., & Allen, N. L. (1993). Thin versus thick matching in the Mantel-Haenszel procedure for detecting DIF. Journal of Educational Statistics, 18, 131-154.
- Guilera, G., Gómez-Benito, J., & Hidalgo, M. D. (2009). Scientific production on the Mantel- Hanszel procedure as a way of detecting DIF. Psicothema, 21, 492- 498.
- Hidalgo, M. D., & Gómez-Benito, J. (2010). Education measurement: Differential item functioning. In P.
- Peterson, E., Baker, & McGaw, B. (Eds.), International Encyclopedia of Education (3rd edition). USA: Elsevier - Science & Technology.
- Hidalgo-Montesinos, M. D., & Gómez-Benito, J. (2003). Test Purification and the Evaluation of Differential Item Functioning with Multinomial Logistic Regression. European Journal of Psychological Assessment, 19, 1-11.
- Holland, P. W., & Thayer, D. T. (1988). Differential item performance and Mantel- Haenszel procedure. In H. Wainer & H. I. Braun (Eds.), Test Validity, 129-145. Hillsdale, N.J.: Erlbaum.
- Holland, P W & Wainer, H (Eds.) (1993). Differential item functioning. Lawrence Erlbaum.
- Kim S.H., and Cohen, A.S. (1992). IRTDIF: A computer program for IRT differential item functioning analysis [Computer Program] University of Wisconsin-Madison.
- Mellenbergh, G. J. (1982). Contingency table models for assessing item bias. Journal of Educational Statistics 7, 105-118.
- Millsap, R. E., & Everson, H. T. (1993). Methodology review: Statistical approaches for assessing measurement bias. Applied Psychological Measurement 17, 297-334.
- Muraki, E., & Engelhard, G. (1989, April). Examining differential item functioning with BIMAIN. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
- Zimowski, M. F., Muraki, E., Mislevy, R. J., & Bock, R. D. (1996). BILOG-MG: Multiple- Group IRT Analysis and Test Maintenance for Binary Items. Chicago, IL: Scientific Software International.
- Zenisky, A. L., Hambleton, R. K., & Robin, F. (2003). Detection of differential item functioning in large-scale state assessments: A study evaluating a two-stage approach. Educational and Psychological Measurement, 63, 49-62.
- Zwick, R., & Ercikan, K. (1989). Analysis of differential item functioning in the NAEP history assessment. Journal of Educational Measurement, 26, 55-66.
Cite this article
-
APA : Khalid, M. N., Shafiq, F., & Ahmed, S. (2021). Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures. Global Educational Studies Review, VI(III), 71-78. https://doi.org/10.31703/gesr.2021(VI-III).08
-
CHICAGO : Khalid, Muhammad Naveed, Farah Shafiq, and Shehzad Ahmed. 2021. "Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures." Global Educational Studies Review, VI (III): 71-78 doi: 10.31703/gesr.2021(VI-III).08
-
HARVARD : KHALID, M. N., SHAFIQ, F. & AHMED, S. 2021. Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures. Global Educational Studies Review, VI, 71-78.
-
MHRA : Khalid, Muhammad Naveed, Farah Shafiq, and Shehzad Ahmed. 2021. "Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures." Global Educational Studies Review, VI: 71-78
-
MLA : Khalid, Muhammad Naveed, Farah Shafiq, and Shehzad Ahmed. "Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures." Global Educational Studies Review, VI.III (2021): 71-78 Print.
-
OXFORD : Khalid, Muhammad Naveed, Shafiq, Farah, and Ahmed, Shehzad (2021), "Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures", Global Educational Studies Review, VI (III), 71-78
-
TURABIAN : Khalid, Muhammad Naveed, Farah Shafiq, and Shehzad Ahmed. "Detection of Differential Item Functioning Using Mantel-Haenszel, Standardization Proportion and BILOG-MG Procedures." Global Educational Studies Review VI, no. III (2021): 71-78. https://doi.org/10.31703/gesr.2021(VI-III).08