Item bias approaches Comparison of TIMSS test among Egyptian environment

  • Mahmoud Ali Moussa Lecturer of Assessment and Educational Evaluation, College of Education, Suez Canal university, Egypt.
Keywords: item bias, differential item functioning, test fair.


Item bias approaches Comparison of TIMSS test among Egyptian environment


The study aimed that comprise the item bias approaches of achievement tests. Analytical descriptive approach had been used. TIMSS 2007, and TIMSS 2015 data archive had been used. The uniform differential item functioning methods used to test the item bias. The study comprises between ANCOVA and Multiple regression. The finding was the Alg2 item was biased in Algebra dimension.

Keywords: item bias, differential item functioning, test fair.


Download data is not yet available.


Abah, J. (2018). The quest for statistical significance: Ignorance, bias and malpractice of research practitioners. International Journal of Research and Review, 5(3), 112-129.

Akcan, R., & Kabasakal, K. A. (2019). An Investigation of Item Bias of English Test: The Case of 2016 Year Undergraduate Placement Exam in Turkey. International Journal of Assessment Tools in Education, 6(1), 48-62.‏

Berger, M., & Tutz, G. (2016). Detection of uniform and nonuniform differential item functioning by item-focused trees. Journal of Educational and Behavioral Statistics, 41(6), 559-592.

Carnoy, M., Khavenson, T., & Ivanova, A. (2015). Using TIMSS and PISA results to inform educational policy: a study of Russia and its neighbours. Compare: A Journal of Comparative and International Education, 45(2), 248-271.

Cole, S. R., Kawachi, I., Maller, S. J., & Berkman, L. F. (2000). Test of item-response bias in the CES-D scale: experience from the New Haven EPESE study. Journal of clinical epidemiology, 53(3), 285-289.‏

Duncan, S. C. (2007). Improving the prediction of differential item functioning: A comparison of the use of an effect size for logistic regression DIF and Mantel-Haenszel DIF methods(Doctoral dissertation, Texas A&M University).

Elena-Oliveri, M., Lawless, R., Robin, F., & Bridgeman, B. (2018). An exploratory analysis of differential item functioning and its possible sources in a higher education admissions context. Applied Measurement in Education, 31(1), 1-16.

Engelhard Jr, G., Hansche, L., & Rutledge, K. E. (1990). Accuracy of bias review judges in identifying differential item functioning on teacher certification tests. Applied measurement in education, 3(4), 347-360.

Feingold, A. (2013). A regression framework for effect size assessments in longitudinal modeling of group differences. Review of General Psychology, 17(1), 111.‏

Finch, W. H. (2016). Detection of differential item functioning for more than two groups: A Monte Carlo comparison of methods. Applied Measurement in Education, 29(1), 30-45.

Gesicki, A. (2015). Decision rules based on hypothesis tests and effect sizes for logistic regression differential item functioning (Doctoral dissertation, University of British Columbia).

Grønmo, L. S., Lindquist, M., Arora, A., & Mullis, I. V. (2015). TIMSS 2015 mathematics framework. TIMSS, 11-27.

Hamilton, L. S. (1999). Detecting gender-based differential item functioning on a constructed-response science test. Applied measurement in Education, 12(3), 211-235.

Harter, J. K., & Agrawal, S. (2011). Cross-cultural analysis of Gallup’s Q12 employee engagement instrument. Omaha, NE: Gallup.

Innabi, H., & Dodeen, H. (2018). Gender differences in mathematics achievement in Jordan: A differential item functioning analysis of the 2015 TIMSS. School Science and Mathematics, 118(3), 127-137.

Johansone, I. (2015). Survey operations procedures in TIMSS 2015. Methods and procedures in TIMSS.

Karami, H. (2012). An introduction to differential item functioning. The International Journal of Educational and Psychological Assessment.‏

Karpen, S. C. (2017). Misuses of Regression and ANCOVA in Educational Research. American Journal of Pharmaceutical Education, 81(8), 65-101.

Liou, P. Y., & Hung, Y. C. (2015). Statistical techniques utilized in analyzing PISA and TIMSS data in science education from 1996 to 2013: A methodological review. International Journal of Science and Mathematics Education, 13(6), 1449-1468.

Liu, O. L. (2011). Do major field of study and cultural familiarity affect TOEFL® iBT reading performance? A confirmatory approach to differential item functioning. Applied Measurement in Education, 24(3), 235-255.

Martin, M. O., Mullis, I. V., & Foy, P. (2015). TIMSS 2015 assessment design. TIMSS, 85-99.

Ross, A., & Willson, V. L. (2017). Multiple Regression with Two Continuous Predictors and the Interactions Betweenbetween Them. In Basic and Advanced Statistical Tests (pp. 75-86). SensePublishersSense Publishers, Rotterdam.

Sachse, K. A., & Haag, N. (2017). Standard errors for national trends in international large-scale assessments in the case of cross-national differential item functioning. Applied Measurement in Education, 30(2), 102-116.

Sireci, S. G., Harter, J., Yang, Y., & Bhola, D. (2003). Evaluating the equivalence of an employee attitude survey across languages, cultures, and administration formats. International Journal of Testing, 3, 129–150.

Sireci, S. G., Yang, Y., Harter, J., & Ehrlich, E. J. (2006). Evaluating guidelines for test adaptations. Journal of Cross-Cultural Psychology, 37, 557–567.

Stephens, M., Landeros, K., Perkins, R., & Tang, J. H. (2016). Highlights from TIMSS and TIMSS Advanced 2015: Mathematics and Science Achievement of US Students in Grades 4 and 8 and in Advanced Courses at the End of High School in an International Context. NCES 2017-002. National Center for Education Statistics.

Stone, E., Cook, L., Laitusis, C. C., & Cline, F. (2010). Using differential item functioning to investigate the impact of testing accommodations on an English-language arts assessment for students who are blind or visually impaired. Applied Measurement in Education, 23(2), 132-152.

Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27, 361-370.

Sweeney, K. P. (1996). A Monte Carlo investigation of the likelihood-ratio procedure in the detection of differential item functioning. Unpublished doctoral dissertation, Fordham University, New York, NY

Thomson, S., Wernert, N., O'Grady, E., & Rodrigues, S. (2016). TIMSS 2015: A first look at Australia's results.

Wang, N., & Lane, S. (1996). Detection of gender-related differential item functioning in a mathematics performance assessment. Applied Measurement in Education, 9(2), 175-199.

Zumbo, B. D. (1999). A handbook on the theory and methods of differential item functioning (DIF). Ottawa: National Defense Headquarters.

Zumbo, B. D., & Thomas, D. R. (1997). A measure of effect size for a model-based approach for studying DIF. Working Paper of the Edgeworth Laboratory for Quantitative Behavioral Science, University of Northern British Columbia: Prince George, B.C.

Zwick, R., Thayer, D. T., & Mazzeo, J. (1997). Descriptive and inferential procedures for assessing differential item functioning in polytomous items. Applied Measurement in Education, 10(4), 321-344.

How to Cite
Moussa, M. (2019). Item bias approaches Comparison of TIMSS test among Egyptian environment. International Journal of Research in Educational Sciences., 2(4), 501 - 540. Retrieved from

Most read articles by the same author(s)