التعرف على قيمة مجموع الدرجات، أعظم إنجازات علم النفس القياسي Recognize the Value of the Sum Score, Psychometrics’ Greatest Accomplishment

المجلة: Psychometrika، المجلد: 89، العدد: 1
DOI: https://doi.org/10.1007/s11336-024-09964-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38627311
تاريخ النشر: 2024-03-01
المؤلف: Klaas Sijtsma وآخرون
الموضوع الرئيسي: مواضيع بحث الصحة النفسية

نظرة عامة

في هذا القسم، يدافع المؤلفون عن الاستمرار في أهمية درجات المجموع في الممارسة النفسية، متعارضين مع الرأي الذي يحمله بعض علماء النفس بأن هذه الدرجات يجب أن تُستبدل بنماذج المتغيرات الكامنة. يؤكدون أن درجات المجموع ترتب المتغيرات الكامنة بشكل عشوائي عبر نماذج استجابة العناصر المختلفة، مدعومة بنظرية استجابة العناصر، التي توفر مبررًا رياضيًا لتطبيقها الترتيبي.

علاوة على ذلك، يتناول المؤلفون المخاوف المتعلقة بموثوقية وتقدير درجات المجموع، موضحين أن نظرية الاختبار الكلاسيكية تقدم مجموعة من الحدود الدنيا للموثوقية التي غالبًا ما تكون قريبة من القيم الحقيقية تحت افتراضات معقولة. يؤكدون أن فائدة درجات المجموع تكمن في قدرتها على التنبؤ بالسلوكيات والنتائج ذات الصلة. بينما يعترفون بمزايا نماذج القياس الحديثة، يجادل المؤلفون بأن نظرية الاختبار الكلاسيكية تظل ذات قيمة بسبب بساطتها وقابليتها للتطبيق العملي، والتي تم تجاهلها في المناقشات الأخيرة.

مناقشة

في هذا القسم، يناقش المؤلفون العلاقة بين درجة المجموع والمتغيرات الكامنة في سياق نظرية استجابة العناصر (IRT) ونظرية الاختبار الكلاسيكية (CTT). يعرفون درجة المجموع، \(X^+ = \sum_{j=1}^{J} X_j\)، حيث \(X_j\) تمثل الدرجة على العنصر \(j\)، ويؤكدون أنه بينما تسمح CTT بتفكيك درجات العناصر إلى درجات حقيقية \(T_j\) ومكونات خطأ \(E_j\)، إلا أنها لا تفرض قيودًا على الأبعاد. بالمقابل، تقدم IRT مثل هذه القيود، مما يعزز الفهم لكيفية ارتباط درجات العناصر بالسمات الكامنة الأساسية. يبرز المؤلفون التطور التاريخي لهذه النظريات، مشيرين إلى أنه بينما اكتسبت IRT شهرة منذ الثمانينيات، تظل درجة المجموع في CTT مستخدمة على نطاق واسع، خاصة في بناء الاختبارات العملية.

يتوسع المؤلفون أكثر في أهمية درجة المجموع في تقدير المتغيرات الكامنة، خاصة من خلال نتائج غرايسون التي تؤسس نسبة احتمال أحادية بين درجة المجموع والمتغير الكامن للعناصر الثنائية. تشير هذه الخاصية إلى أنه مع زيادة درجة المجموع، تزداد أيضًا المتوسط الشرطي للمتغير الكامن، مما يعزز فائدة درجة المجموع كتمثيل ترتيبي قوي للمتغير الكامن. يجادلون بأن درجة المجموع لا ينبغي أن تُهمل من قبل علماء النفس، حيث تحتفظ بخصائص قياس مهمة عبر نماذج IRT المختلفة، بما في ذلك نموذج التجانس الأحادي. في النهاية، يؤكد المؤلفون أن درجة المجموع ليست مجرد مقياس بسيط، بل تمثيل دفاعي وبديهي للسمات الكامنة، مدعوم بنظرية نفسية واسعة.

Journal: Psychometrika, Volume: 89, Issue: 1
DOI: https://doi.org/10.1007/s11336-024-09964-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38627311
Publication Date: 2024-03-01
Author(s): Klaas Sijtsma et al.
Primary Topic: Mental Health Research Topics

Overview

In this section, the authors defend the continued relevance of sum scores in psychometric practice, countering the view held by some psychometricians that these scores should be replaced by latent variable models. They assert that sum scores effectively stochastically order latent variables across various item response models, supported by item response theory, which provides a mathematical rationale for their ordinal application.

Furthermore, the authors address concerns regarding the reliability and estimation of sum scores, demonstrating that classical test theory offers a set of lower bounds for reliability that are often close to true values under reasonable assumptions. They emphasize that the utility of sum scores lies in their ability to predict relevant behaviors and outcomes. While acknowledging the merits of modern measurement models, the authors argue that classical test theory remains valuable due to its simplicity and practical applicability, which have been overlooked in recent discussions.

Discussion

In this section, the authors discuss the relationship between the sum score and latent variables within the context of Item Response Theory (IRT) and Classical Test Theory (CTT). They define the sum score, \(X^+ = \sum_{j=1}^{J} X_j\), where \(X_j\) represents the score on item \(j\), and emphasize that while CTT allows for the decomposition of item scores into true scores \(T_j\) and error components \(E_j\), it does not impose dimensionality restrictions. In contrast, IRT introduces such restrictions, enhancing the understanding of how item scores relate to underlying latent traits. The authors highlight the historical evolution of these theories, noting that while IRT has gained prominence since the 1980s, CTT’s sum score remains widely utilized, particularly in practical test construction.

The authors further elaborate on the significance of the sum score in estimating latent variables, particularly through Grayson’s findings that establish a monotone likelihood ratio between the sum score and the latent variable for binary items. This property implies that as the sum score increases, the conditional mean of the latent variable also increases, thereby reinforcing the sum score’s utility as a robust ordinal approximation of the latent variable. They argue that the sum score should not be dismissed by psychometricians, as it retains important measurement properties across various IRT models, including the monotone homogeneity model. Ultimately, the authors assert that the sum score is not merely a simplistic measure but a defensible and intuitive representation of latent traits, supported by extensive psychometric theory.

كلمات مفتاحية: إحصائيات، إعادة إنتاجية النتائج، اختبار (علم الأحياء)، اقتصاديات القياس، العمومية، المتغير الكامن، بشر، بيانات ترتيبية، تنوع (سيبرنتيك)، رياضيات، علم النفس، علم النفس القياسي، علوم الحاسوب، قيمة (رياضيات)، موثوقية (أشباه الموصلات)، نظرية استجابة العنصر، نظرية الاختبار، نظرية الاختبار الكلاسيكية، نماذج، إحصائية