تحديد حجم العينة المناسب للمقابلات النوعية لتحقيق تشبع الشيفرة الحقيقي والقريب: تحليل ثانوي للبيانات Determining an Appropriate Sample Size for Qualitative Interviews to Achieve True and Near Code Saturation: Secondary Analysis of Data

المجلة: Journal of Medical Internet Research، المجلد: 26
DOI: https://doi.org/10.2196/52998
PMID: https://pubmed.ncbi.nlm.nih.gov/38980711
تاريخ النشر: 2024-05-09
المؤلف: Claudia Squire وآخرون
الموضوع الرئيسي: مجموعات التركيز والطرق النوعية

نظرة عامة

تستكشف هذه الدراسة عدد المقابلات الافتراضية اللازمة لتحقيق التشبع الحقيقي في الترميز أو التشبع القريب في البحث النوعي، وخاصة في سياق المقابلات المتعمقة. استنادًا إلى بيانات من خمس دراسات ممولة من إدارة الغذاء والدواء تشمل المرضى ومقدمي الرعاية الصحية، وجد الباحثون أن التشبع الحقيقي تم الوصول إليه بعد 91% إلى 100% من المقابلات المخطط لها، بينما تم تحقيق التشبع القريب بعد 33% إلى 60% من المقابلات المخطط لها (حوالي 15-23 مقابلة). تشير النتائج إلى أن الدراسات التي تستخدم الترميز الاستنتاجي وأدلة المقابلات المنظمة وصلت إلى التشبع بشكل أسرع مقارنة بتلك التي تستخدم أساليب أقل تنظيمًا.

يستنتج المؤلفون أن التشبع القريب قد يكون هدفًا كافيًا للبحث النوعي، حيث أن المقابلات الإضافية بعد هذه النقطة قد تؤدي إلى عوائد متناقصة. يؤكدون على أهمية النظر في عوامل مختلفة، مثل هيكل دليل المقابلة، ومنهجية الترميز، وخصائص السكان المدروسين، عند تحديد حجم العينة. تقترح الدراسة أن تركز الأبحاث المستقبلية على استكشاف التشبع عبر مجالات وسكان مختلفين لتطوير إرشادات مخصصة للباحثين، بدلاً من السعي إلى معيار عالمي للتشبع في الدراسات النوعية.

مقدمة

تناقش مقدمة ورقة البحث أهمية المقابلات المتعمقة كطريقة لجمع البيانات النوعية، وخاصة في فهم تصورات وسلوكيات الأفراد. تتيح هذه المقابلات، التي يجريها محاورون مدربون باستخدام أدلة منظمة أو غير منظمة، استكشافًا دقيقًا لمواضيع معقدة قد لا تلتقطها الأساليب الكمية. سرّعت جائحة COVID-19 من اعتماد أساليب جمع البيانات الافتراضية، التي أثبتت فائدتها في الوصول إلى السكان المهمشين وتعزيز التنوع الجغرافي، على الرغم من المخاوف الأولية بشأن العلاقة وجودة البيانات.

تهدف الدراسة إلى تقييم عدد المقابلات الافتراضية اللازمة لتحقيق التشبع في الترميز في خمس دراسات ممولة من إدارة الغذاء والدواء. يُعرف التشبع في الترميز بأنه النقطة التي تم فيها تطبيق جميع الرموز ذات الصلة على البيانات، حيث تميز الدراسة بين التشبع الحقيقي في الترميز (100% من الرموز) والتشبع القريب في الترميز (90% من الرموز). يبرز المؤلفون التباين في التعريفات وأساليب قياس التشبع عبر البحث النوعي، مؤكدين على الحاجة إلى إرشادات أوضح وأدلة تجريبية بشأن كفاية حجم العينة في سياقات المقابلات الافتراضية.

الطرق

تضمنت الطرق المستخدمة في هذه الدراسة تحليل بيانات من خمس مشاريع بحثية ممولة من إدارة الغذاء والدواء أجريت عبر منصات افتراضية، تحديدًا زووم وأدوبي كونكت. كان التركيز على مجموعتين رئيسيتين: المرضى الذين يعانون من حالات طبية أساسية ومقدمي الرعاية الصحية الذين يقدمون الرعاية الأولية أو المتخصصة. تضمنت كل منصة افتراضية تم استخدامها للمقابلات وظائف صوتية ومرئية، مما سهل مشاركة المحفزات البصرية خلال الجلسات. يتم تقديم ملخص لكل دراسة في الجدول 2. تم ترميز البيانات من الدراسات وإدارتها بشكل منهجي باستخدام برنامج NVivo (الإصدار 11، QSR International).

النتائج

تشير النتائج إلى أنه بينما لم يتم تحقيق التشبع الحقيقي في المقابلات النوعية حتى المقابلة الأخيرة، تم الوصول إلى التشبع القريب في وقت سابق، عادةً ما يكون أقل بقليل من منتصف فترة جمع البيانات. يشير هذا إلى أن المقابلات الإضافية التي تُجرى بعد الوصول إلى التشبع القريب قد تؤدي إلى عوائد متناقصة من حيث المعلومات الجديدة. كشفت تحليل خمس دراسات أن الدراسات التي تستخدم بشكل أساسي الترميز الاستنتاجي وصلت إلى التشبع القريب بشكل أسرع من تلك التي تحتوي على نسبة أعلى من الترميز الاستقرائي. وبالتالي، يجب على الباحثين النظر في نهجهم التحليلي عند تحديد حجم العينة، مع توصية بحجم عينة أكبر قليلاً للدراسات التي تستخدم مخططات الترميز التكرارية.

تفاوتت مدة المقابلات من 30 إلى 90 دقيقة، حيث استمرت معظمها 60 دقيقة. ومن المثير للاهتمام، أن أقصر مقابلة (30 دقيقة) تطلبت ثاني أعلى عدد من المقابلات لتحقيق التشبع، مما يشير إلى أنه بينما قد تلعب مدة المقابلة دورًا، فإن هيكل دليل المقابلة ونهج الترميز هما عوامل أكثر أهمية. تشير النتائج إلى أن عددًا أكبر من المقابلات قد يكون ضروريًا للوصول إلى مستوى مقبول من التشبع القريب مقارنة بالدراسات السابقة، التي أفادت بتحقيق تشبع كافٍ عند 6 إلى 12 مقابلة. بالإضافة إلى ذلك، فإن تضمين المشاركين من جميع مناطق مكتب التعداد السكاني الأمريكي يعزز من قابلية تعميم هذه النتائج خارج الأبحاث السابقة التي ركزت على جمع البيانات الشخصية في مواقع محدودة.

المناقشة

في قسم المناقشة من ورقة البحث، يؤكد المؤلفون على الدور الحاسم للتشبع في البحث النوعي، وخاصة في سياق جمع البيانات الافتراضية. يشيرون إلى أنه بينما يُعتبر التشبع مفهومًا معترفًا به على نطاق واسع، هناك نقص في التوافق بشأن تعريفه وقياسه، خاصة للدراسات التي تستخدم المقابلات الافتراضية. تشير النتائج إلى أن تحقيق التشبع القريب – الذي يُعرف بأنه 90% من جميع الرموز المطبقة تم استخدامها – قد يكون هدفًا عمليًا، حيث أن المقابلات الإضافية بعد هذه النقطة غالبًا ما تؤدي إلى عوائد متناقصة من الرؤى الجديدة. يبرز المؤلفون أن عوامل مختلفة، بما في ذلك هيكل دليل المقابلة، ونهج الترميز، وخصائص السكان المدروسين، تؤثر على عدد المقابلات المطلوبة للوصول إلى التشبع.

تشير الورقة أيضًا إلى أن الدراسات التي تستخدم أدلة مقابلة أكثر تنظيمًا وترميز استنتاجي تميل إلى الوصول إلى التشبع بشكل أسرع. على العكس من ذلك، قد تتطلب الدراسات التي تحتوي على أدلة أقل تنظيمًا وتعتمد على الترميز الاستقرائي أحجام عينات أكبر لتحقيق مستويات مماثلة من التشبع. يدعو المؤلفون إلى أن تستكشف الأبحاث المستقبلية التشبع عبر مجالات وسكان متنوعين، بهدف تطوير إرشادات مخصصة تعزز من دقة وقابلية تطبيق منهجيات البحث النوعي. سيساهم هذا النهج في فهم أكثر دقة للتشبع، مما يعود بالنفع في النهاية على الباحثين في تصميم دراساتهم.

القيود

تقدم الدراسة عدة قيود قد تؤثر على قابلية تعميم نتائجها. أولاً، كانت التحليلات مستندة إلى عينة من خمس دراسات فقط، جميعها كانت ضمن المجال الطبي، تشمل مشاركين ذوي معرفة مثل المرضى الذين يعانون من حالات طبية محددة ومقدمي الرعاية الصحية. تثير هذه التجانس مخاوف بشأن قابلية تطبيق النتائج على السكان الأوسع أو السياقات المختلفة.

ثانيًا، قد يكون الاعتماد على أدلة المقابلة شبه المنظمة التي تفضل نهجًا أكثر تنظيمًا قد قيد عمق الردود، حيث التزم المحاورون بشكل أساسي بالأسئلة المكتوبة بدلاً من السماح بمتابعات غير مكتوبة. علاوة على ذلك، استخدم عملية الترميز مزيجًا من الأساليب الاستنتاجية والاستقرائية، والتي، على الرغم من أنها كانت متسقة عبر الدراسات، قد تحد من تنوع الرؤى التي يمكن أن تظهر من استراتيجية ترميز أكثر مرونة. أخيرًا، نظرًا لأن جميع الدراسات تقع تحت مظلة أبحاث العلوم الاجتماعية، قد لا تكون النتائج قابلة للتحويل إلى تخصصات أخرى، مثل الاقتصاد أو البحث الطبي البحت، حيث يمكن أن تؤدي منهجيات وسياقات مختلفة إلى نتائج متميزة.

Journal: Journal of Medical Internet Research, Volume: 26
DOI: https://doi.org/10.2196/52998
PMID: https://pubmed.ncbi.nlm.nih.gov/38980711
Publication Date: 2024-05-09
Author(s): Claudia Squire et al.
Primary Topic: Focus Groups and Qualitative Methods

Overview

This study investigates the number of virtual interviews necessary to achieve true code saturation or near saturation in qualitative research, particularly in the context of in-depth interviews. Drawing on data from five Food and Drug Administration-funded studies involving patients and healthcare providers, the researchers found that true saturation was reached after 91% to 100% of planned interviews, while near saturation was achieved after 33% to 60% of planned interviews (approximately 15-23 interviews). The findings indicate that studies employing deductive coding and structured interview guides reached saturation more quickly compared to those with less structured approaches.

The authors conclude that near saturation may be an adequate target for qualitative research, as further interviews beyond this point may yield diminishing returns. They emphasize the importance of considering various factors, such as the structure of the interview guide, the coding methodology, and the characteristics of the population studied, when determining sample size. The study suggests that future research should focus on exploring saturation across different fields and populations to develop tailored guidance for researchers, rather than seeking a universal standard for saturation in qualitative studies.

Introduction

The introduction of the research paper discusses the significance of in-depth interviews as a qualitative data collection method, particularly in understanding individuals’ perceptions and behaviors. These interviews, conducted by trained interviewers using structured or unstructured guides, allow for a nuanced exploration of complex topics that quantitative methods may not capture. The COVID-19 pandemic accelerated the adoption of virtual data collection methods, which have proven advantageous in accessing marginalized populations and enhancing geographic diversity, despite initial concerns regarding rapport and data richness.

The study aims to empirically assess the number of virtual interviews necessary to achieve code saturation in five FDA-funded studies. Code saturation is defined as the point at which all relevant codes have been applied to the data, with the study distinguishing between true code saturation (100% of codes) and near code saturation (90% of codes). The authors highlight the variability in definitions and measurement approaches of saturation across qualitative research, emphasizing the need for clearer guidelines and empirical evidence regarding sample size adequacy in virtual interview contexts.

Methods

The methods employed in this study involved the analysis of data from five FDA-funded research projects conducted via virtual platforms, specifically Zoom and Adobe Connect. The focus was on two primary groups: patients with underlying medical conditions and healthcare providers delivering primary or specialty care. Each virtual platform utilized for the interviews incorporated both audio and video functionalities, facilitating the sharing of visual stimuli during the sessions. A summary of each study is presented in Table 2. Data from the studies were systematically coded and managed using NVivo software (version 11, QSR International).

Results

The results indicate that while true saturation in qualitative interviews was not achieved until the final interview, near saturation was reached earlier, typically just below the midpoint of data collection. This suggests that additional interviews conducted after reaching near saturation may yield diminishing returns in terms of new information. The analysis of five studies revealed that studies employing predominantly deductive coding reached near saturation more quickly than those with a higher proportion of inductive coding. Consequently, researchers should consider their analytic approach when determining sample size, with a recommendation for a slightly larger sample size for studies utilizing iterative coding schemes.

The length of interviews varied from 30 to 90 minutes, with most lasting 60 minutes. Interestingly, the shortest interview (30 minutes) required the second highest number of interviews to achieve saturation, indicating that while interview length may play a role, the structure of the interview guide and the coding approach are more significant factors. The findings suggest that a higher number of interviews may be necessary to reach an acceptable level of near code saturation compared to previous studies, which reported sufficient saturation at 6 to 12 interviews. Additionally, the inclusion of participants from all US Census Bureau regions enhances the generalizability of these findings beyond prior research focused on in-person data collection at limited sites.

Discussion

In the discussion section of the research paper, the authors emphasize the critical role of saturation in qualitative research, particularly in the context of virtual data collection. They note that while saturation is a widely recognized concept, there is a lack of consensus on its definition and measurement, especially for studies utilizing virtual interviews. The findings suggest that achieving near saturation—defined as 90% of all applied codes being utilized—may be a practical target, as further interviews beyond this point often yield diminishing returns in new insights. The authors highlight that various factors, including the structure of the interview guide, the coding approach, and the characteristics of the study population, influence the number of interviews required to reach saturation.

The paper also indicates that studies employing more structured interview guides and deductive coding tend to reach saturation more quickly. Conversely, those with less structured guides and a reliance on inductive coding may necessitate larger sample sizes to achieve similar levels of saturation. The authors advocate for future research to explore saturation across diverse fields and populations, aiming to develop tailored guidelines that enhance the rigor and applicability of qualitative research methodologies. This approach will contribute to a more nuanced understanding of saturation, ultimately benefiting researchers in their study designs.

Limitations

The study presents several limitations that may affect the generalizability of its findings. Firstly, the analysis was based on a sample of only five studies, all of which were situated within the medical field, involving knowledgeable participants such as patients with specific medical conditions and healthcare providers. This homogeneity raises concerns about the applicability of the results to broader populations or different contexts.

Secondly, the reliance on semistructured interview guides that favored a more structured approach may have constrained the depth of responses, as interviewers predominantly adhered to scripted questions rather than allowing for unscripted follow-ups. Furthermore, the coding process employed a combination of deductive and inductive methods, which, while consistent across studies, may limit the diversity of insights that could emerge from a more flexible coding strategy. Lastly, since all studies fall under the umbrella of social science research, findings may not be transferable to other disciplines, such as economics or pure medical research, where different methodologies and contexts could yield distinct outcomes.