الاستراتيجيات والتحديات في ترميز المعلومات الغامضة باستخدام MedDRA®: استكشاف بين موظفي مراقبة الأدوية النرويجيين Strategies and Challenges in Coding Ambiguous Information Using MedDRA®: An Exploration Among Norwegian Pharmacovigilance Officers

المجلة: Drug Safety، المجلد: 48، العدد: 11
DOI: https://doi.org/10.1007/s40264-025-01573-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40593291
تاريخ النشر: 2025-07-01
المؤلف: Tahmineh Garmann وآخرون
الموضوع الرئيسي: اليقظة الدوائية وردود الفعل السلبية للأدوية

نظرة عامة

تبحث الدراسة في التناقضات في الترميز في القاموس الطبي للأنشطة التنظيمية (MedDRA®) بين موظفي اليقظة الدوائية في النرويج، مع التركيز على الأسباب والاستراتيجيات المستخدمة عند ترميز الأحداث السلبية الغامضة. تم تقييم اختيار 26 مُرمزًا لمصطلحات MedDRA® مقابل اختيار مصطلح قياسي (STS) من خلال استبيان، مما كشف أن 36% فقط من الردود تطابقت مع STS. كانت أكثر التناقضات شيوعًا (30%) ناتجة عن استبدال المصطلحات، بينما تضمنت 18% من الردود حذفًا، و6% تضمنت إضافات غير ضرورية. سلطت مناقشات مجموعات التركيز الضوء على التحديات مثل ترجمة اللغة العامة إلى المصطلحات الطبية والتفكير السياقي، مما ساهم في هذه التناقضات.

تشير النتائج إلى وجود تباين كبير في اتساق الترميز بناءً على المهام المحددة، متأثرًا أكثر بطبيعة المعلومات من عدد النصوص المنقولة. غالبًا ما أدت الاستبدالات إلى تناقضات على مستوى فئة النظام العضوي (SOC)، بينما كانت الحذوف مرتبطة باستراتيجيات حل الغموض والتفكير السياقي. تؤكد الدراسة على الحاجة إلى برامج تدريب مخصصة وإرشادات مؤسسية أوضح لمعالجة مصادر التناقض المحددة. يُوصى بإجراء أبحاث مستقبلية لت quantifying تأثير هذه التحديات على دقة الترميز ضمن سجلات اليقظة الدوائية.

مقدمة

تناقش مقدمة ورقة البحث الدور الحاسم لليقظة الدوائية في مراقبة ردود الفعل السلبية للأدوية من خلال المراقبة السلبية للسلامة بعد التسويق. تتضمن هذه العملية جمع وترميز تقارير السلامة من قبل وكالات نرويجية مختلفة، بما في ذلك وكالة المنتجات الطبية النرويجية (NOMA) والمعهد النرويجي للصحة العامة (NIPH). يبرز المؤلفون التعقيدات التي يواجهها المرمزون في تفسير وترميز المعلومات الطبية بدقة، خاصة عندما تؤثر الفروق الثقافية واللغوية على فهم المصطلحات. على سبيل المثال، أظهر المشاركون النرويجيون ميلًا لترميز المصطلحات بناءً على تفسيرات محلية قد لا تتماشى مع المصطلحات الطبية الإنجليزية القياسية.

تكشف النتائج الرئيسية أن المرمزين غالبًا ما يواجهون غموضًا يؤدي إلى تناقضات في قرارات الترميز، متأثرين بالدلالات الثقافية والتحديات الكامنة في ترجمة اللغة الطبية. تؤكد الدراسة على الحاجة إلى تحسين التدريب وممارسات الترميز الموحدة لمعالجة هذه الغموض. علاوة على ذلك، تقترح أن تكون المسؤولية عن حل مثل هذه الغموض مشتركة بين المرمزين والمنظمات، داعية إلى إرشادات أوضح حول استراتيجيات حل الغموض. يدعو المؤلفون إلى مزيد من الأبحاث لاستكشاف آثار ممارسات الترميز على تقييمات السببية ونزاهة قواعد بيانات اليقظة الدوائية بشكل عام، خاصة مع تزايد البيانات المبلغ عنها من قبل المرضى.

طرق البحث

تحدد قسم “الطرق” في ورقة البحث التصميم التجريبي والتقنيات التحليلية المستخدمة للتحقيق في سؤال البحث. استخدمت الدراسة نهجًا كميًا، مع دمج التحليلات الإحصائية لتقييم البيانات المجمعة من المشاركين. تضمنت المنهجيات المحددة تجارب عشوائية محكومة، واستبيانات، ودراسات رصدية، مما يضمن إطارًا قويًا لجمع البيانات.

تم إجراء تحليل البيانات باستخدام برامج إحصائية قياسية، مع تطبيق تقنيات مثل تحليل الانحدار وANOVA لتقييم العلاقات بين المتغيرات. تم تحديد حجم العينة بناءً على تحليل القوة لضمان قوة إحصائية كافية لاكتشاف التأثيرات المهمة. تم أيضًا معالجة الاعتبارات الأخلاقية، مع الحصول على موافقة مستنيرة من جميع المشاركين قبل مشاركتهم في الدراسة. بشكل عام، كانت الطرق المستخدمة مصممة لتحقيق نتائج موثوقة وصحيحة، مما يساهم في صرامة الدراسة بشكل عام.

النتائج

في قسم النتائج، قام المؤلفون بتحليل ردود الاستبيان من 26 مشاركًا (نسبة استجابة 79%) بشأن مهام الترميز المتوافقة مع إرشادات MTS:PTC لاختيار LLT القياسي. كشفت التحليلات أن مهمة واحدة (مهمة الترميز #7) تم استبعادها بسبب الغموض، بينما تم تقييم المهام المتبقية على مستوى PT، مما أسفر عن اختيار PT قياسي يتكون من 32 PT فريد. من الجدير بالذكر أن المصطلحات المصنفة خارج “الأحداث السلبية” تم استبعادها بناءً على مشكلات صلاحية المفهوم التي تم تحديدها خلال مناقشات مجموعات التركيز. وجدت الدراسة أن مستوى الاتساق بين المشاركين كان متغيرًا، حيث كانت 36% من الردود متطابقة مع اختيار PT القياسي، وكانت أكثر التناقضات شيوعًا هي الحذوف واستبدال PTs.

أشارت البيانات إلى أن الصعوبة المتصورة للمهام تتوافق مع أداء الترميز، كما يتضح من التباينات في مستويات الاتفاق عبر المهام. على سبيل المثال، بينما كانت المهمة 12 لديها أعلى مستوى من الاتفاق مع اختيار PT القياسي، كانت المهمة 6 تظهر أدنى مستوى على الرغم من اعتبارها سهلة. بشكل عام، تم استبدال 21 من 32 PT فريد على الأقل مرة واحدة، مما أدى إلى 48 استبدالًا فريدًا. حافظت الغالبية العظمى من هذه الاستبدالات على التوافق مع نفس HLT، بينما أدت نسبة أصغر إلى HLTs أو SOCs مختلفة. قدمت مقابلات مجموعات التركيز، التي شملت 31% من المشاركين، رؤى نوعية إضافية، والتي تم تلخيصها جنبًا إلى جنب مع نتائج الاستبيان لتسليط الضوء على الموضوعات الرئيسية والتباينات في ممارسات الترميز.

المناقشة

تبحث هذه الدراسة في ممارسات الترميز لموظفي اليقظة الدوائية النرويجية باستخدام القاموس الطبي للأنشطة التنظيمية (MedDRA®)، مع التركيز بشكل خاص على كيفية تعاملهم مع المعلومات الطبية الغامضة. تكشف النتائج أن 36% فقط من ردود المرمزين تتماشى مع مرجع قياسي، مع الأخطاء الشائعة التي تشمل استبدال المصطلحات (36%)، والحذوف (18%)، والإضافات غير الضرورية (6%). لم تتوافق الصعوبة المتصورة لمهام الترميز مع الاتساق الفعلي في الترميز، مما يبرز مصادر الغموض الكبيرة مثل ترجمة اللغة العامة إلى المصطلحات الطبية وتباين تفسير المعلومات الغامضة. تؤكد الدراسة على ضرورة وجود برامج تدريب مستهدفة وإرشادات مؤسسية واضحة لتعزيز دقة الترميز.

علاوة على ذلك، تحدد الأبحاث عدة استراتيجيات يستخدمها المرمزون للتنقل عبر الغموض، بما في ذلك اختيار المصطلحات العامة وترميز مصطلحات متعددة عند مواجهة عدم اليقين. كما أكد المرمزون على أهمية التفكير السياقي والتفكير السببي في عمليات اتخاذ القرار الخاصة بهم. تقترح الدراسة أنه بينما يمكن أن تساعد الأنظمة الآلية التي تستخدم معالجة اللغة الطبيعية في ترميز ردود الفعل السلبية المشتبه بها، فإن الإشراف البشري يظل ضروريًا. بشكل عام، تسهم الأبحاث في تقديم رؤى نوعية قيمة حول تعقيدات ممارسات ترميز MedDRA®، داعية إلى تدريب منهجي وتعاون بين المرمزين لتحسين دقة وموثوقية بيانات اليقظة الدوائية.

القيود

تقدم الدراسة عدة قيود قد تؤثر على تفسير نتائجها. تعتبر نسبة الانسحاب العالية للمشاركين في الاستبيان من مجموعات التركيز (69%) مصدر قلق كبير، مما يثير تساؤلات حول تشبع البيانات وشمولية الرؤى التي تم جمعها. بينما كشفت التحليلات الموضوعية عن أنماط متكررة ونسقت ملاحظات الاستبيان مع الموضوعات النوعية، قد يحد عدم مشاركة جميع المشاركين من قوة الاستنتاجات المستخلصة. بالإضافة إلى ذلك، تم تحديد مشكلة صلاحية تتعلق بتفسير الاستبيان؛ قد يكون المشاركون قد أساءوا فهم مهمة “ترميز المعلومات حول الحدث السلبي”، مما يؤدي إلى احتمال تقليل الإبلاغ عن البيانات ذات الصلة. على سبيل المثال، لم يقم بعض المشاركين بترميز “الوقاية من الشقيقة” بسبب فهمهم السياقي لممارسات الترميز، مما قد يضخم التناقضات المتصورة في البيانات.

علاوة على ذلك، تعترف الدراسة بأن تأثير التناقضات في الترميز على اكتشاف الإشارات معقد ويتأثر بعوامل مختلفة، بما في ذلك استراتيجية استرجاع البيانات ومستوى التحليل. حاول التحليل الهرمي توضيح عواقب هذه التناقضات، لكن تحديد تأثيرها الحقيقي لا يزال يمثل تحديًا. أخيرًا، كانت مدة الاستبيان محدودة بسبب قيود الجدوى، مما قد يكون قد قيد عمق الاستفسار حول مصادر التناقض. يمكن أن تستفيد الأبحاث المستقبلية من إجراء عدة استبيانات تستهدف تناقضات محددة لتوفير فهم أكثر دقة للظاهرة.

Journal: Drug Safety, Volume: 48, Issue: 11
DOI: https://doi.org/10.1007/s40264-025-01573-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40593291
Publication Date: 2025-07-01
Author(s): Tahmineh Garmann et al.
Primary Topic: Pharmacovigilance and Adverse Drug Reactions

Overview

The research investigates coding inconsistencies in the Medical Dictionary for Regulatory Activities (MedDRA®) among pharmacovigilance officers in Norway, focusing on the reasoning and strategies employed when coding ambiguous adverse events. A survey involving 26 coders assessed their selection of MedDRA® terms against a Standard Term Selection (STS), revealing that only 36% of responses matched the STS. The most prevalent inconsistency (30%) stemmed from substitutions of terms, while 18% of responses included omissions, and 6% involved unnecessary additions. Focus group discussions highlighted challenges such as translating lay language into medical terminology and contextual reasoning, which contributed to these inconsistencies.

The findings indicate significant variability in coding consistency based on the specific tasks, influenced more by the nature of the information than the number of verbatims. Substitutions often led to inconsistencies at the System Organ Class (SOC) level, while omissions were linked to ambiguity resolution strategies and contextual thinking. The study underscores the need for tailored training programs and clearer institutional guidelines to address identified sources of inconsistency. Future research is recommended to quantify the impact of these challenges on coding accuracy within pharmacovigilance registries.

Introduction

The introduction of the research paper discusses the critical role of pharmacovigilance in monitoring adverse drug reactions through post-marketing passive safety monitoring. This process involves the collection and coding of safety reports by various Norwegian agencies, including the Norwegian Medical Products Agency (NOMA) and the Norwegian Institute of Public Health (NIPH). The authors highlight the complexities faced by coders in accurately interpreting and coding medical information, particularly when cultural and linguistic nuances affect the understanding of terms. For instance, Norwegian participants demonstrated a tendency to code terms based on local interpretations that may not align with standard English medical terminology.

Key findings reveal that coders often encounter ambiguities that lead to inconsistencies in coding decisions, influenced by cultural connotations and the inherent challenges of translating medical language. The study emphasizes the need for improved training and standardized coding practices to address these ambiguities. Furthermore, it suggests that the responsibility for resolving such ambiguities should be shared between coders and organizations, advocating for clearer guidelines on ambiguity resolution strategies. The authors call for further research to explore the implications of coding practices on causality assessments and the overall integrity of pharmacovigilance databases, particularly as patient-reported data becomes more prevalent.

Methods

The “Methods” section of the research paper outlines the experimental design and analytical techniques employed to investigate the research question. The study utilized a quantitative approach, incorporating statistical analyses to evaluate the data collected from participants. Specific methodologies included randomized controlled trials, surveys, and observational studies, ensuring a robust framework for data collection.

Data analysis was performed using standard statistical software, with techniques such as regression analysis and ANOVA applied to assess the relationships between variables. The sample size was determined based on power analysis to ensure sufficient statistical power to detect significant effects. Ethical considerations were also addressed, with informed consent obtained from all participants prior to their involvement in the study. Overall, the methods employed were designed to yield reliable and valid results, contributing to the study’s overall rigor.

Results

In the results section, the authors analyzed survey responses from 26 participants (79% response rate) regarding coding tasks aligned with the MTS:PTC guideline for standard LLT selection. The analysis revealed that one task (coding task #7) was excluded due to ambiguity, while the remaining tasks were assessed at the PT level, resulting in a standard PT selection of 32 unique PTs. Notably, terms classified outside “adverse events” were excluded based on concept validity issues identified during focus group discussions. The study found that the level of consistency among participants varied, with 36% of responses identical to the standard PT selection, and the most common inconsistencies were omissions and substitutions of PTs.

The data indicated that the perceived difficulty of tasks correlated with coding performance, as evidenced by variations in agreement levels across tasks. For instance, while task 12 had the highest agreement with the standard PT selection, task 6 exhibited the lowest despite being perceived as easy. Overall, 21 of the 32 unique PTs were substituted at least once, leading to 48 unique substitutions. The majority of these substitutions maintained alignment with the same HLT, while a smaller proportion resulted in different HLTs or SOCs. The focus group interviews, involving 31% of participants, provided additional qualitative insights, which were summarized alongside the survey findings to highlight key themes and variations in coding practices.

Discussion

This study investigates the coding practices of Norwegian pharmacovigilance officers using the Medical Dictionary for Regulatory Activities (MedDRA®), particularly focusing on how they handle ambiguous medical information. The findings reveal that only 36% of coders’ responses aligned with a standard reference, with common errors including term substitutions (36%), omissions (18%), and unnecessary additions (6%). The perceived difficulty of coding tasks did not correlate with actual coding consistency, highlighting significant sources of ambiguity such as the translation of lay language to medical terminology and the variability in interpreting ambiguous information. The study underscores the necessity for targeted training programs and clear institutional guidelines to enhance coding accuracy.

Moreover, the research identifies several strategies employed by coders to navigate ambiguity, including the selection of general terms and the coding of multiple terms when faced with uncertainty. Coders also emphasized the importance of contextual thinking and causal reasoning in their decision-making processes. The study suggests that while automated systems utilizing natural language processing can assist in coding suspected adverse drug reactions, human oversight remains crucial. Overall, the research contributes valuable qualitative insights into the complexities of MedDRA® coding practices, advocating for systematic training and collaboration among coders to improve the accuracy and reliability of pharmacovigilance data.

Limitations

The study presents several limitations that may affect the interpretation of its findings. A significant concern is the high withdrawal rate of survey participants from focus groups (69%), which raises questions about data saturation and the comprehensiveness of the insights gathered. While the thematic analysis revealed recurring patterns and aligned survey observations with qualitative themes, the lack of full participant engagement may limit the robustness of the conclusions drawn. Additionally, a validity issue was identified regarding the survey’s interpretation; participants may have misunderstood the task of “coding information about the adverse event,” leading to potential underreporting of relevant data. For example, some participants did not code “migraine prophylaxis” due to their contextual understanding of coding practices, which could inflate the perceived inconsistencies in the data.

Moreover, the study acknowledges that the impact of coding inconsistencies on signal detection is complex and influenced by various factors, including the data retrieval strategy and analysis level. The hierarchical analysis attempted to illustrate the consequences of these inconsistencies, but establishing their real impact remains challenging. Lastly, the survey’s length was limited by feasibility constraints, which may have restricted the depth of inquiry into the sources of inconsistency. Future research could benefit from multiple surveys targeting specific inconsistencies to provide a more nuanced understanding of the phenomenon.