DOI: https://doi.org/10.1057/s41599-024-03044-y
Publication Date: 2024-05-10
Identifying interdisciplinary emergence in the science of science: combination of network analysis and BERTopic
Abstract
Global scientific output is expanding exponentially, which calls for a better understanding of the science of science and especially of how the boundaries of scientific fields expand through processes of emergence. The present study proposes the application of embedded topic modeling techniques to identify newly emerging science via knowledge recombination activities, as evidenced through the analysis of research publication metadata. First, a dataset is constructed from metadata derived from the Web of Science Core Collection database. The dataset is then used to generate a global map representing a categorical scientific co-occurrence network. A research field is defined as interdisciplinary when multiple science categories are listed in its description. Second, the co-occurrence networks are compared between periods to determine changing patterns of influence in light of interdisciplinarity. Third, embedded topic modeling enables unsupervised association of interdisciplinary classification. We present the results of the analysis to demonstrate the emergence of global interdisciplinary sciences, and we further perform qualitative validation on the results to identify the sources of the emergent areas. Based on these results, we discuss potential applications for identifying emergence through the merging of global interdisciplinary domains.
Introduction
While research output has risen, scientific productivity, or the value derived from that output, has fallen across fields (Bloom et al. 2020). The rate of innovation has slowed because the level of specialization (Jones 2009) and the size of teams (Kozlow 2023) needed to conduct science have increased. Intertwined with specialization and team size, the costs of research and development have risen sharply, reducing the rate of science productivity (Bloom et al. 2020). Another reason is how emergence has been measured. For instance, as the volume of scientific output increases, the ability to evaluate emerging research topics decreases because canonical literature is more likely to be cited (Chu and Evans 2021). “Could we be missing fertile new paradigms because we are locked into overworked areas of study?” (Chu and Evans 2021, p.5). Moreover, could we be misidentifying where emerging value is derived from science?
Literature review
scientific literature (Kim and Chen 2015), especially through new combinations of interdisciplinary fields of science and technologies (Blei and Lafferty 2007; Eum and Maliphol 2023; Khan and Wood 2015; Lee et al. 2015). Science maps are network representations of the scientific literature that have evolved in research approaches (Chen 2006). Underlying these past approaches is an emphasis on finding radically new innovations within a specialized domain of science.
Another measure of emergent organization is fast-growing interdisciplinarity across multiple fields or technologies (Bornmann 2013; Bornmann and Marx 2014; Lee et al. 2021; Leydesdorff et al. 2013). Over time, research has become increasingly interdisciplinary (Chakraborty 2018). Research fields go through three stages: growth, maturity, and interdisciplinarity (Chakraborty 2018).
Methodology

biographical item, letter, bibliography, correction, book review, meeting abstract, or proceedings paper) and publication type (journal, book in series, or book). These criteria restrict our sample to publications written for the same purpose, maintain the quality of articles, and avoid duplication. The dataset employed here is limited to journal articles by filtering on document and publication types.
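The filtering step described above can be sketched as follows. This is a minimal illustration, not the authors' code: the field names `document_type` and `publication_type` and their values are assumptions, and an actual Web of Science export may label these fields differently.

```python
# Hypothetical field names and values; a real Web of Science export may differ.
def keep_journal_articles(records):
    """Return only records that are articles published in journals."""
    return [
        r for r in records
        if r.get("document_type") == "Article"
        and r.get("publication_type") == "Journal"
    ]


sample = [
    {"id": 1, "document_type": "Article", "publication_type": "Journal"},
    {"id": 2, "document_type": "Book Review", "publication_type": "Journal"},
    {"id": 3, "document_type": "Proceedings Paper", "publication_type": "Book in series"},
    {"id": 4, "document_type": "Article", "publication_type": "Book"},
]

# Only record 1 satisfies both conditions.
kept_ids = [r["id"] for r in keep_journal_articles(sample)]
print(kept_ids)
```

Filtering on both fields at once keeps the sample homogeneous: a record must be an article and must appear in a journal.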
Science category-topic co-occurrence network analysis
| | 2012-2014 | | | 2015-2017 | | |
| | Publication | Subject | Journal | Publication | Subject | Journal |
| LSB-TE | 68,768 | 80 | 162 | 79,112 | 81 | 175 |
| LSB-PS | 115,499 | 67 | 228 | 120,161 | 67 | 248 |
| PS-TE | 345,520 | 85 | 584 | 414,010 | 86 | 637 |
| LSB-PS-TE | 25,447 | 43 | 40 | 25,805 | 43 | 43 |



Embedded topic modeling
runs five times faster without compromising quality

Case study on interdisciplinary science in the Web of Science
that is expected to have greater impact, rather than those that are already well known. The gap between the two types of science category-topics justifies our approach of distinguishing scientific topics that are promising for the future from those that already prevail and, more importantly, suggests that a focus on emerging topics better fits the aim of this research.
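The contrast between dominant and growing category-topics can be sketched in code. The example below is a minimal illustration on toy data, not the study's pipeline: it counts category co-occurrences in two periods, then ranks categories by total co-occurrence weight in the later period ("dominant") and by the increase in that weight between periods ("growing"). Weighted degree and absolute increase are assumed stand-ins, since this section does not state which network measure the study uses, and all function names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(records):
    """Count how often each unordered pair of categories shares a publication."""
    pairs = Counter()
    for categories in records:
        for a, b in combinations(sorted(set(categories)), 2):
            pairs[(a, b)] += 1
    return pairs


def strength(pairs):
    """Weighted degree: total co-occurrence weight attached to each category."""
    totals = Counter()
    for (a, b), weight in pairs.items():
        totals[a] += weight
        totals[b] += weight
    return totals


def dominant_and_growing(early, late, top=2):
    """Rank categories by late-period strength and by growth in strength."""
    s_early = strength(cooccurrence(early))
    s_late = strength(cooccurrence(late))
    dominant = [c for c, _ in s_late.most_common(top)]
    growth = {c: s_late[c] - s_early[c] for c in s_late}
    growing = sorted(growth, key=lambda c: (-growth[c], c))[:top]
    return dominant, growing


# Toy publications: each record is the list of categories on one paper.
early = (
    [["Environmental Sciences", "Energy & Fuels"]] * 3
    + [["Environmental Sciences", "Ecology"]] * 3
)
late = (
    [["Environmental Sciences", "Energy & Fuels"]] * 4
    + [["Environmental Sciences", "Ecology"]] * 3
    + [["Forestry", "Materials Science, Textiles"]] * 3
)

dominant, growing = dominant_and_growing(early, late)
print("dominant:", dominant)
print("growing:", growing)
```

In this toy data the established categories stay on top by weight, while the categories that appear only in the later period lead the growth ranking, which is the gap the paragraph describes.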
Unsupervised classification of emerging interdisciplinary science topics
| Science | Dominant science category-topics | Growing science category-topics |
| LSB-TE | Environmental Sciences | Forestry |
| | Engineering, Environmental | Materials Science, Textiles |
| | Green & Sustainable Science & Technology | Instruments & Instrumentation |
| | Energy & Fuels | Pharmacology & Pharmacy |
| | Engineering, Chemical | Green & Sustainable Science & Technology |
| | Ecology | Medicine, Research & Experimental |
| | Public, Environmental & Occupational Health | Engineering, Environmental |
| | Radiology, Nuclear Medicine & Medical Imaging | Ecology |
| LSB-PS | Chemistry, Applied | Neurosciences |
| | Biochemistry & Molecular Biology | Health Care Sciences & Services |
| | Food Science & Technology | Immunology |
| | Chemistry, Analytical | Polymer Science |
| | Biochemical Research Methods | Paleontology |
| | Chemistry, Multidisciplinary | Microbiology |
| | Chemistry, Medicinal | Fisheries |
| PS-TE | Materials Science, Multidisciplinary | Engineering, Aerospace |
| | Physics, Applied | Green & Sustainable Science & Technology |
| | Nanoscience & Nanotechnology | Engineering, Marine |
| | Chemistry, Physical | Geography, Physical |
| | Physics, Condensed Matter | Water Resources |
| | Chemistry, Multidisciplinary | Engineering, Mechanical |
| | Engineering, Electrical & Electronic | Acoustics |
| | Energy & Fuels | Engineering, Ocean |
| | Materials Science, Coatings & Films | Automation & Control Systems |
| LSB-PS-TE | Environmental Sciences | Remote Sensing |
| | Water Resources | Imaging Science & Photographic Technology |
| | Engineering, Environmental | Geosciences, Multidisciplinary |
| | Computer Science, Interdisciplinary Applications | Crystallography |
| | Statistics & Probability | |

| | Publications | N-gram range | Number of topics | Minimum topic size |
| Growing LSB-TE science | 26,164 | | 50 ~ 1000 | 130-780 |
| Growing LSB-PS science | 10,577 | | | 50-300 |
| Growing PS-TE science | 49,042 | | | 240-1440 |
| Growing LSB-PS-TE science | 904 | | | 5-50 |
Discussion and conclusion
| Science categories | Topic | Keywords | Generated label | Number of documents |
| LSB-TE (26,164 publications) | Outlier | cutting, machining, leather, grinding, radon, asbestos, lubrication, tanning, lead, MQL | – | 272 |
| | 0 | wood, lignin, properties, strength, mechanical, bamboo, moisture, cellulose, modulus, samples | Mechanical Properties and Composition of Natural Fibrous Materials | 22,314 |
| | 1 | water, study, energy, results, environmental, waste, model, using, production, based | Sustainable Environmental Technologies and Resource Management | 1,339 |
| | 2 | patients, group, he, groups, clinical, control, cancer, expression, cells | Cancer Tumor Marker Expression in Clinical Patient Groups | 2,239 |
| LSB-PS (10,577 publications) | Outlier | brain, fNIRS, imaging, optical, neurons, cortex, cortical, neural, ratios, stimulation | – | 176 |
| | 0 | species, water, sea, early, data, late, marine, formation, climate, new | Marine Biodiversity and Climate Impact Studies | 5,129 |
| | 1 | model, data, models, proposed, methods, trial, regression, method, simulation, clinical | Clinical Trial Simulation Modeling Techniques | 397 |
| | 2 | showed, activity, properties, protein, cells, chitosan, cell, pH, acid, drug | Chitosan Bioactivity and Drug Delivery Applications | 4,785 |
| PS-TE (49,042 publications) | Outlier | forum, journal, opinions, readers, articles, speculation, editorial, provocative, foundations, founder | – | 11 |
| | 0 | adsorption, removal, membrane, pH, process, concentration, water, treatment, mg, acid | Adsorption and Membrane Processes for Water Treatment | 10,873 |
| | 1 | heat, model, flow, results, data, based, water, method, temperature, transfer | Heat Transfer Modeling and Analysis in Fluid Systems | 38,158 |
| LSB-PS-TE (905 publications) | 0 | data, study, land, area, using, flood, spatial, based, model, used | Flood Risk Assessment and Spatial Modeling | 334 |
| | 1 | binding, molecular, protein, energy, interactions, docking, structure, dynamics, molecules, results | Protein Molecule Binding and Interaction Dynamics | 570 |
Table 5: Representative articles for each emerging interdisciplinary topic.
| Science categories - emerging topic | Representative article |
| LSB-TE | |
| Mechanical Properties and Composition of Natural Fibrous Materials | 0-1: Kain et al. (2015) |
| | 0-2: Lavalette et al. (2016) |
| | 0-3: Candelier et al. (2017) |
| Sustainable Environmental Technologies and Resource Management | 1-1: Egle et al. (2015) |
| | 1-2: Palma-Rojas et al. (2017) |
| | 1-3: Harijani et al. (2017) |
| Cancer Tumor Marker Expression in Clinical Patient Groups | 2-1: Liu et al. (2017) |
| | 2-2: Liu and Li (2017) |
| | 2-3: Qi et al. (2017) |
| LSB-PS | |
| Marine Biodiversity and Climate Impact Studies | 0-1: Chen et al. (2016) |
| | 0-2: Bataille et al. (2016) |
| | 0-3: Lowery et al. (2017) |
| Clinical Trial Simulation Modeling Techniques | 1-1: French et al. (2016) |
| | 1-2: Luo et al. (2016) |
| | 1-3: Lu (2017) |
| Chitosan Bioactivity and Drug Delivery Applications | 2-1: Zhao et al. (2016) |
| | 2-2: Berah et al. (2017) |
| | 2-3: Gomes et al. (2017) |
| PS-TE | |
| Adsorption and Membrane Processes for Water Treatment | 0-1: Ahmed (2016) |
| | 0-2: Zhang et al. (2016) |
| | 0-3: Saadati et al. (2017) |
| Heat Transfer Modeling and Analysis in Fluid Systems | 1-1: Colombo and Fairweather (2016) |
| | 1-2: Wu et al. (2017) |
| | 1-3: Daabo et al. (2017) |
| LSB-PS-TE | |
| Flood Risk Assessment and Spatial Modeling | 0-1: Ding et al. (2017) |
| | 0-2: Chian and Wilkinson (2015) |
| | 0-3: Rezaei et al. (2016) |
| Protein Molecule Binding and Interaction Dynamics | 1-1: Shamim et al. (2015) |
| | 1-2: Khan et al. (2017) |
| | 1-3: Bobovská et al. (2016) |
Note: Each representative article is preceded by its topic number and index number, e.g., “O
| Interdisciplinary categories | Journal title | Number of documents | Share |
| LSB-TE | Journal of Cleaner Production | 5589 | 21.4% |
| | Environmental Science & Technology | 4524 | 17.3% |
| | Journal of Hazardous Materials | 2417 | 9.2% |
| | Biomedical Research-India | 1933 | 7.4% |
| | Ecological Engineering | 1615 | 6.2% |
| | Waste Management | 1368 | 5.2% |
| | Environmental Modelling & Software | 735 | 2.8% |
| | Environmental Progress & Sustainable Energy | 652 | 2.5% |
| | Resources, Conservation and Recycling | 541 | 2.1% |
| | Clean Technologies and Environmental Policy | 512 | 2.0% |
| LSB-PS | International Journal of Biological Macromolecules | 3347 | 31.6% |
| | Palaeogeography, Palaeoclimatology, Palaeoecology | 1257 | 11.9% |
| | Biomacromolecules | 1250 | 11.8% |
| | ICES Journal of Marine Science | 698 | 6.6% |
| | Cretaceous Research | 586 | 5.5% |
| | Marine and Freshwater Research | 501 | 4.7% |
| | Statistical Methods in Medical Research | 399 | 3.8% |
| | Journal of Water and Health | 282 | 2.7% |
| | Paleoceanography | 268 | 2.5% |
| | Food and Agricultural Immunology | 259 | 2.4% |
| PS-TE | Desalination and Water Treatment | 5622 | 11.5% |
| | Applied Thermal Engineering | 5040 | 10.3% |
| | International Journal of Heat and Mass Transfer | 3791 | 7.7% |
| | ACS Sustainable Chemistry & Engineering | 2468 | 5.0% |
| | Journal of Hydrology | 2206 | 4.5% |
| | Advances in Mechanical Engineering | 1931 | 3.9% |
| | Ocean Engineering | 1639 | 3.3% |
| | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | 1447 | 3.0% |
| | Combustion and Flame | 1093 | 2.2% |
| | Ultrasonics Sonochemistry | 1089 | 2.2% |
| LSB-PS-TE | Journal of Molecular Graphics and Modelling | 570 | 63.0% |
| | Geocarto International | 212 | 23.4% |
| | Natural Hazards Review | 115 | 12.7% |
| | Geocarto International | 8 | 0.9% |
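The percentage column in the table above is consistent with dividing each journal's document count by the number of publications in its interdisciplinary category (for example, 26,164 for LSB-TE). A minimal sketch of that arithmetic, using a hypothetical function name and two rows from the table:

```python
def journal_shares(counts, total):
    """Percentage of a category's publications contributed by each journal."""
    return {journal: round(100 * n / total, 1) for journal, n in counts.items()}


# Two LSB-TE rows from the table; 26,164 is the LSB-TE publication total.
lsb_te_counts = {
    "Journal of Cleaner Production": 5589,
    "Environmental Science & Technology": 4524,
}
shares = journal_shares(lsb_te_counts, total=26164)
print(shares)
```

The rounded values match the 21.4% and 17.3% reported for these two journals.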
Data availability
Published online: 10 May 2024
Notes
3 https://www.sbert.net/docs/pretrained_models.html.
5 Full list of science classifications: https://support.clarivate.com/ScientificandAcademicResearch/s/article/Web-of-Science-List-of-Subject-Classifications-for-All-Databases?language=en_US.
6 The following prompt was used with ChatGPT (GPT-4): I have a topic that contains scientific publications related to [“name of interdisciplinary science”]. The topic is described by the following keywords: [“list of keywords”] Based on the information above, can you give a short label for the topic?
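The labeling prompt in note 6 can be filled in programmatically for each topic. The sketch below is an illustration only: the English wording of the template is a rendering of the note and may differ from the authors' exact prompt, the function name is hypothetical, and no call to any language-model API is made.

```python
# Template wording is an assumption based on note 6; the exact original may differ.
PROMPT_TEMPLATE = (
    "I have a topic that contains scientific publications related to {science}. "
    "The topic is described by the following keywords: {keywords}. "
    "Based on the information above, can you give a short label for the topic?"
)


def build_label_prompt(science, keywords):
    """Fill the template with a science name and a topic's keyword list."""
    return PROMPT_TEMPLATE.format(science=science, keywords=", ".join(keywords))


prompt = build_label_prompt("LSB-TE", ["wood", "lignin", "bamboo"])
print(prompt)
```

Keeping the template as a single constant makes it easy to apply the same wording to every topic, so that generated labels are comparable across topics.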
References
Archambault É, Campbell D, Gingras Y, Larivière V (2009) Comparing bibliometric statistics obtained from the Web of Science and Scopus. J Am Soc Inf Sci Technol 60(7):1320-1326
Asyaky MS, Mandala R (2021) Improving the Performance of HDBSCAN on Short Text Clustering by Using Word Embedding and UMAP. Proc 2021 8th Int Conf Adv Inform Concepts Theory Appl 2021:1-6. https://doi.org/10.1109/ICAICTA53211.2021.9640285
Balcı U, Sirivianos M, Blackburn J (2023) A data-driven understanding of left-wing extremists on social media. arXiv preprint arXiv:2307.06981
Bataille CP, Watford D, Ruegg S, Lowe A, Bowen GJ (2016) Chemostratigraphic age model for the Tornillo Group: A possible link between fluvial stratigraphy and climate. Palaeogeogr Palaeoclimatol Palaeoecol 457:277-289
Berah R, Ghorbani M, Moghadamnia AA (2017) Synthesis of a smart pH responsive magnetic nanocomposite as high loading carrier of pharmaceutical agents. Int J Biol Macromol 99:731-738
Blei DM, Lafferty J (2007) A correlated topic model of science. Annals Appl Stat 1(1). https://doi.org/10.1214/07-aoas114
Bloom N, Jones CI, Van Reenen J, Webb M (2020) Are ideas getting harder to find? Am Econ Rev 110(4):1104-1144
Bobovská A, Tvaroška I, Kóňa J (2016) Using DFT methodology for more reliable predictive models: Design of inhibitors of Golgi
Bonacich P (2007) Some unique properties of eigenvector centrality. Soc Netw 29(4):555-564. https://doi.org/10.1016/j.socnet.2007.04.002
Börner K, Rouse WB, Trunfio P, Stanley HE (2018) Forecasting innovations in science, technology, and education. Proc Natl Acad Sci 115(50):12573-12581
Bornmann L (2013) What is societal impact of research and how can it be assessed? A literature survey. J Am Soc Inf Sci Technol 64(2):217-233
Bornmann L, Mutz R (2015) Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. J Assoc Inf Sci Technol 66(11):2215-2222
Boyack K, Glänzel W, Gläser J, Havemann F, Scharnhorst A, Thijs B, van Eck NJ, Velden T, Waltmann L (2017) Topic identification challenge. Scientometrics 111:1223-1224
Boyack KW (2017) Investigating the effect of global data on topic detection. Scientometrics 111(2):999-1015
Cagan R (2013) The San Francisco declaration on research assessment. Dis Models Mech 6(4):869-870
Candelier K, Hannouz S, Thévenon MF, Guibal D, Gérardin P, Pétrissans M, Collet R (2017) Resistance of thermally modified ash (Fraxinus excelsior L.) wood under steam pressure against rot fungi, soil-inhabiting micro-organisms and termites. Eur J Wood Wood Prod 75:249-262
Capra L (2024) A computational linguistic approach to study border theory at scale. ACM Trans Comput-Hum Interaction 37(4):1-23
Chakraborty T (2018) Role of interdisciplinarity in computer sciences: quantification, impact and life trajectory. Scientometrics 114:1011-1029
Chen C (2006) CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. J Am Soc Inf Sci Technol 57(3):359-377
Chen C (2017) Science mapping: a systematic review of the literature. J Data Inf Sci 2(2):1-40
Chen J, Shen SZ, Li XH, Xu YG, Joachimski MM, Bowring SA, Mu L (2016) High-resolution SIMS oxygen isotope analysis on conodont apatite from South China and implications for the end-Permian mass extinction. Palaeogeogr Palaeoclimatol Palaeoecol 448:26-38
Chian SC, Wilkinson SM (2015) Feasibility of remote sensing for multihazard analysis of landslides in Padang Pariaman during the 2009 Padang earthquake. Nat Hazards Rev 16(1):05014004
Chu JS, Evans JA (2021) Slowed canonical progress in large fields of science. Proc Natl Acad Sci 118(41):e2021636118
Colombo M, Fairweather M (2016) Accuracy of Eulerian-Eulerian, two-fluid CFD boiling models of subcooled boiling flows. Int J Heat Mass Transf 103:28-44
Curran CS, Leker J (2011) Patent indicators for monitoring convergence – examples from NFF and ICT. Technol Forecast Soc Change 78(2):256-273. https://doi.org/10.1016/j.techfore.2010.06.021
Daabo AM, Al Jubori A, Mahmoud S, Al-Dadah RK (2017) Development of three-dimensional optimization of a small-scale radial turbine for solar powered Brayton cycle application. Appl Therm Eng 111:718-733
Day GS, Schoemaker PJ (2000) Avoiding the pitfalls of emerging technologies. Calif Manag Rev 42(2):8-33
de Lima BC, Baracho RMA, Mandl T, Porto PB (2023) Reactions to science communication: discovering social network topics using word embeddings and semantic knowledge. Soc Netw Anal Min 13(1):119
Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), vol 1, pp 4171-4186
Ding Q, Chen W, Hong H (2017) Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int 32(6):619-639
Egle L, Rechberger H, Zessner M (2015) Overview and description of technologies for recovering phosphorus from municipal wastewater. Resour Conserv Recycl 105:325-346
Eisenhardt KM, Martin JA (2000) Dynamic capabilities: What are they? Strategic Manag J 21(10):1105-1121
Eum W, Maliphol S (2023) Southeast Asian catch-up through the convergence of trade structures. Asian J Technol Innov 31(2):422-446
Fagerberg J, Landström H, Martin BR (2012) Exploring the emerging knowledge base of “the knowledge society”. Res Policy 41(7):1121-1131. https://doi.org/10.1016/j.respol.2012.03.007
Fortunato S, Bergstrom CT, Börner K, Evans JA, Helbing D, Milojević S, Petersen AM, Radicchi F, Sinatra R, Uzzi B, Vespignani A, Waltman L, Wang D, Barabási AL (2018) Science of science. Science 359(6379). https://doi.org/10.1126/science.aao0185
French B, Saha-Chaudhuri P, Ky B, Cappola TP, Heagerty PJ (2016) Development and evaluation of multi-marker risk scores for clinical prognosis. Stat Methods Med Res 25(1):255-271
Glänzel W, Thijs B (2012) Using “core documents” for detecting and labelling new emerging topics. Scientometrics 91(2):399-416. https://doi.org/10.1007/s11192-011-0591-7
Glenisson P, Glänzel W, Janssens F, De Moor B (2005) Combining full text and bibliometric information in mapping scientific disciplines. Inf Process Manag 41(6):1548-1572. https://doi.org/10.1016/j.ipm.2005.03.021
Gomes S, Rodrigues G, Martins G, Henriques C, Silva JC (2017) Evaluation of nanofibrous scaffolds obtained from blends of chitosan, gelatin and polycaprolactone for skin tissue engineering. Int J Biol Macromol 102:1174-1185
Griffith R, Redding S, Van Reenen J (2004) Mapping the two faces of R&D: Productivity growth in a panel of OECD industries. Rev Econ Stat 86(4):883-895
Grootendorst M (2022) BERTopic: Neural topic modeling with a class-based TF-IDF procedure. http://arxiv.org/abs/2203.05794
Harijani AM, Mansour S, Karimi B, Lee CG (2017) Multi-period sustainable and integrated recycling network for municipal solid waste - A case study in Tehran. J Clean Prod 151:96-108
Heo PS, Lee DH (2019) Evolution patterns and network structural characteristics of industry convergence. Struct Change Econ Dyn 51:405-426. https://doi.org/10.1016/j.strueco.2019.02.004
Jones BF (2009) The burden of knowledge and the “death of the renaissance man”: Is innovation getting harder? Rev Econ Stud 76(1):283-317
Jung S, Segev A (2022a) Analyzing the generalizability of the network-based topic emergence identification method. Semantic Web 13(3):423-439
Jung S, Segev A (2022b) Identifying a common pattern within ancestors of emerging topics for pan-domain topic emergence prediction. Knowl Based Syst 258:110020
Kain G, Barbu MC, Richter K, Plank B, Tondi G, Petutschnigg A (2015) Use of tree bark as insulation material. For Products J 65(3-4):S16-S16
Kasperiuniene J, Briediene M, Zydziunaite V (2020) Automatic content analysis of social media short texts: scoping review of methods and tools. In: Costa AP, Reis LP, Moreira A (eds) Computer Supported Qualitative Research: New Trends on Qualitative Research (WCQR2019) 4, 89-101
Khan AM, Shawon J, Halim MA (2017) Multiple receptor conformers based molecular docking study of fluorine enhanced ethionamide with mycobacterium enoyl ACP reductase (InhA). J Mol Graph Model 77:386-398
Khan GF, Wood J (2015) Information technology management domain: emerging themes and keyword analysis. Scientometrics 105(2):959-972. https://doi.org/10.1007/s11192-015-1712-5
Kim K, Jung S, Hwang J, Hong A (2018) A dynamic framework for analyzing technology standardisation using network analysis and game theory. Technol Anal Strat Manag 30(5):540-555. https://doi.org/10.1080/09537325.2017.1340639
Kim MC, Chen C (2015) A scientometric review of emerging trends and new developments in recommendation systems. Scientometrics 104:239-263
Klavans R, Boyack KW (2011) Using global mapping to create more accurate document-level maps of research fields. J Am Soc Inf Sci Technol 62(1):1-18
Kogler DF, Essletzbichler J, Rigby DL (2017) The evolution of specialization in the EU15 knowledge space. J Econ Geogr 17(2):345-373. https://doi.org/10.1093/jeg/lbw024
Kogler DF, Whittle A, Buarque B (2022) The Science Space of Artificial Intelligence Knowledge Production. In: Kurz HD, Schütz M, Strohmaier R, Zilian SS (eds) The Routledge Handbook of Smart Technologies: An Economic and Social Perspective. Routledge, London, pp 241-268. https://doi.org/10.4324/9780429351921
Kozlow M (2023) “Disruptive” science has declined-even as papers proliferate. Nature 613:225
Kwon S, Liu X, Porter AL, Youtie J (2019) Research addressing emerging technological ideas has greater scientific impact. Res Policy 48(9):103834. https://doi.org/10.1016/j.respol.2019.103834
Larivière V, Haustein S, Börner K (2015) Long-distance interdisciplinarity leads to higher scientific impact. Plos One 10(3):e0122565
Lavalette A, Cointe A, Pommier R, Danis M, Delisée C, Legrand G (2016) Experimental design to determine the manufacturing parameters of a green-glued plywood panel. Eur J Wood Prod 74:543-551
Lee C, Kogler DF, Lee D (2019) Capturing information on technology convergence, international collaboration, and knowledge flow from patent documents: A case of information and communication technology. Inf Process Manag 56:1576-1591
Lee C, Hong S, Kim J (2021) Anticipating multi-technology convergence: a machine learning approach using patent information. Scientometrics 126(3):1867-1896. https://doi.org/10.1007/s11192-020-03842-6
Lee WS, Han EJ, Sohn SY (2015) Predicting the pattern of technology convergence using big-data technology on large-scale triadic patents. Technol Forecast Soc Change 100:317-329. https://doi.org/10.1016/j.techfore.2015.07.022
Leydesdorff L, Rafols I (2011) Indicators of the interdisciplinarity of journals: Diversity, centrality, and citations. J Informetr 5(1):87-100. https://doi.org/10.1016/j.joi.2010.09.002
Leydesdorff L, Wagner CS, Bornmann L (2018) Betweenness and diversity in journal citation networks as measures of interdisciplinarity-A tribute to Eugene Garfield. Scientometrics 114:567-592
Leydesdorff L, Wagner CS, Bornmann L (2019) Interdisciplinarity as diversity in citation patterns among journals: Rao-Stirling diversity, relative variety, and the Gini coefficient. J Informetr 13(1):255-269
Liu HQ, Li XL (2017) Effect of nursing intervention on liver cancer patients undergoing interventional therapy. Biomed Res 28(12):5285-5288
Liu D, Zhao H, Liu B, Zhang X, Ma Q (2017) Analysis on the expression level of serum MMP-7 in patients with abdominal aortic aneurysm accompanied by hypertension and clinical efficacy of endovascular graft exclusion. Biomed Res 28(3)
Lowery CM, Cunningham R, Barrie CD, Bralower T, Snedden JW (2017) The northern Gulf of Mexico during OAE2 and the relationship between water depth and black shale development. Paleoceanography 32(12):1316-1335
Lu T (2017) Bayesian nonparametric mixed-effects joint model for longitudinal-competing risks data analysis in presence of multiple data features. Stat Methods Med Res 26(5):2407-2423
Luo S, Lawson AB, He B, Elm JJ, Tilley BC (2016) Bayesian multiple imputation for missing multivariate longitudinal data from a Parkinson’s disease clinical trial. Stat Methods Med Res 25(2):821-837
Lyutov A, Uygun Y, Hütt MT (2021) Machine learning misclassification of academic publications reveals non-trivial interdependencies of scientific disciplines. Scientometrics 126(2):1173-1186. https://doi.org/10.1007/s11192-020-03789-8
MacKay DJ (2003) Information theory, inference and learning algorithms. Cambridge University Press
Mane KK, Börner K (2004) Mapping topics and topic bursts in PNAS. Proc Natl Acad Sci USA 101(Suppl 1):5287-5290. https://doi.org/10.1073/pnas.0307626100
McInnes L, Healy J, Melville J (2016) UMAP: Uniform manifold approximation and projection for dimension reduction. http://arxiv.org/abs/1802.03426
Mejia C, Kajikawa Y (2020) Emerging topics in energy storage based on a large-scale analysis of academic articles and patents. Appl Energy 263:114625. https://doi.org/10.1016/j.apenergy.2020.114625
Newman D, Bonilla EV, Buntine W (2011) Improving topic coherence with regularized topic models. Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011:1-9
Palma-Rojas S, Caldeira-Pires A, Nogueira JM (2017) Environmental and economic hybrid life cycle assessment of bagasse-derived ethanol produced in Brazil. Int J Life Cycle Assess 22:317-327
Petersen AM, Ahmed ME, Pavlidis I (2021) Grand challenges and emergent modes of convergence science. Human Soc Sci Commun 8(1):1-15
Qian Y, Härdle WK, Chen C (2017) Industry Interdependency Dynamics in a Network Context. SFB 649 Discussion Paper 2017-012, Humboldt University of Berlin. https://doi.org/10.2139/ssrn.2961703
Qi Y, Hao S, Zhang J, Zhao C, Lian Y (2017) Effects of comprehensive nursing on the pain and joint functional recovery of patients with hip replacements. Biomed Res India 28:12
Rafols I, Meyer M (2010) Diversity and network coherence as indicators of interdisciplinarity: case studies in bionanoscience. Scientometrics 82(2):263-287. https://doi.org/10.1007/s11192-009-0041-y
Rafols I, Porter AL, Leydesdorff L (2010) Science overlay maps: A new tool for research policy and library management. J Am Soc Inf Sci Technol 61(9):1871-1887
Rapach DE, Strauss JK, Tu J, Zhou G (2015) Industry interdependencies and cross-industry return predictability. Working paper 12-2015. Singapore Management University, Lee Kong Chian School of Business
Rey-Martí A, Ribeiro-Soriano D, Palacios-Marqués D (2016) A bibliometric analysis of social entrepreneurship. J Bus Res 69(5):1651-1655. https://doi.org/10.1016/j.jbusres.2015.10.033
Rotolo D, Hicks D, Martin BR (2015) What is an emerging technology? Res Policy 44(10):1827-1843
Saadati F, Rahmani M, Ghahramani F, Piri F, Shayani-Jam H, Yaftian MR (2017) Synthesis of a novel ion-imprinted polyaniline/hyper-cross-linked polystyrene nanocomposite for selective removal of lead (II) ions from aqueous solutions. Desalination Water Treat 82:210-218
Samsir S, Saragih RS, Subagio S, Aditiya R, Watrianthos R (2023) BERTopic modeling of natural language processing abstracts: Thematic structure and trajectory. J Media Inform Budidarma 7(3):1514-1520
Schumpeter JA (1942) Capitalism, socialism and democracy. Harper and Row, New York
Schumpeter JA (1934) The Theory of Economic Development. Harvard University Press
Shamim A, Abbasi SW, Azam SS (2015) Structural and dynamical aspects of Streptococcus gordonii FabH through molecular docking and MD simulations. J Mol Graph Model 60:180-196
Shin H, Kim K, Kogler DF (2022) Scientific collaboration, research funding, and novelty in scientific knowledge. PLoS ONE 17(7):e0271678. https://doi.org/10.1371/journal.pone.0271678
Small H, Boyack KW, Klavans R (2014) Identifying emerging topics in science and technology. Res Policy 43(8):1450-1467
Song CH, Han JW, Jeong B, Yoon J (2017) Mapping the patent landscape in the field of personalized medicine. J Pharm Innov 12(3):238-248. https://doi.org/10.1007/s12247-017-9283-z
Suominen A, Toivanen H (2016) Map of science with topic modeling: Comparison of unsupervised learning and human-assigned subject classification. J Assoc Inf Sci Technol 67(10):2464-2476. https://doi.org/10.1002/asi
Velden T, Boyack KW, Gläser J, Koopman R, Scharnhorst A, Wang S (2017) Comparison of topic extraction approaches and their results. Scientometrics 111(2):1169-1221. https://doi.org/10.1007/s11192-017-2306-1
Wang Y, Bashar MA, Chandramohan M, Nayak R (2023) Exploring topic models to discern cyber threats on Twitter: A case study on Log4Shell. Intell Syst Appl 20:200280
Wang Z, Chen J, Chen J, Chen H (2023) Identifying interdisciplinary topics and their evolution based on BERTopic. Scientometrics. https://doi.org/10.1007/s11192-023-04776-5
West JD, Jensen MC, Dandrea RJ, Gordon GJ, Bergstrom CT (2013) Author-level Eigenfactor metrics: Evaluating the influence of authors, institutions, and countries within the social science research network community. J Am Soc Inf Sci Technol 64(4):787-801
White K (2019) Publications Output: U.S. Trends and International Comparisons. NSB-2020-6. https://ncses.nsf.gov/pubs/nsb20206/
Winnink JJ, Tijssen RJW, van Raan AFJ (2019) Searching for new breakthroughs in science: How effective are computerised detection algorithms? Technol Forecast Soc Change 146:673-686. https://doi.org/10.1016/j.techfore.2018.05.018
Wu W, Zhang S, Wang S (2017) A novel lattice Boltzmann model for the solid-liquid phase change with the convection heat transfer in the porous media. Int J Heat Mass Transf 104:675-687
Xu J, Bu Y, Ding Y, Yang S, Zhang H, Yu C, Sun L (2018) Understanding the formation of interdisciplinary research from the perspective of keyword evolution: A case study on joint attention. Scientometrics 117:973-995
Xu J, Ding Y, Bu Y, Deng S, Yu C, Zou Y, Madden A (2019) Interdisciplinary scholarly communication: an exploratory study for the field of joint attention. Scientometrics 119:1597-1619
Yau CK, Porter A, Newman N, Suominen A (2014) Clustering scientific documents with topic modeling. Scientometrics 100(3):767-786. https://doi.org/10.1007/s11192-014-1321-8
Zahedi Z, van Eck NJ (2018) Exploring topics of interest of Mendeley users. J Altmetrics 1(1):1-12. https://doi.org/10.29024/joa.7
Zhang J, Zhang G, Zhou Q, Ou L (2016) Thermodynamics, kinetics and isotherm studies on the removal of methylene blue from aqueous solution by calcium alginate. J Water Reuse Desalination 6(2):301-309
Zhao YM, Wang J, Wu ZG, Yang JM, Li W, Shen LX (2016) Extraction, purification and anti-proliferative activities of polysaccharides from Lentinus edodes. Int J Biol Macromol 93:136-144
Acknowledgements
No. 17/SPR/5324, SciTechSpace). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author contributions
Competing interests
Ethical approval
Informed consent

© The Author(s) 2024
School of Applied Artificial Intelligence, Handong Global University, Pohang, South Korea. Spatial Dynamics Lab, School of Architecture, Planning and Environmental Policy and Centre for Data Analytics, University College Dublin, Dublin, Ireland. Department of Technology and Society, State University of New York, Songdo, South Korea. Email: sira.maliphol@sunykorea.ac.kr
DOI: https://doi.org/10.1057/s41599-024-03044-y
Publication Date: 2024-05-10
Identifying interdisciplinary emergence in the science of science: combination of network analysis and BERTopic
Abstract
Global scientific output is expanding exponentially, which in turn calls for a better understanding of the science of science and especially how the boundaries of scientific fields expand through processes of emergence. The present study proposes the application of embedded topic modeling techniques to identify new emerging science via knowledge recombination activities as evidenced through the analysis of research publication metadata. First, a dataset is constructed from metadata derived from the Web of Science Core Collection database. The dataset is then used to generate a global map representing a categorical scientific co-occurrence network. A research field is defined as interdisciplinary when multiple science categories are listed in its description. Second, the co-occurrence networks are subsequently compared between periods to determine changing patterns of influence in light of interdisciplinarity. Third, embedded topic modeling enables unsupervised association of interdisciplinary classification. We present the results of the analysis to demonstrate the emergence of global interdisciplinary sciences and further we perform qualitative validation on the results to identify what the sources of the emergent areas are. Based on these results, we discuss potential applications for identifying emergence through the merging of global interdisciplinary domains.
Introduction
While research output has risen, scientific productivity, or the value derived from that output, has fallen across fields (Bloom et al. 2020). The rate of innovation has slowed because the level of specialization (Jones 2009) and the size of teams (Kozlow 2023) needed to conduct science have increased. Intertwined with specialization and team size, the costs of research and development have risen sharply, reducing the rate of science productivity (Bloom et al. 2020). Another reason is how emergence has been measured. For instance, as the volume of scientific output increases, the ability to evaluate emerging research topics decreases because canonical literature is more likely to be cited (Chu and Evans 2021). “Could we be missing fertile new paradigms because we are locked into overworked areas of study?” (Chu and Evans 2021, p.5). Moreover, could we be misidentifying where emerging value is derived from science?
Literature review
scientific literature (Kim and Chen 2015), especially through new combinations of interdisciplinary fields of science and technologies (Blei and Lafferty 2007; Eum and Maliphol 2023; Khan and Wood 2015; Lee et al. 2015). Science maps are network representations of the scientific literature that have evolved in research approaches (Chen 2006). Underlying these past approaches is an emphasis on finding radically new innovations within a specialized domain of science.
Another measure of emergent organization is fast-growing interdisciplinarity across multiple fields or technologies (Bornmann 2013; Bornmann and Marx 2014; Lee et al. 2021; Leydesdorff et al. 2013). Over time, research has become increasingly interdisciplinary (Chakraborty 2018). Research fields go through three stages: growth, maturity, and interdisciplinarity (Chakraborty 2018).
Methodology

biographical item, letter, bibliography, correction, book review, meeting abstract, or proceedings paper) and publication type (journal, book in series, or book). These criteria allow us to restrict our sample to publications that are written for the same purpose, to maintain the quality of articles, and to avoid duplication. The dataset employed here is limited to journal articles by filtering its document and publication types.
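The filtering step described above can be sketched as follows. This is a minimal illustration, assuming each record is a dictionary with hypothetical `document_type` and `publication_type` fields; the actual field names and labels in a Web of Science export may differ.

```python
# Minimal sketch: keep only journal articles from a list of metadata records.
# The field names and the labels "Article" / "Journal" are assumptions made
# for illustration; they are not taken from the paper.

def keep_journal_articles(records):
    """Return the records that are articles published in journals."""
    return [
        r for r in records
        if r.get("document_type") == "Article"
        and r.get("publication_type") == "Journal"
    ]


sample = [
    {"id": 1, "document_type": "Article", "publication_type": "Journal"},
    {"id": 2, "document_type": "Book Review", "publication_type": "Journal"},
    {"id": 3, "document_type": "Proceedings Paper", "publication_type": "Book in Series"},
    {"id": 4, "document_type": "Article", "publication_type": "Book"},
]

kept = keep_journal_articles(sample)
print([r["id"] for r in kept])
```

Filtering on both fields at once mirrors the two criteria named in the text: a record has to pass the document-type check and the publication-type check to stay in the sample.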
Science category-subject co-occurrence network analysis
| Science category | Publications (2012-2014) | Subjects (2012-2014) | Journals (2012-2014) | Publications (2015-2017) | Subjects (2015-2017) | Journals (2015-2017) |
| LSB-TE | 68,768 | 80 | 162 | 79,112 | 81 | 175 |
| LSB-PS | 115,499 | 67 | 228 | 120,161 | 67 | 248 |
| PS-TE | 345,520 | 85 | 584 | 414,010 | 86 | 637 |
| LSB-PS-TE | 25,447 | 43 | 40 | 25,805 | 43 | 43 |
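A co-occurrence network of this kind, and its comparison between periods, can be sketched in a few lines. The example below is only an illustration under stated assumptions: each publication is represented as a list of subject labels, the toy records are invented, and the helper names `cooccurrence_edges` and `edge_change` are hypothetical. It counts how often two subjects are listed on the same publication and then reports how each edge weight changes from one period to the next.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(records):
    """Count how often each pair of subjects appears on the same publication."""
    edges = Counter()
    for subjects in records:
        # sorted(set(...)) drops repeats and gives every pair a canonical order
        for a, b in combinations(sorted(set(subjects)), 2):
            edges[(a, b)] += 1
    return edges


def edge_change(edges_before, edges_after):
    """Difference in edge weight (after minus before) for every pair seen in either period."""
    pairs = set(edges_before) | set(edges_after)
    return {p: edges_after.get(p, 0) - edges_before.get(p, 0) for p in pairs}


# Toy records (invented for illustration): one list of subjects per publication.
period_1 = [
    ["Ecology", "Environmental Sciences"],
    ["Ecology", "Forestry"],
    ["Ecology", "Environmental Sciences", "Forestry"],
]
period_2 = [
    ["Ecology", "Forestry"],
    ["Ecology", "Forestry"],
    ["Forestry", "Remote Sensing"],
    ["Ecology", "Environmental Sciences", "Forestry"],
]

edges_1 = cooccurrence_edges(period_1)
edges_2 = cooccurrence_edges(period_2)
change = edge_change(edges_1, edges_2)

for pair, delta in sorted(change.items()):
    print(pair, delta)
```

Pairs whose weight grows between periods are candidates for closer inspection; a full analysis would typically go further and compute network measures on the weighted graph rather than reading raw edge counts.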



Embedded topic modeling
operates five times faster without compromising on quality

Case Study on Interdisciplinary Science in the Web of Science
that is expected to be more influential, rather than those that are already well-known. The gap between the two types of science category-subjects justifies our approach of distinguishing science category-subjects that are promising for the future from those that already prevail and, more importantly, indicates that focusing on emerging topics better fits the purpose of this research.
Unsupervised classification of the emergent interdisciplinary science topics
| Science | Dominant science category-subjects | Growing science category-subjects |
| LSB-TE | Environmental Sciences | Forestry |
| | Engineering, Environmental | Materials Science, Textiles |
| | Green & Sustainable Science & Technology | Instruments & Instrumentation |
| | Energy & Fuels | Pharmacology & Pharmacy |
| | Engineering, Chemical | Green & Sustainable Science & Technology |
| | Ecology | Medicine, Research & Experimental |
| | Public, Environmental & Occupational Health | Engineering, Environmental |
| | Radiology, Nuclear Medicine & Medical Imaging | Ecology |
| LSB-PS | Chemistry, Applied | Neurosciences |
| | Biochemistry & Molecular Biology | Health Care Sciences & Services |
| | Food Science & Technology | Immunology |
| | Chemistry, Analytical | Polymer Science |
| | Biochemical Research Methods | Paleontology |
| | Chemistry, Multidisciplinary | Microbiology |
| | Chemistry, Medicinal | Fisheries |
| PS-TE | Materials Science, Multidisciplinary | Engineering, Aerospace |
| | Physics, Applied | Green & Sustainable Science & Technology |
| | Nanoscience & Nanotechnology | Engineering, Marine |
| | Chemistry, Physical | Geography, Physical |
| | Physics, Condensed Matter | Water Resources |
| | Chemistry, Multidisciplinary | Engineering, Mechanical |
| | Engineering, Electrical & Electronic | Acoustics |
| | Energy & Fuels | Engineering, Ocean |
| | Materials Science, Coatings & Films | Automation & Control Systems |
| LSB-PS-TE | Environmental Sciences | Remote Sensing |
| | Water Resources | Imaging Science & Photographic Technology |
| | Engineering, Environmental | Geosciences, Multidisciplinary |
| | Computer Science, Interdisciplinary Applications | Crystallography |
| | Statistics & Probability | |

| Science category | Publications | n-gram range | Number of topics | Minimum topic size |
| Growing-science of LSB-TE | 26,164 | | 50 ~ 1000 | 130-780 |
| Growing-science of LSB-PS | 10,577 | | | 50-300 |
| Growing-science of PS-TE | 49,042 | | | 240-1440 |
| Growing-science of LSB-PS-TE | 904 | | | 5-50 |
Discussion and conclusion
| Science Categories | Topic | Keywords | Generated Labels | Number of Documents |
| LSB-TE (26,164 publications) | Outlier | cutting, machining, leather, grinding, radon, asbestos, lubrication, tanning, lead, mql | – | 272 |
| | 0 | wood, lignin, properties, strength, mechanical, bamboo, moisture, cellulose, modulus, specimens | Mechanical Properties and Composition of Natural Fibrous Materials | 22,314 |
| | 1 | water, study, energy, results, environmental, waste, model, using, production, based | Sustainable Environmental Technologies and Resource Management | 1,339 |
| | 2 | patients, group, It, groups, clinical, control, cancer, expression, cells | Cancer Biomarker Expression in Clinical Patient Groups | 2,239 |
| LSB-PS (10,577 publications) | Outlier | brain, fnirs, imaging, optical, neurons, cortex, cortical, neural, attribution, stimulation | – | 176 |
| | 0 | species, water, sea, early, data, late, marine, formation, climate, new | Marine Biodiversity and Climate Impact Studies | 5,129 |
| | 1 | model, data, models, proposed, methods, trial, regression, method, simulation, clinical | Clinical Trial Modeling and Simulation Techniques | 397 |
| | 2 | showed, activity, properties, protein, cells, chitosan, cell, ph, acid, drug | Chitosan Bioactivity and Drug Delivery Applications | 4,785 |
| PS-TE (49,042 publications) | Outlier | forum, journal, views, readership, essays, speculation, editorial, provoking, asce, founded | – | 11 |
| | 0 | adsorption, removal, membrane, ph, process, concentration, water, treatment, mg, acid | Adsorption and Membrane Processes for Water Treatment | 10,873 |
| | 1 | heat, model, flow, results, data, based, water, method, temperature, transfer | Heat Transfer Modeling and Analysis in Fluid Systems | 38,158 |
| LSB-PS-TE (905 publications) | 0 | data, study, land, area, using, flood, spatial, based, model, used | Flood Risk Assessment and Spatial Modeling | 334 |
| | 1 | binding, molecular, protein, energy, interactions, docking, structure, dynamics, molecules, results | Protein-Molecule Docking and Interaction Dynamics | 570 |
Table 5 Representative articles for each interdisciplinary emergent topic.
| Science Categories | Emergent Topic | Representative Articles |
| LSB-TE | Mechanical Properties and Composition of Natural Fibrous Materials | 0-1: Kain et al. (2015); 0-2: Lavalette et al. (2016); 0-3: Candelier et al. (2017) |
| | Sustainable Environmental Technologies and Resource Management | 1-1: Egle et al. (2015); 1-2: Palma-Rojas et al. (2017); 1-3: Harijani et al. (2017) |
| | Cancer Biomarker Expression in Clinical Patient Groups | 2-1: Liu et al. (2017); 2-2: Liu and Li (2017); 2-3: Qi et al. (2017) |
| LSB-PS | Marine Biodiversity and Climate Impact Studies | 0-1: Chen et al. (2016); 0-2: Bataille et al. (2016); 0-3: Lowery et al. (2017) |
| | Clinical Trial Modeling and Simulation Techniques | 1-1: French et al. (2016); 1-2: Luo et al. (2016); 1-3: Lu (2017) |
| | Chitosan Bioactivity and Drug Delivery Applications | 2-1: Zhao et al. (2016); 2-2: Berah et al. (2017); 2-3: Gomes et al. (2017) |
| PS-TE | Adsorption and Membrane Processes for Water Treatment | 0-1: Ahmed (2016); 0-2: Zhang et al. (2016); 0-3: Saadati et al. (2017) |
| | Heat Transfer Modeling and Analysis in Fluid Systems | 1-1: Colombo and Fairweather (2016); 1-2: Wu et al. (2017); 1-3: Daabo et al. (2017) |
| LSB-PS-TE | Flood Risk Assessment and Spatial Modeling | 0-1: Ding et al. (2017); 0-2: Chian and Wilkinson (2015); 0-3: Rizeei et al. (2016) |
| | Protein-Molecule Docking and Interaction Dynamics | 1-1: Shamim et al. (2015); 1-2: Khan et al. (2017); 1-3: Bobovská et al. (2016) |
Note: The representative articles are preceded by the topic number and a number index, e.g., “O
| Interdisciplinary categories | Journal Title | Number of documents | Share |
| LSB-TE | Journal of Cleaner Production | 5589 | 21.4% |
| | Environmental Science & Technology | 4524 | 17.3% |
| | Journal of Hazardous Materials | 2417 | 9.2% |
| | Biomedical Research-India | 1933 | 7.4% |
| | Ecological Engineering | 1615 | 6.2% |
| | Waste Management | 1368 | 5.2% |
| | Environmental Modeling & Software | 735 | 2.8% |
| | Environmental Progress & Sustainable Energy | 652 | 2.5% |
| | Resources Conservation and Recycling | 541 | 2.1% |
| | Clean Technologies and Environmental Policy | 512 | 2.0% |
| LSB-PS | International Journal of Biological Macromolecules | 3347 | 31.6% |
| | Paleogeography Paleoclimatology Paleoecology | 1257 | 11.9% |
| | Biomacromolecules | 1250 | 11.8% |
| | ICES Journal of Marine Science | 698 | 6.6% |
| | Cretaceous Research | 586 | 5.5% |
| | Marine and Freshwater Research | 501 | 4.7% |
| | Statistical Methods in Medical Research | 399 | 3.8% |
| | Journal of Water and Health | 282 | 2.7% |
| | Paleoceanography | 268 | 2.5% |
| | Food and Agricultural Immunology | 259 | 2.4% |
| PS-TE | Desalination and Water Treatment | 5622 | 11.5% |
| | Applied Thermal Engineering | 5040 | 10.3% |
| | International Journal of Heat and Mass Transfer | 3791 | 7.7% |
| | ACS Sustainable Chemistry & Engineering | 2468 | 5.0% |
| | Journal of Hydrology | 2206 | 4.5% |
| | Advances in Mechanical Engineering | 1931 | 3.9% |
| | Ocean Engineering | 1639 | 3.3% |
| | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | 1447 | 3.0% |
| | Combustion and Flame | 1093 | 2.2% |
| | Ultrasonics Sonochemistry | 1089 | 2.2% |
| LSB-PS-TE | Journal of Molecular Graphics & Modeling | 570 | 63.0% |
| | Geocarto International | 212 | 23.4% |
| | Natural Hazards Review | 115 | 12.7% |
| | Geocarto International | 8 | 0.9% |
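The Share column can be checked against the document counts. The snippet below is a minimal sketch, assuming the denominator for each category is the publication total reported for it in the topic table (for example, 26,164 for LSB-TE); the helper name `share_pct` is hypothetical.

```python
def share_pct(count, total):
    """Share of a journal's documents in the category total, in percent (one decimal)."""
    return round(100 * count / total, 1)


# Category totals as reported in the topic table.
totals = {"LSB-TE": 26164, "LSB-PS": 10577, "PS-TE": 49042}

# Top journal of each category from the journal table.
top_journals = {
    "LSB-TE": ("Journal of Cleaner Production", 5589),
    "LSB-PS": ("International Journal of Biological Macromolecules", 3347),
    "PS-TE": ("Desalination and Water Treatment", 5622),
}

for category, (journal, count) in top_journals.items():
    print(category, journal, share_pct(count, totals[category]))
```

For these three rows the computed values agree with the reported shares of 21.4%, 31.6%, and 11.5%.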
Data availability
Published online: 10 May 2024
Note
3 https://www.sbert.net/docs/pretrained_models.html.
5 Full list of science classification: https://support.clarivate.com/ScientificandAcademicResearch/s/article/Web-of-Science-List-of-Subject-Classifications-for-All-Databases?language=en_US.
6 Following prompt has been used with ChatGPT (GPT-4): I have topic that contains the scientific publications related to [“Name of Interdisciplinary Science”]. The topic is described by the following keywords: [“List of keywords”] Based on the above information, can you give a short label of the topic?
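The prompt in the note above can be filled in programmatically for each topic. The sketch below is only an illustration: the template text is copied from the note, while the function name `build_label_prompt` and the way the placeholders are substituted (plain text, keywords joined by commas) are assumptions. It builds the prompt string only and does not call any model.

```python
# Template text taken from the note; the placeholder handling is an assumption.
PROMPT_TEMPLATE = (
    "I have topic that contains the scientific publications related to {science}. "
    "The topic is described by the following keywords: {keywords} "
    "Based on the above information, can you give a short label of the topic?"
)


def build_label_prompt(science, keywords):
    """Fill the labeling prompt for one topic (hypothetical helper)."""
    return PROMPT_TEMPLATE.format(science=science, keywords=", ".join(keywords))


prompt = build_label_prompt(
    "LSB-PS-TE",
    ["binding", "molecular", "protein", "energy", "interactions",
     "docking", "structure", "dynamics", "molecules", "results"],
)
print(prompt)
```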
References
Archambault É, Campbell D, Gingras Y, Larivière V (2009) Comparing bibliometric statistics obtained from the Web of Science and Scopus. J Am Soc Inf Sci Technol 60(7):1320-1326
Asyaky MS, Mandala R (2021) Improving the Performance of HDBSCAN on Short Text Clustering by Using Word Embedding and UMAP. Proc 2021 8th Int Conf Adv Inform Concepts Theory Appl 2021:1-6. https://doi.org/10.1109/ICAICTA53211.2021.9640285
Balcı U, Sirivianos M, Blackburn J (2023) A data-driven understanding of left-wing extremists on social media. arXiv preprint arXiv:2307.06981
Bataille CP, Watford D, Ruegg S, Lowe A, Bowen GJ (2016) Chemostratigraphic age model for the Tornillo Group: A possible link between fluvial stratigraphy and climate. Palaeogeogr Palaeoclimatol Palaeoecol 457:277-289
Berah R, Ghorbani M, Moghadamnia AA (2017) Synthesis of a smart pH responsive magnetic nanocomposite as high loading carrier of pharmaceutical agents. Int J Biol Macromol 99:731-738
Blei DM, Lafferty J (2007) A correlated topic model of science. Annals Appl Stat 1(1). https://doi.org/10.1214/07-aoas114
Bloom N, Jones CI, Van Reenen J, Webb M (2020) Are ideas getting harder to find? Am Econ Rev 110(4):1104-1144
Bobovská A, Tvaroška I, Kóňa J (2016) Using DFT methodology for more reliable predictive models: Design of inhibitors of Golgi
Bonacich P (2007) Some unique properties of eigenvector centrality. Soc Netw 29(4):555-564. https://doi.org/10.1016/j.socnet.2007.04.002
Börner K, Rouse WB, Trunfio P, Stanley HE (2018) Forecasting innovations in science, technology, and education. Proc Natl Acad Sci 115(50):12573-12581
Bornmann L (2013) What is societal impact of research and how can it be assessed? A literature survey. J Am Soc Inf Sci Technol 64(2):217-233
Bornmann L, Mutz R (2015) Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. J Assoc Inf Sci Technol 66(11):2215-2222
Boyack K, Glänzel W, Gläser J, Havemann F, Scharnhorst A, Thijs B, van Eck NJ, Velden T, Waltmann L (2017) Topic identification challenge. Scientometrics 111:1223-1224
Boyack KW (2017) Investigating the effect of global data on topic detection. Scientometrics 111(2):999-1015
Cagan R (2013) The San Francisco declaration on research assessment. Dis Models Mech 6(4):869-870
Candelier K, Hannouz S, Thévenon MF, Guibal D, Gérardin P, Pétrissans M, Collet R (2017) Resistance of thermally modified ash (Fraxinus excelsior L.) wood under steam pressure against rot fungi, soil-inhabiting micro-organisms and termites. Eur J Wood Wood Prod 75:249-262
Capra L (2024) A computational linguistic approach to study border theory at scale. ACM Trans Comput-Hum Interaction 37(4):1-23
Chakraborty T (2018) Role of interdisciplinarity in computer sciences: quantification, impact and life trajectory. Scientometrics 114:1011-1029
Chen C (2006) CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. J Am Soc Inf Sci Technol 57(3):359-377
Chen C (2017) Science mapping: a systematic review of the literature. J Data Inf Sci 2(2):1-40
Chen J, Shen SZ, Li XH, Xu YG, Joachimski MM, Bowring SA, Mu L (2016) High-resolution SIMS oxygen isotope analysis on conodont apatite from South China and implications for the end-Permian mass extinction. Palaeogeogr Palaeoclimatol Palaeoecol 448:26-38
Chian SC, Wilkinson SM (2015) Feasibility of remote sensing for multihazard analysis of landslides in Padang Pariaman during the 2009 Padang earthquake. Nat Hazards Rev 16(1):05014004
Chu JS, Evans JA (2021) Slowed canonical progress in large fields of science. Proc Natl Acad Sci 118(41):e2021636118
Colombo M, Fairweather M (2016) Accuracy of Eulerian-Eulerian, two-fluid CFD boiling models of subcooled boiling flows. Int J Heat Mass Transf 103:28-44
Curran CS, Leker J (2011) Patent indicators for monitoring convergence – examples from NFF and ICT. Technol Forecast Soc Change 78(2):256-273. https://doi.org/10.1016/j.techfore.2010.06.021
Daabo AM, Al Jubori A, Mahmoud S, Al-Dadah RK (2017) Development of three-dimensional optimization of a small-scale radial turbine for solar powered Brayton cycle application. Appl Therm Eng 111:718-733
Day GS, Schoemaker PJ (2000) Avoiding the pitfalls of emerging technologies. Calif Manag Rev 42(2):8-33
de Lima BC, Baracho RMA, Mandl T, Porto PB (2023) Reactions to science communication: discovering social network topics using word embeddings and semantic knowledge. Soc Netw Anal Min 13(1):119
Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2019), vol 1, pp 4171-4186
Ding Q, Chen W, Hong H (2017) Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int 32(6):619-639
Egle L, Rechberger H, Zessner M (2015) Overview and description of technologies for recovering phosphorus from municipal wastewater. Resour Conserv Recycl 105:325-346
Eisenhardt KM, Martin JA (2000) Dynamic capabilities: What are they? Strategic Manag J 21(10):1105-1121
Eum W, Maliphol S (2023) Southeast Asian catch-up through the convergence of trade structures. Asian J Technol Innov 31(2):422-446
Fagerberg J, Landström H, Martin BR (2012) Exploring the emerging knowledge base of “the knowledge society”. Res Policy 41(7):1121-1131. https://doi.org/10.1016/j.respol.2012.03.007
Fortunato S, Bergstrom CT, Börner K, Evans JA, Helbing D, Milojević S, Petersen AM, Radicchi F, Sinatra R, Uzzi B, Vespignani A, Waltman L, Wang D, Barabási AL (2018) Science of science. Science 359(6379). https://doi.org/10.1126/science.aao0185
French B, Saha-Chaudhuri P, Ky B, Cappola TP, Heagerty PJ (2016) Development and evaluation of multi-marker risk scores for clinical prognosis. Stat Methods Med Res 25(1):255-271
Glänzel W, Thijs B (2012) Using “core documents” for detecting and labelling new emerging topics. Scientometrics 91(2):399-416. https://doi.org/10.1007/s11192-011-0591-7
Glenisson P, Glänzel W, Janssens F, De Moor B (2005) Combining full text and bibliometric information in mapping scientific disciplines. Inf Process Manag 41(6):1548-1572. https://doi.org/10.1016/j.ipm.2005.03.021
Gomes S, Rodrigues G, Martins G, Henriques C, Silva JC (2017) Evaluation of nanofibrous scaffolds obtained from blends of chitosan, gelatin and polycaprolactone for skin tissue engineering. Int J Biol Macromol 102:1174-1185
Griffith R, Redding S, Van Reenen J (2004) Mapping the two faces of R&D: Productivity growth in a panel of OECD industries. Rev Econ Stat 86(4):883-895
Grootendorst M (2022) BERTopic: Neural topic modeling with a class-based TF-IDF procedure. http://arxiv.org/abs/2203.05794
Harijani AM, Mansour S, Karimi B, Lee CG (2017) Multi-period sustainable and integrated recycling network for municipal solid waste: A case study in Tehran. J Clean Prod 151:96-108
Heo PS, Lee DH (2019) Evolution patterns and network structural characteristics of industry convergence. Struct Change Econ Dyn 51:405-426. https://doi.org/10.1016/j.strueco.2019.02.004
Jones BF (2009) The burden of knowledge and the “death of the renaissance man”: Is innovation getting harder? Rev Econ Stud 76(1):283-317
Jung S, Segev A (2022a) Analyzing the generalizability of the network-based topic emergence identification method. Semantic Web 13(3):423-439
Jung S, Segev A (2022b) Identifying a common pattern within ancestors of emerging topics for pan-domain topic emergence prediction. Knowl Based Syst 258:110020
Kain G, Barbu MC, Richter K, Plank B, Tondi G, Petutschnigg A (2015) Use of tree bark as insulation material. For Products J 65(3-4):S16-S16
Kasperiuniene J, Briediene M, Zydziunaite V (2020) Automatic content analysis of social media short texts: scoping review of methods and tools. In: Costa AP, Reis LP, Moreira A (eds) Computer Supported Qualitative Research: New Trends on Qualitative Research (WCQR2019) 4:89-101
Khan AM, Shawon J, Halim MA (2017) Multiple receptor conformers based molecular docking study of fluorine enhanced ethionamide with mycobacterium enoyl ACP reductase (InhA). J Mol Graph Model 77:386-398
Khan GF, Wood J (2015) Information technology management domain: emerging themes and keyword analysis. Scientometrics 105(2):959-972. https://doi.org/10.1007/s11192-015-1712-5
Kim K, Jung S, Hwang J, Hong A (2018) A dynamic framework for analyzing technology standardisation using network analysis and game theory. Technol Anal Strat Manag 30(5):540-555. https://doi.org/10.1080/09537325.2017.1340639
Kim MC, Chen C (2015) A scientometric review of emerging trends and new developments in recommendation systems. Scientometrics 104:239-263
Klavans R, Boyack KW (2011) Using global mapping to create more accurate document-level maps of research fields. J Am Soc Inf Sci Technol 62(1):1-18
Kogler DF, Essletzbichler J, Rigby DL (2017) The evolution of specialization in the EU15 knowledge space. J Econ Geogr 17(2):345-373. https://doi.org/10.1093/jeg/lbw024
Kogler DF, Whittle A, Buarque B (2022) The Science Space of Artificial Intelligence Knowledge Production. In: Kurz HD, Schütz M, Strohmaier R, Zilian SS (eds) The Routledge Handbook of Smart Technologies: An Economic and Social Perspective. Routledge, London, pp 241-268. https://doi.org/10.4324/9780429351921
Kozlow M (2023) “Disruptive” science has declined, even as papers proliferate. Nature 613:225
Kwon S, Liu X, Porter AL, Youtie J (2019) Research addressing emerging technological ideas has greater scientific impact. Res Policy 48(9):103834. https://doi.org/10.1016/j.respol.2019.103834
Larivière V, Haustein S, Börner K (2015) Long-distance interdisciplinarity leads to higher scientific impact. Plos One 10(3):e0122565
Lavalette A, Cointe A, Pommier R, Danis M, Delisée C, Legrand G (2016) Experimental design to determine the manufacturing parameters of a green-glued plywood panel. Eur J Wood Prod 74:543-551
Lee C, Kogler DF, Lee D (2019) Capturing information on technology convergence, international collaboration, and knowledge flow from patent documents: A case of information and communication technology. Inf Process Manag 56:1576-1591
Lee C, Hong S, Kim J (2021) Anticipating multi-technology convergence: a machine learning approach using patent information. Scientometrics 126(3):1867-1896. https://doi.org/10.1007/s11192-020-03842-6
Lee WS, Han EJ, Sohn SY (2015) Predicting the pattern of technology convergence using big-data technology on large-scale triadic patents. Technol Forecast Soc Change 100:317-329. https://doi.org/10.1016/j.techfore.2015.07.022
Leydesdorff L, Rafols I (2011) Indicators of the interdisciplinarity of journals: Diversity, centrality, and citations. J Informetr 5(1):87-100. https://doi.org/10.1016/j.joi.2010.09.002
Leydesdorff L, Wagner CS, Bornmann L (2018) Betweenness and diversity in journal citation networks as measures of interdisciplinarity: A tribute to Eugene Garfield. Scientometrics 114:567-592
Leydesdorff L, Wagner CS, Bornmann L (2019) Interdisciplinarity as diversity in citation patterns among journals: Rao-Stirling diversity, relative variety, and the Gini coefficient. J Informetr 13(1):255-269
Liu HQ, Li XL (2017) Effect of nursing intervention on liver cancer patients undergoing interventional therapy. Biomed Res 28(12):5285-5288
Liu D, Zhao H, Liu B, Zhang X, Ma Q (2017) Analysis on the expression level of serum MMP-7 in patients with abdominal aortic aneurysm accompanied by hypertension and clinical efficacy of endovascular graft exclusion. Biomed Res (0970-938X), 28(3)
Lowery CM, Cunningham R, Barrie CD, Bralower T, Snedden JW (2017) The northern Gulf of Mexico during OAE2 and the relationship between water depth and black shale development. Paleoceanography 32(12):1316-1335
Lu T (2017) Bayesian nonparametric mixed-effects joint model for longitudinal-competing risks data analysis in presence of multiple data features. Stat Methods Med Res 26(5):2407-2423
Luo S, Lawson AB, He B, Elm JJ, Tilley BC (2016) Bayesian multiple imputation for missing multivariate longitudinal data from a Parkinson’s disease clinical trial. Stat Methods Med Res 25(2):821-837
Lyutov A, Uygun Y, Hütt MT (2021) Machine learning misclassification of academic publications reveals non-trivial interdependencies of scientific disciplines. Scientometrics 126(2):1173-1186. https://doi.org/10.1007/s11192-020-03789-8
MacKay DJ (2003) Information theory, inference and learning algorithms. Cambridge University Press
Mane KK, Börner K (2004) Mapping topics and topic bursts in PNAS. Proc Natl Acad Sci USA 101(Suppl 1):5287-5290. https://doi.org/10.1073/pnas.0307626100
McInnes L, Healy J, Melville J (2016) UMAP: Uniform manifold approximation and projection for dimension reduction. http://arxiv.org/abs/1802.03426
Mejia C, Kajikawa Y (2020) Emerging topics in energy storage based on a large-scale analysis of academic articles and patents. Appl Energy 263:114625. https://doi.org/10.1016/j.apenergy.2020.114625
Newman D, Bonilla EV, Buntine W (2011) Improving topic coherence with regularized topic models. Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011:1-9
Palma-Rojas S, Caldeira-Pires A, Nogueira JM (2017) Environmental and economic hybrid life cycle assessment of bagasse-derived ethanol produced in Brazil. Int J Life Cycle Assess 22:317-327
Petersen AM, Ahmed ME, Pavlidis I (2021) Grand challenges and emergent modes of convergence science. Human Soc Sci Commun 8(1):1-15
Qian Y, Härdle WK, Chen C (2017) Industry Interdependency Dynamics in a Network Context. SFB 649 Discussion Paper 2017-012, Humboldt University of Berlin. https://doi.org/10.2139/ssrn.2961703
Qi Y, Hao S, Zhang J, Zhao C, Lian Y (2017) Effects of comprehensive nursing on the pain and joint functional recovery of patients with hip replacements. Biomed Res India 28:12
Rafols I, Meyer M (2010) Diversity and network coherence as indicators of interdisciplinarity: case studies in bionanoscience. Scientometrics 82(2):263-287. https://doi.org/10.1007/s11192-009-0041-y
Rafols I, Porter AL, Leydesdorff L (2010) Science overlay maps: A new tool for research policy and library management. J Am Soc Inf Sci Technol 61(9):1871-1887
Rapach DE, Strauss JK, Tu J, Zhou G (2015) Industry interdependencies and cross-industry return predictability. Working paper 12-2015. Singapore Management University, Lee Kong Chian School of Business
Rey-Martí A, Ribeiro-Soriano D, Palacios-Marqués D (2016) A bibliometric analysis of social entrepreneurship. J Bus Res 69(5):1651-1655. https://doi.org/10.1016/j.jbusres.2015.10.033
Rotolo D, Hicks D, Martin BR (2015) What is an emerging technology? Res Policy 44(10):1827-1843
Saadati F, Rahmani M, Ghahramani F, Piri F, Shayani-Jam H, Yaftian MR (2017) Synthesis of a novel ion-imprinted polyaniline/hyper-cross-linked polystyrene nanocomposite for selective removal of lead (II) ions from aqueous solutions. Desalination Water Treat 82:210-218
Samsir S, Saragih RS, Subagio S, Aditiya R, Watrianthos R (2023) BERTopic modeling of natural language processing abstracts: Thematic structure and trajectory. J Media Inform Budidarma 7(3):1514-1520
Schumpeter JA (1942) Capitalism, socialism and democracy. Harper and Row, New York
Schumpeter JA (1934) The Theory of Economic Development. Harvard University Press
Shamim A, Abbasi SW, Azam SS (2015) Structural and dynamical aspects of Streptococcus gordonii FabH through molecular docking and MD simulations. J Mol Graph Model 60:180-196
Shin H, Kim K, Kogler DF (2022) Scientific collaboration, research funding, and novelty in scientific knowledge. PLoS ONE 17(7):e0271678. https://doi.org/10.1371/journal.pone.0271678
Small H, Boyack KW, Klavans R (2014) Identifying emerging topics in science and technology. Res Policy 43(8):1450-1467
Song CH, Han JW, Jeong B, Yoon J (2017) Mapping the patent landscape in the field of personalized medicine. J Pharm Innov 12(3):238-248. https://doi.org/10.1007/s12247-017-9283-z
Suominen A, Toivanen H (2016) Map of science with topic modeling: Comparison of unsupervised learning and human-assigned subject classification. J Assoc Inf Sci Technol 67(10):2464-2476. https://doi.org/10.1002/asi
Velden T, Boyack KW, Gläser J, Koopman R, Scharnhorst A, Wang S (2017) Comparison of topic extraction approaches and their results. Scientometrics 111(2):1169-1221. https://doi.org/10.1007/s11192-017-2306-1
Wang Y, Bashar MA, Chandramohan M, Nayak R (2023) Exploring topic models to discern cyber threats on Twitter: A case study on Log4Shell. Intell Syst Appl 20:200280
Wang Z, Chen J, Chen J, Chen H (2023) Identifying interdisciplinary topics and their evolution based on BERTopic. Scientometrics. https://doi.org/10.1007/s11192-023-04776-5
West JD, Jensen MC, Dandrea RJ, Gordon GJ, Bergstrom CT (2013) Author-level Eigenfactor metrics: Evaluating the influence of authors, institutions, and countries within the social science research network community. J Am Soc Inf Sci Technol 64(4):787-801
White K (2019) Publications Output: U.S. Trends and International Comparisons. In Nsb-2020-6. https://ncses.nsf.gov/pubs/nsb20206/
Winnink JJ, Tijssen RJW, van Raan AFJ (2019) Searching for new breakthroughs in science: How effective are computerised detection algorithms? Technol Forecast Soc Change 146:673-686. https://doi.org/10.1016/j.techfore.2018.05.018
Wu W, Zhang S, Wang S (2017) A novel lattice Boltzmann model for the solid-liquid phase change with the convection heat transfer in the porous media. Int J Heat Mass Transf 104:675-687
Xu J, Bu Y, Ding Y, Yang S, Zhang H, Yu C, Sun L (2018) Understanding the formation of interdisciplinary research from the perspective of keyword evolution: A case study on joint attention. Scientometrics 117:973-995
Xu J, Ding Y, Bu Y, Deng S, Yu C, Zou Y, Madden A (2019) Interdisciplinary scholarly communication: an exploratory study for the field of joint attention. Scientometrics 119:1597-1619
Yau CK, Porter A, Newman N, Suominen A (2014) Clustering scientific documents with topic modeling. Scientometrics 100(3):767-786. https://doi.org/10.1007/s11192-014-1321-8
Zahedi Z, van Eck NJ (2018) Exploring topics of interest of Mendeley users. J Altmetrics 1(1):1-12. https://doi.org/10.29024/joa.7
Zhang J, Zhang G, Zhou Q, Ou L (2016) Thermodynamics, kinetics and isotherm studies on the removal of methylene blue from aqueous solution by calcium alginate. J Water Reuse Desalination 6(2):301-309
Zhao YM, Wang J, Wu ZG, Yang JM, Li W, Shen LX (2016) Extraction, purification and anti-proliferative activities of polysaccharides from Lentinus edodes. Int J Biol Macromol 93:136-144
Acknowledgements
agreement No 17/SPR/5324, SciTechSpace). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author contributions
Competing interests
Ethical approval
Informed consent
Additional information
Reprints and permission information is available at http://www.nature.com/reprints
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© The Author(s) 2024
School of Applied Artificial Intelligence, Handong Global University, Pohang, South Korea. Spatial Dynamics Lab, School of Architecture, Planning & Environmental Policy & Insight Centre for Data Analytics, University College Dublin, Dublin, Ireland. Dept. of Technology & Society, the State University of New York, Songdo, South Korea. email: sira.maliphol@sunykorea.ac.kr
