طريقة الفرع والحد لحساب مسار فيتربي في نماذج ماركوف الثلاثية Branch-and-bound method for calculating Viterbi path in triplet Markov models

المجلة: METRON
DOI: https://doi.org/10.1007/s40300-026-00307-3
تاريخ النشر: 2026-05-12
المؤلف: Oskar Soop وآخرون
الموضوع الرئيسي: سلاسل ماركوف وطرق مونت كارلو

نظرة عامة

تتناول هذه الورقة البحثية التحديات الحاسوبية في العثور على مسارات فيتربي، أو تقديرات الحد الأقصى بعد (MAP)، في نماذج ماركوف الثلاثية (TMMs). على عكس سلاسل ماركوف القياسية، فإن العملية الهامشية $X$ في TMMs عادة ما تكون غير ماركوفية، مما يعقد تحديد المسار الأكثر احتمالاً. يقترح المؤلفون إطار عمل قائم على الفروع والحدود يستفيد من خاصية ماركوف المشتركة للعملية الثلاثية الأساسية $(X, U, Y)$ لحساب حدود فعالة على الاحتمال الأقصى $P(X = x)$. يتم استكشاف استراتيجيات حدود مختلفة، بما في ذلك الحدود البسيطة، وحدود مجموع القوى، وحدود من نوع صموئيلسون، وحدود ماكس-سوم المتبادلة (SMS)، وتقديرات m-Viterbi، لتعزيز الكفاءة الحاسوبية.

تشير النتائج إلى أنه على الرغم من أن المشكلة تظل NP-hard، فإن الطريقة المقترحة تقلل بشكل كبير من مساحة البحث وتعقيد الحساب مقارنة بأساليب البحث الشامل. يسمح استخدام حدود عليا وسفلى ضيقة، خاصة مع تقديرات m-Viterbi (الموصى بها لـ $m \geq 2$)، بالحساب العملي لمسارات فيتربي في TMMs ذات الحجم المعتدل. كما يبرز المؤلفون أن فعالية استراتيجيات الحدود العليا المختلفة تختلف بناءً على هيكل المشكلة والموارد الحاسوبية المتاحة، مما يشير إلى أنه ينبغي على الممارسين تجربة هذه الاستراتيجيات على أساس كل حالة على حدة. بشكل عام، يسهل النهج المقترح التوقف المبكر في عملية البحث مع الحفاظ على حدود صارمة على احتمال المسار الأمثل.

مقدمة

في هذه الورقة، يستكشف المؤلفون الطرق الحاسوبية لتحديد مسار فيتربي، أو مسار الاحتمالية القصوى، لسلسلة ماركوف ذات الحالة المنتهية غير المتجانسة، مع التركيز بشكل خاص على العملية الهامشية $X$ التي تفتقر عادة إلى خاصية ماركوف. الهدف الأساسي هو تحديد مسار $x = (x_1, \ldots, x_n)$ الذي يعظم الاحتمال $P(X = x)$. بينما يجد خوارزمية فيتربي بكفاءة مثل هذه المسارات لسلاسل ماركوف، تصبح المهمة NP-hard عندما تكون $X$ غير ماركوفية، مما يؤدي إلى صياغة مشكلة الهامش الأقصى. هذه المشكلة ذات صلة خاصة في سياق نماذج ماركوف الثلاثية (TMMs)، حيث تتطلب تقسيم البيانات الملاحظة $Y$ تحديد العمليات الكامنة الأكثر احتمالاً $X$ و $U$.

لمعالجة مشكلة الهامش الأقصى، يقترح المؤلفون طريقة قائمة على الفروع والحدود تستخدم حدودًا عليا وسفلى مختلفة لـ $P(X = x)$. يستكشفون عدة أنواع من الحدود، بما في ذلك حدود مجموع القوى، وحدود من نوع صموئيلسون، وحدود ماكس-سوم المتبادلة، وتقديرات m-Viterbi. توضح الورقة التعقيد الحاسوبي المرتبط بهذه الحدود وتؤكد على استخدام خاصية ماركوف المشتركة لـ $(U, X)$ لتعزيز الكفاءة. تظهر النتائج التجريبية أن نهج الفروع والحدود يتفوق بشكل كبير على أساليب البحث الشامل، حيث أن تقدير m-Viterbi عمومًا يعطي أفضل حد سفلي. ومع ذلك، تتحدى الأمثلة المضادة الافتراضات السائدة بشأن العلاقة بين ترتيب $m$ للتقدير وفعاليته، مما يشير إلى أن زيادة $m$ لا تؤدي دائمًا إلى تحسين النتائج.

نقاش

تتناول قسم النقاش في الورقة إطار نماذج ماركوف الثنائية (PMMs) ونماذج ماركوف الثلاثية (TMMs)، مع تسليط الضوء على اختلافاتها الهيكلية وتأثيراتها على مهام التقسيم. تتميز PMMs بسلسلة ماركوف ثنائية المتغيرات حيث تكون العملية الإشارية (المتغير المخفي) غالبًا سلسلة ماركوف، بينما قد لا تكون العملية الملاحظة كذلك. تتيح هذه المرونة الاعتماد الشرطي بين الملاحظات، وهو ميزة كبيرة مقارنة بالنماذج المخفية لماركوف (HMMs) التي تفرض الاستقلالية بين الملاحظات بالنظر إلى الإشارة. تمتد TMMs إلى ثلاثة متغيرات، حيث تتضمن عملية مساعدة تساعد في النمذجة ولكنها ليست ذات أهمية أولية. يؤكد المؤلفون أنه على الرغم من أنه يمكن التعامل مع كل من PMMs وTMMs كنماذج ماركوف، فإن العمليات الهامشية لـ TMMs لا تحتفظ بخاصية ماركوف، مما يعقد تطبيق خوارزميات فك التشفير القياسية مثل خوارزمية فيتربي.

تتناول الورقة مشكلة التقسيم ضمن TMMs، حيث الهدف هو تقدير تسلسل الحالة غير الملاحظة بناءً على البيانات الملاحظة. يقدم المؤلفون مفهوم مشكلة الهامش الأقصى، التي تسعى إلى تعظيم الاحتمال الهامشي للحالات المخفية بالنظر إلى الملاحظات. يقترحون خوارزمية قائمة على الفروع والحدود لمعالجة هذه المشكلة NP-hard بكفاءة، إلى جانب طرق هيوريستية لتحسين الجدوى الحاسوبية. كما يقارن النقاش التحديات الحاسوبية للعثور على مسارات فيتربي في TMMs بتلك الموجودة في PMMs، مشيرًا إلى أنه بينما يمكن حساب مسارات PMAP بسهولة أكبر، تتطلب مسارات فيتربي نهجًا أكثر تعقيدًا بسبب فقدان خاصية ماركوف في العمليات الهامشية. بشكل عام، يبرز القسم الحاجة إلى خوارزميات متقدمة للتنقل بفعالية في تعقيدات التقسيم في نماذج ماركوف غير المتجانسة.

Journal: METRON
DOI: https://doi.org/10.1007/s40300-026-00307-3
Publication Date: 2026-05-12
Author(s): Oskar Soop et al.
Primary Topic: Markov Chains and Monte Carlo Methods

Overview

This research paper addresses the computational challenges of finding Viterbi paths, or maximum a posteriori (MAP) estimates, in triplet Markov models (TMMs). Unlike standard Markov chains, the marginal process $X$ in TMMs is typically non-Markovian, complicating the identification of the most probable path. The authors propose a branch-and-bound framework that leverages the joint Markov property of the underlying triplet process $(X, U, Y)$ to compute effective bounds on the maximum probability $P(X = x)$. Various bounding strategies, including simple, power sum, Samuelson-type, Swapped Max-Sum (SMS), and m-Viterbi approximations, are explored to enhance computational efficiency.

The findings indicate that while the problem remains NP-hard, the proposed method significantly reduces the search space and computational complexity compared to exhaustive search methods. The use of tight upper and lower bounds, particularly with m-Viterbi approximations (recommended for $m \geq 2$), allows for practical computation of Viterbi paths in moderately sized TMMs. The authors also highlight that the effectiveness of different upper bounding strategies varies based on the problem’s structure and available computational resources, suggesting that practitioners should experiment with these strategies on a case-by-case basis. Overall, the proposed approach facilitates early stopping in the search process while maintaining rigorous bounds on the optimal path probability.

Introduction

In this paper, the authors investigate computational methods for determining the Viterbi path, or maximum likelihood path, for a non-homogeneous finite-state Markov chain, specifically focusing on the marginal process $X$ which typically lacks the Markov property. The primary objective is to identify a path $x = (x_1, \ldots, x_n)$ that maximizes the probability $P(X = x)$. While the Viterbi algorithm efficiently finds such paths for Markov chains, the task becomes NP-hard when $X$ is non-Markovian, leading to the formulation of the maximal marginal problem. This problem is particularly relevant in the context of triplet Markov models (TMMs), where the segmentation of observed data $Y$ necessitates the identification of the most probable latent processes $X$ and $U$.

To address the maximal marginal problem, the authors propose a branch-and-bound method that utilizes various upper and lower bounds for $P(X = x)$. They explore several types of bounds, including power sum bounds, Samuelson-type bounds, swapped max-sum bounds, and m-Viterbi approximations. The paper outlines the computational complexity associated with these bounds and emphasizes the use of the joint Markov property of $(U, X)$ to enhance efficiency. Empirical results demonstrate that the branch-and-bound approach significantly outperforms exhaustive search methods, with the m-Viterbi approximation generally yielding the best lower bound. However, counterexamples presented challenge prevailing assumptions regarding the relationship between the order $m$ of the approximation and its effectiveness, suggesting that increasing $m$ does not always lead to improved results.

Discussion

The discussion section of the paper elaborates on the framework of pairwise Markov models (PMMs) and triplet Markov models (TMMs), highlighting their structural differences and implications for segmentation tasks. PMMs are characterized by a bivariate Markov chain where the signal process (hidden variable) is often a Markov chain, while the observed process may not be. This flexibility allows for conditional dependencies between observations, which is a significant advantage over hidden Markov models (HMMs) that enforce independence among observations given the signal. TMMs extend this concept to three variables, incorporating an auxiliary process that aids in modeling but is not of primary interest. The authors emphasize that while both PMMs and TMMs can be treated as Markov models, the marginal processes of TMMs do not retain the Markov property, complicating the application of standard decoding algorithms like the Viterbi algorithm.

The paper addresses the segmentation problem within TMMs, where the goal is to estimate the unobserved state sequence based on observed data. The authors introduce the concept of the maximal marginal problem, which seeks to maximize the marginal probability of the hidden states given the observations. They propose a branch-and-bound algorithm to efficiently tackle this NP-hard problem, alongside heuristic methods to improve computational feasibility. The discussion also contrasts the computational challenges of finding Viterbi paths in TMMs with those in PMMs, noting that while PMAP paths can be computed more easily, Viterbi paths require more complex approaches due to the loss of the Markov property in the marginal processes. Overall, the section underscores the need for advanced algorithms to effectively navigate the complexities of segmentation in non-homogeneous Markov models.