A buffer initially preloads a text subtitle stream recorded on a recording medium. The preloaded text subtitle stream includes a dialog style segment defining a group of region styles and a plurality of dialog presentation segments, each of which contains at least one region of dialog text. Each region of dialog text is linked to a selected one of the group of region styles defined in the dialog style segment. The dialog style segment and the plurality of dialog presentation segments are encapsulated in a plurality of PES packets, respectively. A text subtitle decoder decodes and presents each dialog presentation segment according to a continuous presentation flag included in each dialog presentation segment. The continuous presentation flag indicates whether continuous presentation is required for each dialog presentation segment with previous one. |