Proposal: Support multilingual transcripts #483
Replies: 3 comments
-
why not |
Beta Was this translation helpful? Give feedback.
-
The RSS spec defines a |
Beta Was this translation helpful? Give feedback.
-
For this part of the problem, we can actually use the existing xml:lang attribute: <item xml:lang="pt">
...
</item>
<item xml:lang="es">
...
</item> Technically these existing |
Beta Was this translation helpful? Give feedback.
-
Some podcasts are multilingual, where each episode might use a different language, or even where a single episode may switch between multiple languages.
It is already possible to list multiple languages in the channel of the RSS feed (e.g.
<language>en,es</language>
on the channel), and perhaps there should also be a similar optional tag on each item that defaults to the channel language, because that may be helpful when each episode is in a different language.But when a single episode contains multiple languages, we also need a way to tag which text belong to which language within the transcript.
I am not sure if there is an obvious way to do it in every format, but for JSON, we can add an optional
language
property to each segment which defaults to the item's language in the RSS feed, as follows:For WebVTT, maybe this information could be placed in a comment.
For SRT, maybe this information could be encoded in parentheses or some other type of brackets.
For HTML, maybe this can use the lang attribute.
Beta Was this translation helpful? Give feedback.
All reactions