Wrong letters/symbols in Norwegian subtitles (1 Viewer)

hulkhaugen

Portal Pro
November 9, 2008
450
58
Arendal
Home Country
Norway Norway
Nope, i'm talking about DVB-subtitles (not teletext) for DVB-S(2)-channels...i can post screenshots lather if that would help you...

DVB subtitles are broadcast as bitmaps. My earlier comment was based on it being teletext, where the issue would likely be seen.
Edit: Logs and a screenshot would help.
Edit2: I just see that logs have been attached to first post. Logs show it being teletext subtitles. Language is detected as "nor" (correct). arion_p did some work on this not so long ago... Maybe he can help.

If it's teletext, then why does it use a completely different font than some other channels? These subtitles i'm talking about looks like Areal font, and on a few channels, there's a subtitle-font looking like the Symbol-font. Nope, i'm not talking about channels where the subtitles are part of the video/unchangeable.

Looking at the code, conversion of those characters is done only for Danish. The language is taken from the DVB stream language descriptor just like Seidelin noted (not from regional settings as I thought). Of course if the language is not specified in the descriptor (as it happen in some broadcasts) not mapping will be done at all.

Currently the mapping for Danish is:
Code:
\u00C4 =>  \u00C6
\u00D6 => \u00D8
\u00E4 => \u00E6
\u00F6 => \u00F8
Can someone confirm that it also applies to Norwegian too. If not, please supply a mapping (please use the above notation to avoid ANSI/Unicode conversion errors) so that we can add it.

Hmm, that seems correct according to the charmap.exe on my Norwegian os. But i can't see why the symbols should be "translated" at all. If it's a name, then it should get it's original symbols. Guess the subs-writer are in control of what they write, we shouldn't have to correct them.

Norvegian and danish letters are the same, so this conversion would also be the same. 29 letters, A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Æ Ø Å

Same as in Norway, Just the Swedes who have to be different...

Bottom line, if this isn't a error in a font file, then i guess the providers are the problem. Is it the installed Areal and Symbol fonts that are used?
 

pilehave

Community Skin Designer
  • Premium Supporter
  • April 2, 2008
    2,566
    521
    Hornslet
    Home Country
    Denmark Denmark
    In the source it looks like neither Arial or Symbol:

    _fontTeletext = new Font("Lucida Console", fntSize, FontStyle.Regular, GraphicsUnit.Pixel);

    (this is the teletext-renderer)
     

    hulkhaugen

    Portal Pro
    November 9, 2008
    450
    58
    Arendal
    Home Country
    Norway Norway
    Try this Core.dll, if it works, I will commit the changes.
    Also see if the same characters work in teletext. If I am correct and you have reg. settings Norwegian, teletext should have the same problem.

    Hey, this works perfect (10 mins testing on different channels) :D

    Do the Swedes have to suffer for this now then? :p
     

    arion_p

    Retired Team Member
  • Premium Supporter
  • February 7, 2007
    3,373
    1,626
    Athens
    Home Country
    Greece Greece
    Currently the mapping for Danish is:
    Code:
    \u00C4 =>  \u00C6
    \u00D6 => \u00D8
    \u00E4 => \u00E6
    \u00F6 => \u00F8

    Hmm, that seems correct according to the charmap.exe on my Norwegian os. But i can't see why the symbols should be "translated" at all. If it's a name, then it should get it's original symbols. Guess the subs-writer are in control of what they write, we shouldn't have to correct them.

    The character coding used for teletext transmission is different from anything used in computer systems. MP has to translate from that coding to UTF-16 coding which is what is used in .NET. In the ETSI standard for teletext there is no coding for Danish / Norwegian, however (as it seems) Denmark and Norway decided to replace the two glyphs used in Swedish (Ä, Ö) with the ones in Danish/Norwegian (Æ, Ø) thus breaking the standard. A TV set sold in Denmark will probably not be able to display the Swedish characters unless both the broadcaster and the TV support teletext level 2.5 and up (or has some setting to switch between the two). In MP we follow the ETSI standard and then if the language is Danish (or Norwegian with this fix) we change letters Ä, Ö to Æ, Ø.

    I need the characters in UTF-16 codes just because the source files are ANSI encoded instead of UTF-8 and since my ANSI code page is 1032 (Greek) I cannot input those characters without them being misinterpreted by the compiler as the Greek letters Φ, Δ.

    Have you tried teletext? Does it work properly?

    PS: Swedish will still work as expected
     

    hulkhaugen

    Portal Pro
    November 9, 2008
    450
    58
    Arendal
    Home Country
    Norway Norway
    Teletext, as in text tv? the old news source?

    I tried it now, and it still displays Swedish letters, but subtitles are correct...
     

    nikos1671

    Portal Pro
    April 21, 2008
    244
    8
    46
    Germany
    Home Country
    Greece Greece
    Hi arion, it would be perfect if you fix also the problem with the greek characters in teletext also in the dvb-t subtitles.i know that you already have make patch but now with last version of mp it doesnt work,please ask from the other devs to include your patch in the svn.we are some greek guys also here that we use mp :)
     

    Users who are viewing this thread

    Top Bottom