TVDB, UTF-8 and MyTvseries (1 Viewer)

Erez

Portal Pro
April 10, 2007
82
9
Home Country
Israel Israel
Hi,
I've noticed that TVDB now saves all series information in UTF-8.
Does MyTvseries extracts the info as UTF-8?
I ask because Hebrew information is extracted as gibberish.
If I want MyTvseries to extract Hebrew information correctly I need to go to TVDB, manually change my browser's encoding and save the series data under 'ISO-8859-1' encoding.
The problem is that for some reason that doesn't last for long. For example, yesterday I fixed the hebrew data for 'Lost' (saved it as 'ISO-8859-1' encoding) and today the info appears as gibberish again.
But if the info is saved as 'UTF-8' it doesn't change back to gibberish.
I'm guessing that MyTvseries extract the 'ISO-8859-1' data instead of the 'UTF-8' data.
Thanks,
Erez
 

Inker

Retired Team Member
  • Premium Supporter
  • December 6, 2004
    2,055
    318
    Didn't we go down that road before? I remember fighting for hours trying to get all those characters to work and I thought it was finally fixed?
     

    Erez

    Portal Pro
    April 10, 2007
    82
    9
    Home Country
    Israel Israel
    Hi Inker,
    After looking more into this problem I think the problem again is with the TVDB site. I think that due to changes at the site the Hebrew data gets screwed up (and also characters that are not a part of ISO-8859-1).
    I'll try to explain.
    As you might now there are several encoding methods. 'ISO-8895-1' support mainly english characters and 'UTF-8' support a lot more characters (Hebrew, Spanish and more).
    Till now the way for me to get correct Hebrew info in MyTvseries is to make sure that when I save the data at TVDB my browser is set to 'ISO-8859-1' (a bit strange, I know).
    This because of the TVDB interface. For example when you try this:
    http://thetvdb.com/interfaces/GetSeries.php?seriesname=Lost&language=24
    The TVDB interface access the data as 'ISO-8859-1' and not 'UTF-8'. So if I saved the data while my browser was set to any other encoding besides 'ISO-8859-1' using the interface to access the data will result in gibberish. But if you went to the show info at the site (not through the interface) you could select the proper encoding.

    The problem now is that for some reason the data that is saved as 'ISO-8859-1' gets screwed up all the time.
    I've tried resaving is but it gets screwed up again.

    Jocke, I guess we have the same problem.
    A partial solution for you is to resave the data as 'ISO-8859-1'.

    Anyway,
    I'll try my luck at the TVDB forum,
    Thanks,
    Erez
     

    Hawkeye

    Portal Pro
    January 29, 2005
    530
    97
    Halle (Saale)
    Home Country
    Germany Germany
    The problem now is that for some reason the data that is saved as 'ISO-8859-1' gets screwed up all the time.
    I've tried resaving is but it gets screwed up again.

    Maybe this happens when other users save informations in other languages...

    BTW: I got this problem too with german umlauts...
     

    Erez

    Portal Pro
    April 10, 2007
    82
    9
    Home Country
    Israel Israel
    It is reasonable that data that is saved as 'ISO-8859-1' gets screwed up since this method supports only ASCII characters.
    The main problem is with the interface. If you go to the series page and your browser is set to 'UTF-8' the data is displayed correctly. It looks like the interface it set to 'ISO-8859-1' and not 'UTF-8'.

    Erez
     

    z3us

    Portal Pro
    December 4, 2007
    1,047
    123
    46
    Home Country
    Spain Spain
    Im the spanish guy having problems with some characters.
    I hope tvdb people can get this fixed soon

    TY for the explanation
     

    Erez

    Portal Pro
    April 10, 2007
    82
    9
    Home Country
    Israel Israel
    OK, a short summary:
    There are two problems. One is that MyTvSeries uses the old interface instead of the new one (all language fixes and UTF-8 support are made for the new API). The second problem is that UTF-8 doesn't work perfectly with the new API (not all characters are returned correctly).
    Read this thread for more info:
    Online TV Database :: View topic - Unicode / UTF-8 and other fun

    Erez
     

    Users who are viewing this thread

    Top Bottom