special characters tvguide.xml (1 Viewer)

boeloeboeloe

Portal Member
May 15, 2006
15
0
Belgium
When I grab the program guide, I have always problems with special characters.
Something like this in the file tvguide.xml: "Clips van ��n artiest op een rij"

Is there a solutions for that?
 

gibman

Retired Team Member
  • Premium Supporter
  • October 4, 2006
    2,998
    1,372
    Usa
    Home Country
    Ethiopia Ethiopia
    first off, your xml file needs to be encoded in utf-8 format ?

    How do u obtain ur xml file ?

    webegp parsing a website ??

    try and post a small section of your file.

    /gibman
     

    boeloeboeloe

    Portal Member
    May 15, 2006
    15
    0
    Belgium
    The xml-file is encoded in utf-8 format.

    I grab the file with the mediaportal webepg (from de site www.teveblad.be, but same problem with other sites).

    When opening with notepad, you can see that there is a little space in the title-tag "America s" and in desc-tag "homevideo s". The required character is a single quote.

    When opening with dreamweaver, the character is showed correctly.

    When loading in Mediaportal is represented as a square.

    Hope that you have enough information?

    Thanks in advance.



    "<programme start="20071207182500" stop="20071207192000" channel="Kanaaltwee-kanaaltwee.be">
    <title>Buffy, the Vampire Slayer</title>
    <sub-title>Surprise</sub-title>
    <desc>Komische horrorserie.
    Buffy heeft een nachtmerrie waarin ze ziet hoe Drusilla afrekent met Angel. Ze vraagt zich af of haar droom realiteit is en gaat naar Angels appartement. Hij blijkt ongedeerd en stelt haar gerust dat Drusilla dood is. Zij is er echter niet van overtuigd dat dat wel zo is. Over de Meester had ze net zo’n nachtmerrie en even later viel hij haar aan. Dat ze het bij het rechte eind heeft, blijkt wanneer blijkt dat Spike en Drusilla een demoon op haar afsturen die niet gedood kan worden...</desc>
    <category>serie</category>
    <episode-num>13</episode-num>
    </programme>
    <programme start="20071207192000" stop="20071207200500" channel="Kanaaltwee-kanaaltwee.be">
    <title>Most Shocking</title>
    <desc>Video-opnames van waargebeurde en schokkende feiten.</desc>
    </programme>
    <programme start="20071207200500" stop="20071207203500" channel="Kanaaltwee-kanaaltwee.be">
    <title>America’ s Funniest Home Videos</title>
    <desc>Show (VS) met grappige homevideo’ s.</desc>
    </programme>
    <programme start="20071207203500" stop="20071207232500" channel="Kanaaltwee-kanaaltwee.be">
    <title>Bad Boys II</title>
    <desc>Actiekomedie van Michael Bay.
    Mike Lowrey en Marcus Burnett, twee detectives in Miami, maken jacht op de beruchte drugsbaron Tapia alias Johnny, die van plan is de stad te bevoorraden met extacy. Hun taak wordt er niet eenvoudiger op wanneer Sidney, de zus van Marcus en een agente van de drugsbrigade, probeert te infiltreren in het witwasnetwerk van Alexei, een Russische zware jongen...</desc>
    <category>film</category>
    </programme>"
     

    Furetto

    Moderator - Dutch Forums
    April 11, 2005
    664
    61
    52
    Brussels
    Home Country
    Belgium Belgium
    In another thread I posted a fix for this.

    The reason behind this is that TeveBlad is not using the normal apostrophe (ascii code 39) but the right curly quote mark (ascii code 146) and I guess the WebEPG grabber currently does not interprete this well.

    The fix I used was to use the ACTION command to replace all & # 39; by & # 146; [spaces added to prevent replacing of the codes] in both TITLE and DESCRIPTION.

    A more permanent fix would be to treat the & # 146; correctly inside webepg.

    *grumble* It would seem that now the description is sometimes added twice... Need to investigate
     

    Furetto

    Moderator - Dutch Forums
    April 11, 2005
    664
    61
    52
    Brussels
    Home Country
    Belgium Belgium
    Gibman, maybe indeed you can help out here.

    Excerpt from the website
    Code:
    <td nowrap class='detailtop' width='100%'>Debbie Travis& #146; Facelift
    </td>
    
    Comment: added a space between & and # to prevent the forum from replacing the code


    In the TVGuide.xml this becomes

    Code:
     <title>Debbie Travis# Facelift</title>
    where the hash replaces a character that is not represented correctly. Depending on the font, codepage etc used, you see either nothing (default font) or a square replacing a non-representable character (in Times New Roman) or the & # 198 ; character Æ (when in Uestudio I click on OEM to ANSI) or finally the apostrophe & # 146; ’ (when in Uestudio I click ANSI to OEM, but this borks all other special characters).

    As I mentioned in the other thread, I can workaround the issue for the TITLE by modifying the end of the www_teveblad_be grabber into

    Code:
      </Listing>
      <Actions>
        <Modify channel="*" field="#TITLE" search="& #146;" action="Replace">& #39;</Modify>
        <Modify channel="*" field="#DESCRIPTION" search="& #146;" action="Replace">&# 39;</Modify>
      </Actions>
    </Grabber>
    
    Comment: again added a space between & and #


    But with this code, the #DESCRIPTION tag gets doubled in many cases. Instead of simply saying

    Code:
    <desc>Kookprogramma.</desc>

    I suddenly get

    Code:
    <desc>Kookprogramma. Kookprogamma.</desc>

    (This last is from memory, I had it happen like this several times yesterday but removed the resulting files and don't feel like rerunning all just to prove I'm right :p )
     

    Furetto

    Moderator - Dutch Forums
    April 11, 2005
    664
    61
    52
    Brussels
    Home Country
    Belgium Belgium
    To start: sorry for hijacking this thread

    For a long time, the Belgian webepg script is giving a small issue. The single quote character is rendered as a question mark. I don't know enough of programming to go dig into the code and fix it myself, but I think in the thread where this is discussed I provide information that could allow a developer to fix this rather easily.

    To sum up in short, this website uses & #146; (space added on purpose) instead of & #39; as single quote or apostrophe. The normal single quote gets 'escaped' normally, but the one used on this website does not.

    James, do you think this could be fixed easily ? My workaround I mentioned in the thread is not workin as good as I hoped...
     

    James

    Retired Team Member
  • Premium Supporter
  • May 6, 2005
    1,385
    67
    Switzerland
    Hi Furetto,

    I have seen this thread and I will make a change to fix this problem for you :)

    The problem is not really with WebEPG, but that MP doesn't support UniCode characters ;)

    /James
     

    Users who are viewing this thread

    Similar threads

    Update: Problem solved! The electronic program guides (tvguide.xml) have been successfully transferred to MediaPortal. The issue stemmed from the corruption of one or both of the mc2xml.dat and/or mc2xml.exe files, for an unknown reason. Consequently, even though the tvguide.xml file appeared to be updated, its content kept being an...
    Update: Problem solved! The electronic program guides (tvguide.xml) have been successfully transferred to MediaPortal. The issue...
    I was having a problem with the EPG on MP2 1.4.1 (although it used to run without any issues for a while). So I installed MP2 1.5...
    Replies
    8
    Views
    2K
    According to Wikipedia, Windows only uses code pages other than UTF-8 to support legacy (mostly 32 bit) applications and supports two code pages in addition to UTF-8 (OEM and ANSI). According to Microsoft, UTF-8 has been the preferred character set since 2019. If TV Server (or its import routine) were using UTF-8, then umlauts...
    According to Wikipedia, Windows only uses code pages other than UTF-8 to support legacy (mostly 32 bit) applications and supports...
    Because I have around 200 TV channels and more than a dozen groups, I tried to import the TV channel groups from an old export.xml...
    Replies
    4
    Views
    1K
    I would have to know more details, and ultimately, I'll have to test it when I get my new display, but an AI answer I got (I don't trust AI and this may not be correct) was that HDR is encoded with the HEVC codec and I know MePo uses the HEVC codec to play 4K UHD Blue-ray discs. So, there is at least the possibility that it will work...
    I would have to know more details, and ultimately, I'll have to test it when I get my new display, but an AI answer I got (I don't...
    Does MePo 1 pass the HDR metadata, Dolby Vision, HDR10, etc., along to the display when playing 4K UHD Blu-ray discs? If not, are...
    Replies
    2
    Views
    1K
    After many hours of troubleshooting I've given up and accepted that it's just "one of these things" when using madVR for watching live TV. I've now ticked the box to "Use EVR for LiveTV", with a default display refresh rate of 50Hz. In combination with tweaking some of the Nvidia 3D settings for the MP executable only, I now have a...
    After many hours of troubleshooting I've given up and accepted that it's just "one of these things" when using madVR for watching...
    Every so often, when I switch to a TV channel, my MP client stops accepting user inputs, either from the remote or keyboard/mouse...
    Replies
    30
    Views
    8K
    Now with Version 1.37 x64 MP is near to perfect with repect to speed, robustness and funtionality. There is just one point where I have an Idea for further improvement. As I am not too interested in advetisments, I am using comskip to mark times of ads, which is normally working, as it should but produces an error when trying to...
    Now with Version 1.37 x64 MP is near to perfect with repect to speed, robustness and funtionality. There is just one point where I...
    Now with Version 1.37 x64 MP is near to perfect with repect to speed, robustness and funtionality. There is just one point where I...
    Replies
    0
    Views
    970
    Top Bottom