Hi,
I've been using MP for some 6 months now and I love it.
I live in Greece and lately I tried creating some grabber files for Greek channels to supplement the one included (GR/www_in_gr.xml which i have also enhanced to include descriptions and genres as well as more channels).
These new grabber files use the websites of the respective broadcasters but unfortunately the sites use all-caps for the program titles. WebEPG tries to title-case those tiltes and uses the standard .NET ToLower() method. Unfortunately .NET does not correctly handle some special cases in Greek and IIRC some other languages. Specifically for Greek lower casing the letter Sigma is context sensitive: it becomes "lower case Sigma Final" if it is at the end of the word but "lower case Sigma Not_Final" otherwise (i.e. in the middle of the word). ToLower() incorrectly always turns it to "lower case Sigma Not_Final". And although the meaning is not altered (as happens in some other languages) it is still plain wrong (imagine if HELLO was title cased as HellO: you can still understand the meaning but it doesn't seem right does it?)
I could patch this in WebEPG (just replace non_final with final sigma if it is at the end of a word) but since there are special cases in other languages too, perhaps there should be a more structured way to handle this (e.g. an extensible class in Utils to handle special cases of case folding)
As a side note: I noticed that (almost) all comparisons of program titles, genres and channel names are binary which makes them fast but case and accent sensitive (e.g. if I schedule to record a program "every time" but then the site changes the case of the titles, the program is no longer considered to be the same and it is not recorded)
Regards,
Panayotis
PS: I will post the grabber files once finished for those interested.
I've been using MP for some 6 months now and I love it.
I live in Greece and lately I tried creating some grabber files for Greek channels to supplement the one included (GR/www_in_gr.xml which i have also enhanced to include descriptions and genres as well as more channels).
These new grabber files use the websites of the respective broadcasters but unfortunately the sites use all-caps for the program titles. WebEPG tries to title-case those tiltes and uses the standard .NET ToLower() method. Unfortunately .NET does not correctly handle some special cases in Greek and IIRC some other languages. Specifically for Greek lower casing the letter Sigma is context sensitive: it becomes "lower case Sigma Final" if it is at the end of the word but "lower case Sigma Not_Final" otherwise (i.e. in the middle of the word). ToLower() incorrectly always turns it to "lower case Sigma Not_Final". And although the meaning is not altered (as happens in some other languages) it is still plain wrong (imagine if HELLO was title cased as HellO: you can still understand the meaning but it doesn't seem right does it?)
I could patch this in WebEPG (just replace non_final with final sigma if it is at the end of a word) but since there are special cases in other languages too, perhaps there should be a more structured way to handle this (e.g. an extensible class in Utils to handle special cases of case folding)
As a side note: I noticed that (almost) all comparisons of program titles, genres and channel names are binary which makes them fast but case and accent sensitive (e.g. if I schedule to record a program "every time" but then the site changes the case of the titles, the program is no longer considered to be the same and it is not recorded)
Regards,
Panayotis
PS: I will post the grabber files once finished for those interested.
Greece