Update: Please consider my patch for that in another post (in general development)...
Hi,
could the EPG grabber support above mentionet standart? This standart tells how are encoded latin and non-latin characters in EPG.
As now all data are threated as ASCII only whic is true for latin1 characters. Non-latin characters are encoded as two byte characters - first byte accent + second byte base character.
The problem propably lies in in file DVBEPG.cs where texts are decoded using code like this
and ASCII there causes problem...
More info about standard should be there http://tv-plattform.de/download/DVB-T/MinAnfo/A1_E-Book2-02.pdf
page 85 - Anex A
Small excerpt from coding tables:
Columns:
1. Windows(ISO 1250) hex
2. Windows(ISO 1250) dec
3. ISO Latin 2(ISO 8859-2) dec
4. ISO 6937 hex
5. Unicode hex
6. Character (could be wrong due to the www)
1. 2. 3. 4. 5. 6.
80 138 169 CF53 0x0160 Å
81 141 171 CF54 0x0164 Ť
82 142 174 CF5A 0x017d Ž
83 154 185 CF73 0x0161 Å¡
84 157 187 CF74 0x0165 ť
85 158 190 CF7A 0x017e ž
86 188 188 CF4C 0x013d Ľ
87 190 190 CF6C 0x013e ľ
88 192 192 C252 0x0154 Å”
89 193 193 C241 0x00c1 Ã
8A 196 196 C841 0x00c4 Ä
8B 197 197 C24C 0x0139 Ĺ
8C 200 200 CF43 0x010c Č
8D 201 201 C245 0x0116 É
8E 204 204 CF45 0x0114 Äš
8F 205 205 C249 0x00cd Ã
90 207 207 CF44 0x010e ÄŽ
91 210 210 CF4E 0x0147 Ň
92 211 211 C24F 0x00d3 Ó
93 212 212 C34F 0x00d4 Ô
95 214 214 C84F 0x00d6 Ö
96 216 216 CF52 0x0158 Ř
97 217 217 CA55 0x016e Å®
98 218 218 C255 0x00da Ú
99 220 220 C855 0x00dc Ü
9A 221 221 C259 0x00dd Ã
9B 224 224 C272 0x0155 Å•
9C 225 225 C261 0x00e1 á
9D 228 228 C861 0x00e3 ä
9E 229 229 C26C 0x013a ĺ
9F 232 232 CF63 0x010d Ä
A0 233 233 C265 0x0117 é
A1 236 236 CF65 0x0115 Ä›
A2 237 237 C269 0x00ed Ã
A3 239 239 CF64 0x010f Ä
A4 242 242 CF6E 0x0148 ň
A5 243 243 C26F 0x00f3 ó
A6 244 244 C36F 0x00f4 ô
A7 246 246 C86F 0x00f6 ö
A8 248 248 CF72 0x0159 Å™
A9 249 249 CA75 0x016f ů
AA 250 250 C275 0x00fa ú
AB 252 252 C875 0x00fc ü
BC 253 253 C279 0x00fd ý
Hi,
could the EPG grabber support above mentionet standart? This standart tells how are encoded latin and non-latin characters in EPG.
As now all data are threated as ASCII only whic is true for latin1 characters. Non-latin characters are encoded as two byte characters - first byte accent + second byte base character.
The problem propably lies in in file DVBEPG.cs where texts are decoded using code like this
Code:
th.ThemeText = System.Text.Encoding.ASCII.GetString(data, themesNames, 15);
More info about standard should be there http://tv-plattform.de/download/DVB-T/MinAnfo/A1_E-Book2-02.pdf
page 85 - Anex A
Small excerpt from coding tables:
Columns:
1. Windows(ISO 1250) hex
2. Windows(ISO 1250) dec
3. ISO Latin 2(ISO 8859-2) dec
4. ISO 6937 hex
5. Unicode hex
6. Character (could be wrong due to the www)
1. 2. 3. 4. 5. 6.
80 138 169 CF53 0x0160 Å
81 141 171 CF54 0x0164 Ť
82 142 174 CF5A 0x017d Ž
83 154 185 CF73 0x0161 Å¡
84 157 187 CF74 0x0165 ť
85 158 190 CF7A 0x017e ž
86 188 188 CF4C 0x013d Ľ
87 190 190 CF6C 0x013e ľ
88 192 192 C252 0x0154 Å”
89 193 193 C241 0x00c1 Ã
8A 196 196 C841 0x00c4 Ä
8B 197 197 C24C 0x0139 Ĺ
8C 200 200 CF43 0x010c Č
8D 201 201 C245 0x0116 É
8E 204 204 CF45 0x0114 Äš
8F 205 205 C249 0x00cd Ã
90 207 207 CF44 0x010e ÄŽ
91 210 210 CF4E 0x0147 Ň
92 211 211 C24F 0x00d3 Ó
93 212 212 C34F 0x00d4 Ô
95 214 214 C84F 0x00d6 Ö
96 216 216 CF52 0x0158 Ř
97 217 217 CA55 0x016e Å®
98 218 218 C255 0x00da Ú
99 220 220 C855 0x00dc Ü
9A 221 221 C259 0x00dd Ã
9B 224 224 C272 0x0155 Å•
9C 225 225 C261 0x00e1 á
9D 228 228 C861 0x00e3 ä
9E 229 229 C26C 0x013a ĺ
9F 232 232 CF63 0x010d Ä
A0 233 233 C265 0x0117 é
A1 236 236 CF65 0x0115 Ä›
A2 237 237 C269 0x00ed Ã
A3 239 239 CF64 0x010f Ä
A4 242 242 CF6E 0x0148 ň
A5 243 243 C26F 0x00f3 ó
A6 244 244 C36F 0x00f4 ô
A7 246 246 C86F 0x00f6 ö
A8 248 248 CF72 0x0159 Å™
A9 249 249 CA75 0x016f ů
AA 250 250 C275 0x00fa ú
AB 252 252 C875 0x00fc ü
BC 253 253 C279 0x00fd ý