I'm trying to develop a new WebEPG grabber for Sweden, but I've run into a few problems parsing the HTML file to get all the programs.
In the file attached to this post, the problem is that the programs are not all marked up the same way.
The first issue is in the program at 11:30.
The <h3> tag holds the title of the program, but as you can see there is some extra info inside it: a <span> tag that I don't need and don't know how to remove.
The second issue is in the program at 19:30.
That program is "missing" the subtitle tag, i.e. the <span> tag between the <h3> and the <p>.
I have a TemplateText that is almost working, but I'm missing four of the programs from this file because I can't figure out how to handle the two issues above.
If I make those tags optional, I get all the programs, but not all the info for each program.
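
To make the two cases concrete, here is a minimal sketch of the matching logic in Python. It is not WebEPG TemplateText syntax, and the sample markup, class names, and the parse_programs helper are all assumptions made up for illustration, since the real page may be structured differently. It shows the idea of treating the subtitle <span> as optional while still capturing it when present, and stripping any <span> nested inside the <h3> from the title.

```python
import re

# Hypothetical markup modelled on the two cases described above.
# The real listing page may use different tags and attributes.
SAMPLE = """
<h3>Nyheter <span class="extra">(R)</span></h3>
<span class="subtitle">Del 3</span>
<p>Dagens nyheter.</p>
<h3>Sporten</h3>
<p>Sportnyheter.</p>
"""

# Title, then an optional subtitle <span>, then the description.
PROGRAM = re.compile(
    r"<h3>(?P<title>.*?)</h3>\s*"
    r"(?:<span[^>]*>(?P<subtitle>.*?)</span>\s*)?"
    r"<p>(?P<description>.*?)</p>",
    re.DOTALL,
)

# Any <span> nested inside the <h3> is treated as unwanted extra info.
INNER_SPAN = re.compile(r"<span[^>]*>.*?</span>", re.DOTALL)


def parse_programs(html):
    """Return one dict per program; subtitle is '' when the tag is absent."""
    programs = []
    for match in PROGRAM.finditer(html):
        title = INNER_SPAN.sub("", match.group("title")).strip()
        programs.append({
            "title": title,
            "subtitle": (match.group("subtitle") or "").strip(),
            "description": match.group("description").strip(),
        })
    return programs


programs = parse_programs(SAMPLE)
for program in programs:
    print(program)
```

The point of the sketch is that "optional" does not have to mean "dropped": the subtitle group is still captured whenever the tag exists, and the cleanup of the title happens as a separate step after the match.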