*NEW* IMDb scraper (With localized versions for DE/ES/FR/IT/PT) (1 Viewer)

RoChess

Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #11
    I also found some small bugs:
    - when no German summary is found, the English is displayed along with two squares before the English description starts

    Those are TAB characters, they are side effects of the CDATA method in the source which means to use the text as-is, but to keep in line with the indenting of the rest of the scrapers XML (with dirty hack to get this to work), those TABs are used. However I was aware that they would not get displayed within the GUI which is normally the only place you would look at the summary. And positive side effect is, that it highlights easy the summaries that should be contributed to the localized IMDb site, in this case imdb.de

    - if no German certification is found, either the American one or no certification at all is displayed --> for example "Blind Side" has the US certification displayed and "Up in the Air" or "Shutter Island" have no certification at all

    I am totally reliant on what the IMDb.de website gives me.

    Code:
    Blind Side = http://www.imdb.de/title/tt0878804/
      \_ no 'Deutschland' entry for 'Altersfreigabe'
    
    Up in the Air = http://www.imdb.de/title/tt1193138/
      \_ Deutschland:o.Al.
    
    Shutter Island = http://www.imdb.de/title/tt1130884/
      \_ no 'Deutschland' entry for 'Altersfreigabe'

    So I do not understand how US Certification got displayed for Blind side, it should have been empty.. So I like to get a movingpictures.log file with just 'that' movie getting re-imported and with scraper debug mode enabled, so that the log file actually contains what I need (FAQ explains how to do this, the green little bug icon).

    Shutter Island means somebody has to register at imdb.de and contribute the proper German certification.

    Up in the Air is the weird one, because I thought German certification was always a number, so that means if "o.Al." is a known rating (can also be a typo or mistake from whomever contributed it), then let me know and I will have to make an enhancement for it.
     

    Pilgrimon

    Portal Pro
    April 15, 2010
    63
    1
    Thanks for your answer RoChess!

    I enabled the debug mode in MediaPortal and MovingPictures and attached the movingpictures.log for the rescan of "Blind Side" to my reply. May it be, that the certification PG-13 from "Blind Side" is still there, because the IMDb.de Scraper found no certification so the one from IMDb.com Scraper is kept?

    "o.Al." is a known and valid German rating and is equal to 0 years, but we use "o.Al.", which means that the movie is suitable for everybody, too.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #13
    Thanks for your answer RoChess!

    I enabled the debug mode in MediaPortal and MovingPictures and attached the movingpictures.log for the rescan of "Blind Side" to my reply. May it be, that the certification PG-13 from "Blind Side" is still there, because the IMDb.de Scraper found no certification so the one from IMDb.com Scraper is kept?

    "o.Al." is a known and valid German rating and is equal to 0 years, but we use "o.Al.", which means that the movie is suitable for everybody, too.

    Yes, it is unable to find certification value, but that means it does not update any existing values. I thought this was for a new movie import, so what you can do is either manually delete the old certification value, edit it manually to what you believe it should be (or preferably contribute to imdb.de), or you can sent the movie back to the importer instead of updating so that everything is first removed and then re-added.

    As for "o.Al.", would you as a German user prefer to see "o.Al." or should I display '0' instead?

    PS: You also have a few TV shows in your MovingPictures import folder that fail to be detected. Those are better suited for the MP-TVSeries plugin, so you might want to move those to another folder (so they are not detected by MovingPictures in the future), ignoring them, or manually force them via the configuration screen by using the imdb-tt ID directly.

    "i-camf.s03e11.avi" = tt1087505
    "S02E02 Karriere mit Koepfchen.avi" = tt0701211

    But MovingPictures is clearly not designed to handle TV Shows (with exception of Mini-Series) as IMDb website lacks a lot of info and no easy way to seperate individual episodes.

    MP-TVSeries relies on TheTVdB which has German support as well, so you end up with information such as: Die Simpsons: Karriere mit Köpfchen
     

    Pilgrimon

    Portal Pro
    April 15, 2010
    63
    1
    Instead of rescanning "Blind Side", I send it back to the importer and now everything works as expected (certification is now empty as it should be, sorry for the false alarm).

    Regarding the "o.Al." I would prefer to display 0 in order to be consitent with the other numerical certifications in Germany. So as long as no other German user thinks different the 0 looks good to me. :D

    As for the TV shows in my import folder I have to say, that this is just my MediaPortal test-setup on my notebook. So everything is kind of messy and not sorted into different folders as on my HTPC. Nevertheless, thanks for pointing this out to me. :)

    If there is anything else I can do to help you testing the IMDb.de Scraper, just let me know!
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #15
    If there is anything else I can do to help you testing the IMDb.de Scraper, just let me know!

    Please test IMDb.de v1.5.0 scraper as attached to first post.

    This should turn "o.Al." rating to '0' when you update it for "Up in the Air" (no need to sent back to importer).

    Also test it on other movies to make sure nothing broke, but sent those back to the importer, so that it reveals errors (as it will re-uses old value on failure otherwise).
     

    Pilgrimon

    Portal Pro
    April 15, 2010
    63
    1
    I tested the IMDb.de Scaper 1.5.0 and updated "Up in the Air". Everything works as expected, the certification displayed correctly as "0" and all other information was not updated.

    Furthermore I sent the other movies from my test folder back to the importer and noticed no errors, when they were reimported and the information was gathered from IMDb.de.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #17
    Thank you for testing.

    Please try to use it on as many movies as you can for a while longer, and if no other problems show up, then I will rename it version 1.5.4 to be in line with the IMDb.com scraper version, because aside from the German specific parts, everything else is the same.
     

    Langi

    Portal Member
    October 5, 2009
    13
    3
    Home Country
    Austria Austria
    RoChess

    Thanks for IMDb.de scraper. I tested it yesterday in conjunction with ofdb-scraper and it works perfect for me. So it is the perfect supplement for the ones supplied with Moving Pictures.

    Keep up the good work and many :D

    Best regards,
    Langi
     

    Pilgrimon

    Portal Pro
    April 15, 2010
    63
    1
    Hi RoChess,

    until now I have tested 142 movies from various decades. I did not recognize any bugs, whether I named the folder with the English movie title, the German movie title or with the IMDb-ID in a text-file. Everything worked just fine and all information was gathered correctly from IMDb.de.

    However, when I wanted to import the movie "Inglourious Basterds" only the Title, Alternate Titles, Certification and Summary were imported from IMDb.de and no details like actor or director. But after a rescan from IMDb.de the missing information was added correctly. To double check I send the movie back to the importer and all information was gathered correctly this time.
    Something similar happened with "P.S. I love you". This movie has no German summary, but the English short summary was not imported. Again, after a rescan from IMDb.de everything was fine and the English short summary was present. This time I also send the movie back to the importer and there was no problem at all when importing again.

    So I do not know why this happend in first place, but afterwards everything was working alright. If you need some logs with only importing these two movies, just let me know. Furthermore, if it is necessary to test more movies, I will do so.

    Nevertheless, I am going to update some of the movie details on IMDb.de which were missing during my tests.
     

    Users who are viewing this thread

    Top Bottom