*NEW* IMDb scraper (With localized versions for DE/ES/FR/IT/PT)

Discussion in 'Moving Pictures' started by RoChess, April 1, 2010.

  1. RoChess
    • Premium Supporter

    RoChess Extension Developer

    Joined:
    March 10, 2006
    Messages:
    4,239
    Likes Received:
    1,336
    Ratings:
    +1,745 / 2
    IMDb v1.5.4 scraper fixes:

    • AKA titles during search-node lookup
    • Popularity vote numbers
    • Short summary trailing '|'
    • Fixed RegExp conditional problem
    • Improved speed by using YEAR in IMDb search query when available
    • Fixed obtaining Cover Image (v1.5.4)
    Auto-approve is up to 97% now on a large test collection (300 filenames), up from 95% on previous scraper. Big thanks to LRFalk for taking the time to help me test the English scraper.

    Close to giving up on Portugese version, because I guess nobody using MovingPictures knows the language, the rest works:

    Install instructions available on the FAQ at "Adding a new, or updating an existing scraper".

    Thank you.



    PS: I've added v1.4.9 on the Portugese scraper, but there is no confirmation if it works, and will not be included in the default MovingPictures configuration until somebody gets back to me. I also do not plan to keep it updated without feedback (what's the point otherwise).
     

    Attached Files:

    • Like Like x 14
  2. Google AdSense Guest Advertisement



    to hide all adverts.
  3. alexeiei
    • Premium Supporter

    alexeiei MP Donator

    Joined:
    April 8, 2009
    Messages:
    27
    Likes Received:
    4
    Gender:
    Male
    Location:
    Barcelona
    Ratings:
    +4 / 0
    Home Country:
    Spain Spain
    Re: *NEW* IMDb scraper

    Spanish:

    No hay resumen disponible en Español, por favor contribuyan creando uno en imdb.es.

    Inglés: There is no available summary in Spanish, please contribute by making one in imdb.es


    If the sentence is too long:

    Spanish: Sin resumen en Español, puede crearlo en imdb.es.
    Inglés: No summary in Spanish, you may create it in imdb.es.
     
  4. Pilgrimon

    Pilgrimon Portal Pro

    Joined:
    April 15, 2010
    Messages:
    63
    Likes Received:
    1
    Ratings:
    +1 / 0
    Re: *NEW* IMDb scraper

    Formal German:

    Keine deutsche Zusammenfassung gefunden, bitte erstellen Sie eine bei imdb.de.

    Englisch: .....(short English summary will go here).....



    Informal German:

    Keine deutsche Zusammenfassung gefunden, bitte erstelle eine bei imdb.de.

    Englisch: .....(short English summary will go here).....



    Short German:

    Keine deutsche Zusammenfassung, bitte erstellen bei imdb.de.

    Englisch: .....(short English summary will go here).....
     
  5. DaRtHmAuL999

    DaRtHmAuL999 New Member

    Joined:
    April 6, 2010
    Messages:
    1
    Likes Received:
    1
    Ratings:
    +1 / 0
    Re: *NEW* IMDb scraper


    Hi RoChess,

    here are two correct sentences :

    Aucun résumé en français n'existe, veuillez, s'il vous plaît, y contribuer sur IMDB.fr.

    OR

    Aucun résumé en français disponible, vous pouvez apporter votre contribution sur IMDB.fr.

    Bye
     
    • Like Like x 1
  6. boombastic

    boombastic Portal Pro

    Joined:
    November 17, 2007
    Messages:
    86
    Likes Received:
    2
    Ratings:
    +2 / 0
    Re: *NEW* IMDb scraper

    For the Italian one:



    Italian:

    Nessun riassunto disponibile in Italiano, per favore contribuisci scrivendone uno su imdb.it.

    Inglese:There is no available summary in Italian, please contribute by making one in imdb.it


    Short Version:

    Nessun riassunto in Italiano, scrivene uno su imdb.it.

    Inglese: No available summary in Italian, create one in imdb.it



    Thanks for your work and hope it helps.
     
  7. RoChess
    • Premium Supporter

    RoChess Extension Developer

    Joined:
    March 10, 2006
    Messages:
    4,239
    Likes Received:
    1,336
    Ratings:
    +1,745 / 2
    Italian version done, only waiting on German and Portugese testers to get back to me. If anybody else knows those languages and is familiar with IRC let me know please.
     
    • Like Like x 2
  8. RoChess
    • Premium Supporter

    RoChess Extension Developer

    Joined:
    March 10, 2006
    Messages:
    4,239
    Likes Received:
    1,336
    Ratings:
    +1,745 / 2
    You are welcome.

    This should do it:

    Code (Text):
    1.  
    2. <set name="rx_rotten_rating">
    3.     <![CDATA[
    4.     <span class="per[cf]e[nc]t".+?>(\d{1,3}?)</span>
    5.     ]]>
    6. </set>
     
  9. Fragga

    Fragga Portal Member

    Joined:
    January 20, 2010
    Messages:
    24
    Likes Received:
    5
    Ratings:
    +5 / 0
    Home Country:
    Germany Germany
    Is the german version still beeing developed? Because it would be realy cool if you could release it!
    Thx
     
  10. RoChess
    • Premium Supporter

    RoChess Extension Developer

    Joined:
    March 10, 2006
    Messages:
    4,239
    Likes Received:
    1,336
    Ratings:
    +1,745 / 2
    Bump for v1.5.4.

    Enjoy.
     
    • Like Like x 1
  11. Pilgrimon

    Pilgrimon Portal Pro

    Joined:
    April 15, 2010
    Messages:
    63
    Likes Received:
    1
    Ratings:
    +1 / 0
    Hi RoChess,

    thanks for releasing the German and Portugese Scrapers, although you did not get feedback from the testers. However, I downloaded the German Scraper and run a few tests.

    First of all the Scraper works pretty well and I want to thank you for your effort!
    - Movies were correctly identified
    - Movie Name is pulled correctly from Imdb.de
    - Certification is displayed according to German standard
    - German summary is pulled (I tested the long summary and by altering your script the short summary)
    - If no German summary is found, the short summary is displayed in English from imdb.com along with the message to contribute one on Imdb.de

    I also found some small bugs:
    - when no German summary is found, the English is displayed along with two squares before the English description starts --> for example "American Psycho" (Copy/Paste of the text was not working):
    [​IMG]

    - if no German certification is found, either the American one or no certification at all is displayed --> for example "Blind Side" has the US certification displayed and "Up in the Air" or "Shutter Island" have no certification at all


    Hope this helps a little bit and you can maybe solve the bugs. If you have any further questions or need some more testing, I will try to help. But either way you did a great job! :D


    EDIT: Forgot to mention, that the two squares are not visible when browsing through the videos in MovingPictures. So maybe they are just placeholders or something like that.
     
Loading...

Users Viewing Thread (Users: 0, Guests: 0)

  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice
  • About The Project

    The vision of the MediaPortal project is to create a free open source media centre application, which supports all advanced media centre functions, and is accessible to all Windows users.

    In reaching this goal we are working every day to make sure our software is one of the best.

             

  • Support MediaPortal!

    The team works very hard to make sure the community is running the best HTPC-software. We give away MediaPortal for free but hosting and software is not for us.

    Care to support our work with a few bucks? We'd really appreciate it!