IMDb Scrapping Issues?

Discussion in 'Moving Pictures' started by fforde, September 19, 2008.

  1. fforde

    fforde Community Plugin Dev

    Joined:
    June 7, 2007
    Messages:
    2,666
    Likes Received:
    1,690
    Occupation:
    Software Engineer
    Location:
    Texas
    Ratings:
    +1,696 / 0
    Home Country:
    United States of America United States of America
    Reporting Scraper Issues
    If you are having issues with IMDb scraping, please post details here. To properly debug issues I need:

    1) The title of the movie you are having problems with.
    2) The IMDb ID of the movie you are having problems with. This will be listed on the Movie Manager panel of the configuration screen. Please look here for the IMDb ID associated with the movie you are having trouble with, do not make any assumptions.
    3) The specific field or fields you are having trouble with.
    4) What you are getting, and what you expected.



    Known Issues
    • Runtime seems to only be matching about 50% of the time.
    • Some TV Series are sometimes returned in search results.
     
  2. Google AdSense Guest Advertisement



    to hide all adverts.
  3. Bingle

    Bingle Portal Member

    Joined:
    August 8, 2006
    Messages:
    16
    Likes Received:
    0
    Ratings:
    +0 / 0
    Hi,

    This is a minor issue with the scraping, but on one title I seem to have gotten HTML mixed in with the plain text description of the movie.

    1) Cite des Enfants Perdue, La (City of Lost Children)
    2) tt0112682
    3) Summary field
    4) Result: Krank (<a href="/name/nm0256399/">Daniel Emilfork</a>), who cannot dream, kidnaps young children to steal their dreams. One (<a href="/name/nm0000579/">Ron Perlman</a>), a former whale hunter who is as strong as a horse, sets forth to search for Denree, his little brother who was kidnapped by Krank's men. Helped by young Miette (<a href="/name/nm0900136/">Judith Vittet</a>), he soon arrives in La Cite des Enfants Perdus (The City of Lost Children).

    Expected result:
    Krank (Daniel Emilfork), who cannot dream, kidnaps young children to steal their dreams. One (Ron Perlman), a former whale hunter who is as strong as a horse, sets forth to search for Denree, his little brother who was kidnapped by Krank's men. Helped by young Miette (Judith Vittet), he soon arrives in La Cite des Enfants Perdus (The City of Lost Children).

    This may be the fault of the summary writer, who peppered the summary with links to the various actors' pages - still, I can't imagine that this is the only movie with that problem!
     
  4. Bobb25

    Bobb25 Portal Pro

    Joined:
    November 26, 2006
    Messages:
    232
    Likes Received:
    0
    Location:
    Durban
    Ratings:
    +0 / 0
    Home Country:
    South Africa South Africa
    Ok, whenever i scan for 300, the only option i get is "Hot Fuzz" :/ is there any way i can FORCE it to go to an IMDB code (tt0416449)?
     
  5. lillis_55

    lillis_55 Portal Member

    Joined:
    November 20, 2006
    Messages:
    24
    Likes Received:
    1
    Ratings:
    +1 / 0
    Home Country:
    Sweden Sweden
    You can add the imdb number to an .nfo file, and put it in the same folder. Dont know if there's an eaiser way but it works =)
     
  6. fforde

    fforde Community Plugin Dev

    Joined:
    June 7, 2007
    Messages:
    2,666
    Likes Received:
    1,690
    Occupation:
    Software Engineer
    Location:
    Texas
    Ratings:
    +1,696 / 0
    Home Country:
    United States of America United States of America
    In the GUI there is currently no way to search by IMDb ID, although that will be coming soon. You could do what lillis_55 suggested, however I was able to add 300 fine on my setup. You might have a problem with the search string. In the importer just click the movie and the click rescan with custom search string. Your filename is probably just not getting parsed correctly. As a side note if you post your filename, we will see if we can improve the scanner to pickup whatever formating it uses and is having trouble with.
     
  7. Bobb25

    Bobb25 Portal Pro

    Joined:
    November 26, 2006
    Messages:
    232
    Likes Received:
    0
    Location:
    Durban
    Ratings:
    +0 / 0
    Home Country:
    South Africa South Africa
    Thanks fforde. It was named "300.mkv"... now for some reason whatever i do it just adds it to the db as Hot Fuzz. I have tried numerous file names "300 (2006).mkv, The 300.mkv etc etc with no joy. Not a big problem, just a little irritating :) Also. the nfo names as imdb tt0416449.nfo didnt seem to work.
     
  8. lillis_55

    lillis_55 Portal Member

    Joined:
    November 20, 2006
    Messages:
    24
    Likes Received:
    1
    Ratings:
    +1 / 0
    Home Country:
    Sweden Sweden
    Think you need to have the imdb number in the .nfo file. And name the file whatever you want =)
     
  9. Bobb25

    Bobb25 Portal Pro

    Joined:
    November 26, 2006
    Messages:
    232
    Likes Received:
    0
    Location:
    Durban
    Ratings:
    +0 / 0
    Home Country:
    South Africa South Africa
    Thanks Lillis.... that did the trick! For some reason it appears the NFO was showing the incorrect IMDB number... now its 100% :D
     
  10. Boiler

    Boiler Portal Pro

    Joined:
    July 29, 2007
    Messages:
    160
    Likes Received:
    7
    Ratings:
    +7 / 0
    Home Country:
    Switzerland Switzerland
    I can't add the movie House of 9.

    filename: House of 9.iso
    imdb-name: House of 9
    imdb-id: tt0395585


    also, something else i noticed is, that when i make an imdb import via ant movie and compare this to the moving pcitures import, a lot more covers seem to be imported via ant. for instance, 10'000 B.C. imports no covers via moving pictures, while it works just fine via ant.
     
  11. fforde

    fforde Community Plugin Dev

    Joined:
    June 7, 2007
    Messages:
    2,666
    Likes Received:
    1,690
    Occupation:
    Software Engineer
    Location:
    Texas
    Ratings:
    +1,696 / 0
    Home Country:
    United States of America United States of America
    Yep, the posters importer still needs some work, this was mentioned in the post announcing the new release. We are working on improvements for the next release.

    We will look at that movie you listed though, it should be matching.
     
Loading...

Users Viewing Thread (Users: 0, Guests: 0)

  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice
  • About The Project

    The vision of the MediaPortal project is to create a free open source media centre application, which supports all advanced media centre functions, and is accessible to all Windows users.

    In reaching this goal we are working every day to make sure our software is one of the best.

             

  • Support MediaPortal!

    The team works very hard to make sure the community is running the best HTPC-software. We give away MediaPortal for free but hosting and software is not for us.

    Care to support our work with a few bucks? We'd really appreciate it!