IMDb+ Scraper (Force English title, Auto-Rename titles to group, and more) v3.1.7 (1 Viewer)

Should this be the default imdb scraper?

  • Yes, I do not want to re-import

    Votes: 19 95.0%
  • No, keep this one seperate

    Votes: 0 0.0%
  • Who cares, I got movies to watch

    Votes: 1 5.0%

  • Total voters
    20
  • Poll closed .

RoChess

Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #61
    Re: IMDb+ Scraper (short/long summary, imdb/RT score, US/UK rating, and more)

    Version 3.1.2 released (please use first post to obtain it).

    I would have waited longer, but I stumbled on a rare yet important problem with English movies that got issued a new USA title. It's hard to tell how many movies this affects on imdb.com, but I didn't wanted to wait too long giving you this new version as I also made a few other improvements.

    After further investigation, it seems you only have to exclude folder "C:\ProgramData\Team MediaPortal\MediaPortal\log" in Microsoft Security Essentials to prevent any delays during import. This most likely relates to the scraper debug mode that I use a lot, but figured I point that out incase somebody else uses it.
     

    Knives03

    Portal Pro
    March 6, 2009
    103
    2
    Re: IMDb+ Scraper (short/long summary, imdb/RT score, US/UK rating, and more)

    RoChess, I wanted to thank you for all the time you've spent on putting together a very functional and versatile scraper for all of us in your spare time. I waited until the scraper had been originally updated a while back before even importing movies because of how important it is to me to have everything just how i like it.

    I am going to try doing a full re-import of my movies from scratch this weekend and will report back if there is anything that I find to be odd. The only problem that I noticed was that sometimes the score wasn't being imported, but that was several versions ago so I will let you know about that after trying 3.1.2.

    Thanks again for working on this for us, and hopefully sometime soon this will become the default scraper for MovingPictures as it is by far the best one available in my opinion :)
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #63
    Re: IMDb+ Scraper (short/long summary, imdb/RT score, US/UK rating, and more)

    RoChess, I wanted to thank you for all the time you've spent on putting together a very functional and versatile scraper for all of us in your spare time. I waited until the scraper had been originally updated a while back before even importing movies because of how important it is to me to have everything just how i like it.

    Thank you :D

    Same for me, I used to do a lot of manual editing to fix titles afterwards and then when IMDb came out with their lovely original title system it drove me bonkers that on a refresh I would loose all my manual edits. The IMDb+ scraper is now at the point where the only renames I have to do relate to my own groups, for example I renamed "Casino Royale" into "James Bond 21: Casino Royale", so that it is grouped with all the others. Eventhough I have a 'James Bond' custom category amongst many others, I still like all the titles like that, so that they are all sorted in correct order (I could sort on year, but I hate extra work inside GUI).

    I'm actually still working on a way to make this scraper do that fully-automatic (via option of course as not everybody might appreciate their titles like that). But it is very hard to do that as IMDb has very conflicting data.

    I am going to try doing a full re-import of my movies from scratch this weekend and will report back if there is anything that I find to be odd. The only problem that I noticed was that sometimes the score wasn't being imported, but that was several versions ago so I will let you know about that after trying 3.1.2.

    It should all work, especially on v3.1.2+, but please let me know when it fails, I'm sure there are some bugs left to squash.

    I'm actually close to do a full re-import myself. I just have a lot of other things to keep in mind, such as those group title edits, watched flags (plan to synch with Follw.it), and I need to have the MovPic rename system tackle all my media that do not yet have the IMDb tt-ID added to their filename.

    Thanks again for working on this for us, and hopefully sometime soon this will become the default scraper for MovingPictures as it is by far the best one available in my opinion :)

    You are welcome.

    The current course for the MovingPictures plugin will be to use themoviedb as primary scraper to solve the English title problem. For me personally that's not going to work, which is why I will keep working on this IMDb+ scraper. What remains is the problem that if this scraper finds an original title during search-node, it replaces that with the English titles during the details-node and this is considered confusing for first-time users by the developers. I can see where they are coming from, so I'm fine with keeping the scraper on the forum. Perhaps after Google Issue #319 gets implemented, it will make more sense to include this scraper by default, I guess time will tell.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #64
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    The scraper has finally reached the functionality that I've strived for on my own collection.

    v3.1.3 (available on first post) adds support for title manipulation to indicate special editions (3D, Unrated, Extended, etc). So in the event that you have multiple versions of the same movie, you can quickly see the differences now. To make this work, your filenames have to contain this text between brackets as well as the IMDb tt-ID number (the latter can also be made available via NFO).

    Example: "Salt (Director's Cut) (tt0944835).mkv" will be imported as "Salt (Director's Cut)" if you enable the option for this.

    The current supported edition strings are "3D", "3D Edition", "Director's Cut", "Director's Cut Edition", "Extended", "Extended Edition", Unrated", and "Unrated Edition". The "Theatrical" string was left out on purpose as this would normally be the IMDb.com entry itself.

    The big enhancement however is the new auto-rename system that will allow you to retain any manual title changes after a refresh or re-import of your collection. To use it, you have to also download the "Rename dBase IMDb+ Scraper.xml" file and place it in your 'C:\' root folder, as that was the easiest common location to use. You can relocate the file, but then you have to edit the scraper to point to that new location.

    Please edit the rename dBase to your liking if you do not agree with the default entries, and share any series that I overlooked by replying to this thead.

    As long as IMDb.com and/or RottenTomatoes websites don't make any changes to their website, then this should be the last version of the scraper. Provided of course none of you find any bugs that I overlooked :D

    Enjoy.
     

    MeMeMe

    Portal Pro
    August 4, 2010
    71
    2
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    "I need to have the MovPic rename system tackle all my media that do not yet have the IMDb tt-ID added to their filename."

    I have found the advanced setting entry I need to alter, but I don't know what code I need to enter to rename my files to include the imdb id.
    The entry currently reads: ${movie.title} (${movie.year})
    I'm guessing I should alter it to something like:
    ${movie.title} (${movie.year}) [${movie.IMDb ID}]
    Is that right?
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #66
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    "I need to have the MovPic rename system tackle all my media that do not yet have the IMDb tt-ID added to their filename."

    I have found the advanced setting entry I need to alter, but I don't know what code I need to enter to rename my files to include the imdb id.
    The entry currently reads: ${movie.title} (${movie.year})
    I'm guessing I should alter it to something like:
    ${movie.title} (${movie.year}) [${movie.IMDb ID}]
    Is that right?

    That would be ${movie.title} (${movie.year}) [${movie.imdb_id}] then.

    Other fields that can be useful to some users rename wishes are:

    • ${movie.certification} (PG-13, R, etc)
    • ${movie.runtime} (length of movie in minutes)

    There are other values available as well, but those are in array form, for example Actor information would be available as "|Actor 1|Actor 2|etc|" and this would cause a problem due to the '|' char on filenames.
     

    SilentException

    Retired Team Member
  • Premium Supporter
  • October 27, 2008
    2,617
    1,130
    Rijeka, Croatia
    Home Country
    Croatia Croatia
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    RoChess, awesome scrapper. But I'm still to test it. Will probably re-import everything :)

    Will probably do some changes to special edition regex too, since my movie names follow "scene" convention. Is there a limit on the regex complexity? :)

    Also, is "Director's Cut" caught? I presumed \s would be better than space but maybe it doesn't matter - it depends on the regex engine used - my test tool didn't match it.

    One more question: do I need imdb ID in filenames for everything to work? Or is it optional?
     

    Furetto

    Moderator - Dutch Forums
    April 11, 2005
    664
    61
    51
    Brussels
    Home Country
    Belgium Belgium
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    and share any series that I overlooked by replying to this thead.

    This is Groovalicious !!!

    Austin Powers is missing :)
    Karate Kid series...
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #69
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Will probably do some changes to special edition regex too, since my movie names follow "scene" convention. Is there a limit on the regex complexity? :)

    Also, is "Director's Cut" caught? I presumed \s would be better than space but maybe it doesn't matter - it depends on the regex engine used - my test tool didn't match it.

    One more question: do I need imdb ID in filenames for everything to work? Or is it optional?

    Your test tool probably doesn't ignore whitespace then, but the default parsing settings that MovPic uses does. Normally I would use \s myself as well, but in this case it allows the complex code to be a little bit easier to read for those that wish to edit it. And yes, you can go as complex as you want, if you look at the rest of the scraper script, you can see that I had to resort to some borderline crazy methods to find a workaround for the English title problem at imdb.

    And yes for the special editions to work, you need to also provide the IMDb tt-ID during the search node process. This means it has to be inside an NFO file (with simple NFO scanning enabled), or part of the filename. You could also provide it manually during import (that binocular icon), but that would kinda defeat the purpose. The reason for this is because I abuse the AKA field to transfer the filename details from search node to details node. I can only reliable do this when I already know the AKA field is not going to be used to decide if the title found can be auto-approved. This is why it only works when the IMDb tt-ID is provided, otherwise you might end up having to manually approve a lot of movies.

    I probably should have had another look at the noise filter regex we build together, as we put most scene convention possabilities in there, but this is like the beta implementation, so I wanted to keep it as simple as possible first. By enforcing brackets around the special edition string, I knew I would avoid any false positives. But whatever you come up with, please share and I'll incorporate it into the next version, so that the scraper will work easier for more users out of the box.

    This is Groovalicious !!!

    Austin Powers is missing :)
    Karate Kid series...

    Yikes, can't believe I overlooked those, as I have both series myself, thanks for catching those :)

    Austin Powers I will show as follows:

    • Austin Powers I: International Man of Mystery
    • Austin Powers II: The Spy Who Shagged Me
    • Austin Powers III: Goldmember
    • Austin Powers IV: (Not Yet Released)

    As for The Karate kid series, I'll use the working title to seperate the 2010 remake as "The Kung Fu Kid", which production wise is the 5th movie in the series.

    • The Karate Kid I
    • The Karate Kid II
    • The Karate Kid III
    • The Karate Kid IV: The Next Karate Kid
    • The Karate Kid V: The Kung Fu Kid

    If you want to add them yourself right now, add the following to the XML file:

    Code:
    	<rename id="tt0118655" title="Austin Powers I: International Man of Mystery" />
    	<rename id="tt0145660" title="Austin Powers II: The Spy Who Shagged Me" />
    	<rename id="tt0295178" title="Austin Powers III: Goldmember" />
    	<rename id="tt1218992" title="Austin Powers IV: (Not Yet Released)" />
    	<rename id="tt0087538" title="The Karate Kid I" />
    	<rename id="tt0091326" title="The Karate Kid II" />
    	<rename id="tt0097647" title="The Karate Kid III" />
    	<rename id="tt0110657" title="The Karate Kid IV: The Next Karate Kid" />
    	<rename id="tt1155076" title="The Karate Kid V: The Kung Fu Kid" />

    Or wait till I've finished all the other edits I wanted to do and upload a new XML file to this thread.
     

    drealit

    Portal Pro
    March 15, 2008
    190
    17
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Don't forget the Friday the 13th series:

    Code:
    	<rename id="tt0080761" title="Friday the 13th" />
    	<rename id="tt0082418" title="Friday the 13th 02: Part 2" />
    	<rename id="tt0083972" title="Friday the 13th 03: Part III" />
    	<rename id="tt0087298" title="Friday the 13th 04: The Final Chapter" />
    	<rename id="tt0089173" title="Friday the 13th 05: A New Beginning" />
    	<rename id="tt0091080" title="Friday the 13th 06: Jason Lives" />
    	<rename id="tt0095179" title="Friday the 13th 07: The New Blood" />
    	<rename id="tt0097388" title="Friday the 13th 08: Jason Takes Manhattan" />
    	<rename id="tt0107254" title="Friday the 13th 09: Jason Goes to Hell - The Final Friday" />
    	<rename id="tt0211443" title="Friday the 13th 10: Jason X" />
    	<rename id="tt0329101" title="Friday the 13th 11: Freddy vs. Jason" />
    	<rename id="tt0758746" title="Friday the 13th 12: Part 12" />
     

    Users who are viewing this thread

    Top Bottom