IMDb+ Scraper (Force English title, Auto-Rename titles to group, and more) v3.1.7 (4 Viewers)

Should this be the default imdb scraper?

  • Yes, I do not want to re-import

    Votes: 19 95.0%
  • No, keep this one seperate

    Votes: 0 0.0%
  • Who cares, I got movies to watch

    Votes: 1 5.0%

  • Total voters
    20
  • Poll closed .

SilentException

Retired Team Member
  • Premium Supporter
  • October 27, 2008
    2,617
    1,130
    Rijeka, Croatia
    Home Country
    Croatia Croatia
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Don't forget the Friday the 13th series:

    Code:
    	<rename id="tt0080761" title="Friday the 13th" />
    	<rename id="tt0082418" title="Friday the 13th II: Part 2" />
    	<rename id="tt0083972" title="Friday the 13th III: Part III" />
    	<rename id="tt0087298" title="Friday the 13th IV: The Final Chapter" />
    	<rename id="tt0089173" title="Friday the 13th V: A New Beginning" />
    	<rename id="tt0091080" title="Friday the 13th VI: Jason Lives" />
    	<rename id="tt0095179" title="Friday the 13th VII: The New Blood" />
    	<rename id="tt0097388" title="Friday the 13th VIII: Jason Takes Manhattan" />
    	<rename id="tt0107254" title="Friday the 13th IX: Jason Goes to Hell - The Final Friday" />
    	<rename id="tt0211443" title="Friday the 13th X: Jason X" />
    	<rename id="tt0329101" title="Friday the 13th XI: Freddy vs. Jason" />
    	<rename id="tt0758746" title="Friday the 13th XII: Part 12" />

    Is this going to be sorted correctly in Moving Pictures? Coz I'm pretty sure it's gonna go

    I
    II
    III
    IV
    IX
    V
    VII
    VIII
    X
    XI
    XII

    :)
     

    drealit

    Portal Pro
    March 15, 2008
    190
    17
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Lol I didn't even think of it while I was typing it out... just rolling!

    edit: Rochess a thought, is there anyway for you to modify the scraper so that it's just affecting the sort by field for the new auto renaming function? The reason I ask is because while I understand that it's knocking out 2 birds with one stone (title and sorting), some of the titles that are being produced are going to be fairly ugly from a front end perspective. Specifically long titles and those that I just hate to see butchered (the Bourne series for instance). I'd much rather (my own personal taste) have the original titles still displayed but the sorting be corrected.

    Again just a thought, I already appreciate the current way it is working but I'm finding myself wanting more now that I have this :).
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #73
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Yeah, darn Roman numerals stop sorting correct past 8. That is why I had already used numbers for James Bond and Hellraiser series.

    :D for saving me the work of having to lookup IMDb tt-ID on all those Friday the 13th movies, I added them to a large revision I was working on already, so now it's up to 418 renamed title entries.

    Enjoy.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #74
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Lol I didn't even think of it while I was typing it out... just rolling!

    edit: Rochess a thought, is there anyway for you to modify the scraper so that it's just affecting the sort by field for the new auto renaming function? The reason I ask is because while I understand that it's knocking out 2 birds with one stone (title and sorting), some of the titles that are being produced are going to be fairly ugly from a front end perspective. Specifically long titles and those that I just hate to see butchered (the Bourne series for instance). I'd much rather (my own personal taste) have the original titles still displayed but the sorting be corrected.

    Again just a thought, I already appreciate the current way it is working but I'm finding myself wanting more now that I have this :).

    Yes, it is possible to only make these changes to the SortBy field.

    However, fforde warned me that it should normally not be possible to adjust SortBy field in details node, so I'm not sure if this function will stop working later. I'm currently already editing the SortyBy field anyway by clearing it out, this was needed to overcome a side-effect of the SortBy field relying on the search node and this was interfering with my title changes in details node.

    I'm tagging along with wife in a moment on some art-walk (women right :mad:), so check back tomorrow or later for an updated scraper script version that will include this option. I'm going to see if it is possible for me to use:

    <rename id="tt...." title="Title Adjustment" sortby="SortBy Adjustment" />​

    That way you can use any combination possible. In order to save typing, I'm going to allow empty fields to ignore the modification.

    • <rename id="tt0258463" title="The Bourne I: The Bourne Identity" sortby="" /> (only adjust title, SortBy field will be done automatic, respecting the article removal settings, so result for SortBy field will become "bourne I: the bourne identity the")
    • <rename id="tt0258463" title="" sortby="Bourne 1" /> (only adjust SortBy field, so it will be sorted under 'B', and title will be used as-is, in this case "The Bourne Identity")
    • <rename id="tt0381061" title="007: Casino Royale" sortby="James Bond 21" /> (adjust both title and SortBy with different values, to keep title shorter, but still grouped)

    But I'll have to do a lot more testing to make sure this won't break anything.
     

    Furetto

    Moderator - Dutch Forums
    April 11, 2005
    664
    61
    50
    Brussels
    Home Country
    Belgium Belgium
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    I rescanned my whole collection. Just before going to bed (reason why I do not provide more details), here is a small list from my collection

    - something is up with Art of War II if you have both foreign_title and rename_titles. Guess the source is on IMDB side. Did not investigate further. It show "Art of War 2 (The Art of War II: Betrayal)" on my setup.
    - You might want to add
    Code:
    	<rename id="tt1028528" title="Grindhouse I: Death Proof" />
    	<rename id="tt1077258" title="Grindhouse II: Planet Terror" />
    - I got one movie from the Halloween series, no clue how the series will behave
    - I think there is an error in the Harry Potters, 6 should be the Half-Blood Prince
    - I believe Resident Evil 2 and 3 are swapped
    - Return to the blue lagoon (tt0102782) might be marked as Blue Lagoon II (please, don't tell anyone I mentioned this :D)
    - Maybe you should consider the Millenium series, though it could become a mess in combination with the swedish names.

    Edit: instead of The Bourn, why not "Jason Bourne x:"... or "the bourne trilogy x:"
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #76
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    - something is up with Art of War II if you have both foreign_title and rename_titles. Guess the source is on IMDB side. Did not investigate further. It show "Art of War 2 (The Art of War II: Betrayal)" on my setup.

    Weird, that would mean it is not detecting the English title properly, because it should detect the Canadian country with English language and use "The Art of War II: Betrayal" title, instead it ends up skipping that, look at AKA titles and use the first (English title) it encounters, which is the Japan entry with "Art of War 2". I did not encounter this myself, so might be that it is a combination side effect of the other settings you have (or your location and how imdb.com adjusts itself).

    - You might want to add (Gringhouse I/II)

    Will do :)

    - I got one movie from the Halloween series, no clue how the series will behave
    - I think there is an error in the Harry Potters, 6 should be the Half-Blood Prince
    - I believe Resident Evil 2 and 3 are swapped
    - Return to the blue lagoon (tt0102782) might be marked as Blue Lagoon II (please, don't tell anyone I mentioned this :D)
    - Maybe you should consider the Millenium series, though it could become a mess in combination with the swedish names.

    Edit: instead of The Bourn, why not "Jason Bourne x:"... or "the bourne trilogy x:"

    Nice catch on the Harry Potter one, I totally miscounted that one. Resident Evil I caught already and had already uploaded a revised version. This also includes Halloween series, old and new, I decided to split them up so I could keep using Roman numerals for old one and keep the titles shorter.

    Will add the other 2 as well :)

    As for title names, I keep them as close to the original title as possible, so that locating them is easier on expected sort order via SMS input and expected order in list ('B' for The Bourne .... movies). But you are free to edit the XML file for your own usage of course, the XML as provided is more a default template.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #77
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Millenium series is a tough one. With foreign title showing you already get a long title such as "The Girl with the Dragon Tattoo (Män som hatar kvinnor)".

    Some options are:

    1.

    Millenium I: The Girl with the Dragon Tattoo (Män som hatar kvinnor)
    Millenium II: The Girl Who Played with Fire (Flickan som lekte med elden)
    Millenium III: The Girl Who Kicked the Hornet's Nest (Luftslottet som sprängdes)

    2.

    Millenium I: Dragon Tattoo (Män som hatar kvinnor)
    Millenium II: Played with Fire (Flickan som lekte med elden)
    Millenium III: Kicked the Hornet's Nest (Luftslottet som sprängdes)

    3.

    The Girl with the Dragon Tattoo I (Män som hatar kvinnor)
    The Girl with the Dragon Tattoo II: The Girl Who Played with Fire (Flickan som lekte med elden)
    The Girl with the Dragon Tattoo III: The Girl Who Kicked the Hornet's Nest (Luftslottet som sprängdes)

    4.

    The Girl I: with the Dragon Tattoo (Män som hatar kvinnor)
    The Girl II: Who Played with Fire (Flickan som lekte med elden)
    The Girl III: Who Kicked the Hornet's Nest (Luftslottet som sprängdes)

    My own preference would normally go towards option #4, as it is closest to the original titles (keeping it sorted under 'G' with default article removal settings), without making it extremly long and keeping the correct sort order.

    But either way it is going to get ugly when you mix in the English remakes, starting with: The Girl with the Dragon Tattoo (2011) - IMDb

    To keep each group together, and in line with how I solved it for others, I would go with the following then:

    The Girl (Remake) I: with the Dragon Tattoo
    The Girl (Remake) II: Who Played with Fire
    The Girl (Remake) III: Who Kicked the Hornet's Nest

    Let me know what you think, of course everybody can edit the XML file to use their own preference, but it would be nice if the default one can be the one that most users would expect to see.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #78
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Ok, v3.1.4 now has SortBy adjustment support.

    I also updated the "Rename dBase IMDb+ Scraper.xml" file, which now has 429 entries. Please keep in mind that if you have a slow computer, say an Atom based HTPC that this rename system can add a delay. In that case you are wise to remove any entries that are not part of your collection or simply not use the rename system at all. Test it as-is first otherwise and see if the delay bothers you.

    On this slow workstation I noticed a small delay per movie, but on my dual-core HTPC it was barely noticable (except for a much larger Scraper-Debug-Enabled log file :D

    drealit, for you it will be easy, open up the rename XML file in notepad, use CTRL+H and replace 'title=' with 'sortby=', this will then keep the title the way IMDb.com site has it, but will adjust the way the movies are sorted and grouped based on the sortby field. Just keep in mind that for some movies this will give very weird results, such as the 'Casino Royale' example getting sorted under letter 'J' from adjusted "James Bond 21" sortby title. As explained in the comments, you could always adjust both title and sortby in those cases.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    • Thread starter
    • Moderator
    • #80
    Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.

    Art of War 2 solved itself after a rescan :)

    You missed Star Trek from 2009: Star Trek (2009) - IMDb
    Star Trek - Wikipedia, the free encyclopedia

    Glad to see it solved itself, kinda curious what happened though.

    And darnit, I went over that list so many times I was seeing titles multiply by themselves, but I overlooked that one still ;)

    Code:
    	<rename id="tt0796366" title="Star Trek 11: Star Trek" />
    	<rename id="tt1408101" title="Star Trek 12: (Not Yet Released)" />

    I'll hold back updating first post with new revision, incase you find more. And for those worried on all the "(Not Yet Released)" entries; they will be updated in future revisions before those movies ever hit your collection :D
     

    Users who are viewing this thread

    Top Bottom