- March 10, 2006
- 4,434
- 1,897
- Thread starter
- Moderator
- #91
Re: IMDb+ Scraper (Force English titles, Auto-Rename titles to group, and more) v3.1.
As explained in the scraper comment above it via:
It was not enough to filter on 'just' the language to find the right movies, this is because they sometimes would release an English language movie in for example Japan (ja), but then use the foreign title. Thanks for posting that though, it made me verify my personal scraper versus the one I upload to first post and I notice that Canada (ca) is missing. You can manually add |ca| to the country_filter settings for now, or wait till I release v3.1.5 later this month (still trying to optimize).
As for the rename dBase, nice catch on the remake, I'll also add some of your other entries. I see that you had to add "City of the Living Dead" to it as well, because it is an English language movie released in Italy with the correct title. Adding |it| to the country_filter would solve that, however it might break other movies. My sample set on Italian movies is too small to verify this, so hopefully you can help me with this. However the AKA title for USA/English is correct, so I wonder why you had to add an entry to force rename, will have to debug that movie.
For the Bachelor Party series, the original part 1 is the version from 1984 with Tom Hanks, you can verify this in the movie connections page at the imdb.com site, so I took the liberty of correcting this for v0.6 of the Rename dBase (check first post). This version also contains the adjustments to have every movie series use Roman Numerals and retain the correct order on 9+ via the SortBy field.
I'll hold off on v3.1.5 of the scraper with the |ca| change, incase you can help me find out if adding |it| makes sense as well. Maybe I should work the other way around and use |ja| as a blacklist, but then I'll probably end up with a large list as well for China, Korea, Taiwan, and other countries that might release an English language movie with the wrong title. So help me test please
What does the following line do? Looks like english speaking nations, but can't find any info on what the line actually does.
<set name="global_options_country_filter" value="us|gb|ie|au|nz" />
Does it do the same as the
<set name="global_options_language_filter" value="no|sv|qae|da|en|us"
But use the country tag instead of language? Do I have to use both, or can I delete one?
As explained in the scraper comment above it via:
Code:
The country filter was added to avoid mistakes on foreign movies in English
It was not enough to filter on 'just' the language to find the right movies, this is because they sometimes would release an English language movie in for example Japan (ja), but then use the foreign title. Thanks for posting that though, it made me verify my personal scraper versus the one I upload to first post and I notice that Canada (ca) is missing. You can manually add |ca| to the country_filter settings for now, or wait till I release v3.1.5 later this month (still trying to optimize).
As for the rename dBase, nice catch on the remake, I'll also add some of your other entries. I see that you had to add "City of the Living Dead" to it as well, because it is an English language movie released in Italy with the correct title. Adding |it| to the country_filter would solve that, however it might break other movies. My sample set on Italian movies is too small to verify this, so hopefully you can help me with this. However the AKA title for USA/English is correct, so I wonder why you had to add an entry to force rename, will have to debug that movie.
For the Bachelor Party series, the original part 1 is the version from 1984 with Tom Hanks, you can verify this in the movie connections page at the imdb.com site, so I took the liberty of correcting this for v0.6 of the Rename dBase (check first post). This version also contains the adjustments to have every movie series use Roman Numerals and retain the correct order on 9+ via the SortBy field.
I'll hold off on v3.1.5 of the scraper with the |ca| change, incase you can help me find out if adding |it| makes sense as well. Maybe I should work the other way around and use |ja| as a blacklist, but then I'll probably end up with a large list as well for China, Korea, Taiwan, and other countries that might release an English language movie with the wrong title. So help me test please