Spanish Scraper FilmAffinity.com with IMDb.es bonus to get fanarts -- v2.1.0 (1 Viewer)

peque

Moderator - Spanish Forums
  • Premium Supporter
  • August 4, 2007
    861
    99
    Home Country
    Spain Spain
    Fixed @RoChess. THANKS again.

    If I find any problem I'll provide log from now over. :)
     

    peque

    Moderator - Spanish Forums
  • Premium Supporter
  • August 4, 2007
    861
    99
    Home Country
    Spain Spain
    I found two rather unknown child films which have same behaviour: "send to importer" doesn't retrieve imdb id, when you "refresh movie info", then you get it. Log attached for both films.
     

    Attachments

    • movingpictures.rar
      14.5 KB

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    @peque, one quick thing to notice, is that "La.Estrella.de.Laura.(Spanish.English).DVD-Rip.XviD-AC3.by.mercedes.avi" is turned into "La Estrella de Laura (Spanish English) DVD-Rip - by mercedes" to search for a title. It then finds "La Estrella de Laura", but it can not auto-approve, because there is a 56-char difference in exact title match.

    This is what the noise filter in advanced settings is for, you could for example add |\(.+ at the end it always strip everything from the title after the first parenthesis. Then "La.Estrella.de.Laura.(Spanish.English).DVD-Rip.XviD-AC3.by.mercedes.avi" is turned into "La Estrella de Laura" and an auto-match would occur.

    Of course you might have a filename that then fails, so you would have to provide me with a full list of all your filenames with a parenthesis in it, so I can expand on the regular expression for the noise filter to not generate false positives.

    (will look at the rest of the log file later for why they fail to get the IMDb.com details after you manually approved them)
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    Ok, found problem faster then I thought it would.

    Filmaffinity detail page = http://www.filmaffinity.com/es/film234240.html
    Filmaffinity title = "La estrella de Laura"

    IMDb.com search = http://akas.imdb.com/find?s=tt&q=La%20estrella%20de%20Laura
    IMDb.com search result = no match found for title+year​

    Solution though is to look for Filmaffinity "Título original = Lauras Stern (Laura's Star)"

    IMDb.com search then becomes = http://akas.imdb.com/find?s=tt&q=Lauras+Stern
    First match = right one (though still complicated to match due to additional AKA situation, but at least possible to figure something out)​

    Will have to think about it a bit more to see what the best solution is; as it is also never a good idea to go by one movie result that could cause other results to break.
     

    alamagar

    Portal Pro
    August 6, 2008
    51
    0
    Hi Rochess,
    I am getting a mistaken info for popularity field. Number got is very low and does not correspond to "votos" on filmaffinity. For instance "Diario de un escandalo" gives to me 170 for popularity and correct number is 10467.
    Maybe some change in filmaffinity?

    Thanks in advance.
    Mario
     

    alamagar

    Portal Pro
    August 6, 2008
    51
    0
    Hi RoChess, I just noticed I did not thank you for your last solution. so Thanks now.
    Some change must be happen in filmaffinity because since last days actors field become something like this instead of the name of the actors.
    Any idea?

    |<span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Aubrey+Plaza">
    <span itemprop="name">Aubrey Plaza</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Mark+Duplass">
    <span itemprop="name"> Mark Duplass</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Jake+Johnson">
    <span itemprop="name"> Jake Johnson</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Karen+Soni">
    <span itemprop="name"> Karen Soni</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Mary+Lynn+Rajskub">
    <span itemprop="name"> Mary Lynn Rajskub</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Lauren+Carlos">
    <span itemprop="name"> Lauren Carlos</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Jeff+Garlin">
    <span itemprop="name"> Jeff Garlin</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=David+Schultz">
    <span itemprop="name"> David Schultz</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Kristen+Bell">
    <span itemprop="name"> Kristen Bell</span></a>, </span>
    <span itemprop="actor" itemscope itemtype="http://schema.org/Person">
    <a itemprop='url' href="/es/search.php?stype=cast&sn&stext=Lynn+Shelton">
    <span itemprop="name"> Lynn Shelton</span></a> </span>|
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    Try v2.1.3

    FilmAffinity changed their HTML code again, wrapped a <span> around the actors name that was not there before, but v2.1.3 will compensate for that now.
     

    RoChess

    Extension Developer
  • Premium Supporter
  • March 10, 2006
    4,434
    1,897
    Enjoy.

    I am sure FilmAffinity will change something again soon, so just post again what they broke and I'll do my best to fix it.
     

    Users who are viewing this thread

    Top Bottom