Small Plot IMDB Scraper!? (1 Viewer)

rasmuskarlsen

Portal Pro
October 9, 2008
235
12
Home Country
Denmark Denmark
Hey where can i get a small plot IMDB scraper that is working!?

No one!?? :eek:

Thee was one a while ago, but can't get it to workanymore..

I'm tired of all my movies are downloaded with a very large plot... It's impossible to read all that, just to see what the movie is about!!
 

ysmp

Design Group
  • Team MediaPortal
  • May 17, 2008
    1,860
    835
    Seoul.
    Home Country
    South Korea South Korea
    Hey where can i get a small plot IMDB scraper that is working!?

    No one!?? :eek:

    Thee was one a while ago, but can't get it to workanymore..

    I'm tired of all my movies are downloaded with a very large plot... It's impossible to read all that, just to see what the movie is about!!

    I thing i read on google code for shurt plot or long plot will be in the new version 0.8 that will come out soon ....
     

    rasmuskarlsen

    Portal Pro
    October 9, 2008
    235
    12
    Home Country
    Denmark Denmark
    Hey where can i get a small plot IMDB scraper that is working!?

    No one!?? :eek:

    Thee was one a while ago, but can't get it to workanymore..

    I'm tired of all my movies are downloaded with a very large plot... It's impossible to read all that, just to see what the movie is about!!

    I thing i read on google code for shurt plot or long plot will be in the new version 0.8 that will come out soon ....

    As far as i can see it's not going to be included in v. 0.8.0...
     

    ysmp

    Design Group
  • Team MediaPortal
  • May 17, 2008
    1,860
    835
    Seoul.
    Home Country
    South Korea South Korea
    rasmuskarlsen;535498 As far as i can see it's not going to be included in v. 0.8.0...[/QUOTE said:
    sorry it will be in v 0.9 look in screenshot # 318
     

    mortstar

    MP Donator
  • Premium Supporter
  • January 30, 2008
    414
    41
    Home Country
    England England
    If you look at the current imdb scraper script you will see this section:

    Code:
          <!-- Plot Summary -->
          <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
          <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
          <set name="summary_clean" value="${summary[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" />
    
          <!-- Plot Summary (if first method fails) -->
          <if test="${movie.summary}=">
            <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
            <set name="summary_clean" value="${summary2[0][0]:striptags}" />
            <set name="movie.summary" value="${summary_clean:htmldecode}" />
          </if>
    What is happening here is that the scraper is looking for the long plot from the /plotsummary page at imdb. If this method fails to retrieve the (long) plot it falls back to secondary method, which is to grab the (short) plot from the film's main imdb page.

    To change this behaviour, you could edit the scraper xml and reimport it into MovingPictures. You could change the code above to this:

    Code:
          <!-- Do not retrieve long plot
          <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
          <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
          <set name="summary_clean" value="${summary[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" /> -->
    
          <!-- Plot Summary (Retrieve short plot) -->
          <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
          <set name="summary_clean" value="${summary2[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" />
          </if>

    This should now pick up sort plot summary for you.
     

    ysmp

    Design Group
  • Team MediaPortal
  • May 17, 2008
    1,860
    835
    Seoul.
    Home Country
    South Korea South Korea
    If you look at the current imdb scraper script you will see this section:

    Code:
          <!-- Plot Summary -->
          <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
          <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
          <set name="summary_clean" value="${summary[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" />
    
          <!-- Plot Summary (if first method fails) -->
          <if test="${movie.summary}=">
            <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
            <set name="summary_clean" value="${summary2[0][0]:striptags}" />
            <set name="movie.summary" value="${summary_clean:htmldecode}" />
          </if>
    What is happening here is that the scraper is looking for the long plot from the /plotsummary page at imdb. If this method fails to retrieve the (long) plot it falls back to secondary method, which is to grab the (short) plot from the film's main imdb page.

    To change this behaviour, you could edit the scraper xml and reimport it into MovingPictures. You could change the code above to this:

    Code:
          <!-- Do not retrieve long plot
          <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
          <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
          <set name="summary_clean" value="${summary[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" /> -->
    
          <!-- Plot Summary (Retrieve short plot) -->
          <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
          <set name="summary_clean" value="${summary2[0][0]:striptags}" />
          <set name="movie.summary" value="${summary_clean:htmldecode}" />
          </if>

    This should now pick up sort plot summary for you.

    hi mortstar ! but wher is the moving picture imdb scraper file ? i canot find it ?
     

    rasmuskarlsen

    Portal Pro
    October 9, 2008
    235
    12
    Home Country
    Denmark Denmark
    --> ysmp

    IMDB Scraper...

    IMDb.xml - moving-pictures - Project Hosting on Google Code

    It's working, if you remove the: </If> , at the bottom:

    <!-- Do not retrieve long plot
    <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
    <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
    <set name="summary_clean" value="${summary[0][0]:striptags}" />
    <set name="movie.summary" value="${summary_clean:htmldecode}" /> -->

    <!-- Plot Summary (Retrieve short plot) -->
    <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
    <set name="summary_clean" value="${summary2[0][0]:striptags}" />
    <set name="movie.summary" value="${summary_clean:htmldecode}" />

    But instead of "commenting out" the long plot. Can't you reverse it, so if no short plot is found, then the long plot is used instead!? Have tried playing with the code, but can't make it work :/

    And thanks for the help, until now :)
     

    mortstar

    MP Donator
  • Premium Supporter
  • January 30, 2008
    414
    41
    Home Country
    England England
    --> ysmp

    IMDB Scraper...

    IMDb.xml - moving-pictures - Project Hosting on Google Code

    It's working, if you remove the: </If> , at the bottom:

    <!-- Do not retrieve long plot
    <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
    <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
    <set name="summary_clean" value="${summary[0][0]:striptags}" />
    <set name="movie.summary" value="${summary_clean:htmldecode}" /> -->

    <!-- Plot Summary (Retrieve short plot) -->
    <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
    <set name="summary_clean" value="${summary2[0][0]:striptags}" />
    <set name="movie.summary" value="${summary_clean:htmldecode}" />

    But instead of "commenting out" the long plot. Can't you reverse it, so if no short plot is found, then the long plot is used instead!? Have tried playing with the code, but can't make it work :/

    And thanks for the help, until now :)

    You are right, the </if> tag does need removing from my first post. To Reverse the logic you need something like:

    Code:
          <!-- Short Plot Summary -->
            <parse name="summary2" input="${details_page}" regex="${rx_plot2}"/>
            <set name="summary_clean" value="${summary2[0][0]:striptags}" />
            <set name="movie.summary" value="${summary_clean:htmldecode}" />
    
          <!-- Long Plot Summary (if first method fails) -->
          <if test="${movie.summary}=">
             <retrieve name='summary_page' url='http://www.imdb.com/title/${movie.site_id}/plotsummary'/>
             <parse name="summary" input="${summary_page}" regex="${rx_plot}"/>
             <set name="summary_clean" value="${summary[0][0]:striptags}" />
             <set name="movie.summary" value="${summary_clean:htmldecode}" />
          </if>
     

    Users who are viewing this thread

    Similar threads

    Thankyou. It is working now.
    Thankyou. It is working now.
    Trying to scan Cinderella (2021), tt10155932. When I get the IMDB results, the last digit is missing. So it is named "unknown"...
    Replies
    2
    Views
    2K
    MP1 MP2 NAS DE
    And just found out that "if" the NAS has turned off, MePo sends the WOL when i again try to access the shares.. Love that feature :)
    And just found out that "if" the NAS has turned off, MePo sends the WOL when i again try to access the shares.. Love that feature :)
    Small question. Since MEPO 2 apprently is dead, and i have to switch back. Is there a way to ensure that the NAS will be started...
    Replies
    3
    Views
    682
    I'm using all related TV skin files provided and current, with no change since our last discussion over the skin issue. If you do not have problems with using it, then I must conclude it's not a skin problem, it's a deeper problem. and seems to be somewhat repeatable. It might be related to the combination of my network tuners...
    I'm using all related TV skin files provided and current, with no change since our last discussion over the skin issue. If you do...
    I'm having a small issue. (with MP 1.7 final (and also 1.8 pre-test)) - In Basic Home and various modules, when I have live (OTA)...
    Replies
    6
    Views
    1K
    Check this web page for the informations about he plugin. To fix the issue, I recommend to uninstal and install the plugin again (if you need it).
    Check this web page for the informations about he plugin. To fix the issue, I recommend to uninstal and install the plugin again...
    I've just installed MP 1.31 with Streamed skin, IMDB+ plugin, Moving Pictures and MP-TV Series. First time starting up MP to...
    Replies
    5
    Views
    2K
    maybe setting loglevel to debug might pinpoint the difference between normal and stuck? Preferable with as much disabled as possible also try to see if anything is happening during stuck for the mediaportal process(resource monitor, disk activity, cpu usage, anything) maybe other programs competing for resouces?
    maybe setting loglevel to debug might pinpoint the difference between normal and stuck? Preferable with as much disabled as...
    Since over a year my MP sometimes stucks while showing the welcome screen. It started once a week now it happens every second time...
    Replies
    4
    Views
    1K
    Top Bottom