home
products
contribute
download
documentation
forum
Home
Forums
New posts
Search forums
What's new
New posts
All posts
Latest activity
Members
Registered members
Current visitors
Donate
Log in
Register
What's new
Search
Search
Search titles only
By:
New posts
Search forums
Search titles only
By:
Menu
Log in
Register
Navigation
Install the app
Install
More options
Contact us
Close Menu
Forums
MediaPortal 1
MediaPortal 1 Plugins
Popular Plugins
Moving Pictures
IMDb Scraper with RottenTomatoes rating (check end of thread for final versions)
Contact us
RSS
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
<blockquote data-quote="RoChess" data-source="post: 556022" data-attributes="member: 18896"><p><strong>Re: IMDb Scraper with RottenTomatoes rating</strong></p><p></p><p>That's exactly the procedure.</p><p></p><p>So let's take '300' as an example.</p><p></p><ul> <li data-xf-list-type="ul">The existing imdb.com scraper will locate: h**p://www.imdb.com/title/tt0416449/ and get all the information from it. <strong>So please verify when you go into your Movie Details, that '300' has tt0416449 in the imdb_id field.</strong></li> <li data-xf-list-type="ul">At the very end of the imdb scraper, my little bit of code kicks in and uses that number, strips it of the 'tt', so it ends up with 0416449.</li> <li data-xf-list-type="ul">Then it loads up: h**p://www.rottentomatoes.com/alias?type=imdbid&s=0416449</li> <li data-xf-list-type="ul">Which then in turn ends up giving: h**p://www.rottentomatoes.com/m/300/</li> <li data-xf-list-type="ul">And from this page, I extract the Average rating (or the TomatoMeter one with the other version).</li> </ul><p></p><p>Since the entire HTML page from "h**p://www.rottentomatoes.com/m/300/" is obtained, it would not be difficult at all to use any other information as well. For example you might prefer the "Synopsis" from RottenTomatoes (click on [More] first) compared to the 'Summary' that IMDb provides. Or use the "Consensus" for the 'Tagline'.</p><p></p><p>There is one big problem with the reviews, and that is that the full reviews are on a seperate page, adding even more URL requests to be scraped, so I prefer to avoid that (would also make code more complex). However if you scroll down to the bottom, there is a wide range of short review burps. It would be possible to filter out only the real critic reviews (the one who have a photo) and join them together in a text format (don't think MovingPictures supports images inside the 'Summary').</p><p></p><p>Infact it would even be possible to join/merge these RottenTomatoes reviews into the IMDb summary.</p><p></p><p>So you get for example, something like this:</p><p></p><p>[collapse][code]</p><p>RottenTomatoes Reviews (124 total with 106 fresh and 18 rotten):</p><p>----------------------------------------------------------------------------</p><p>Tony Macklin = ROTTEN (02/02/09): 300 is a fun-fest of blood, mayhem, and absurdity. At its best, it is entertaining; when it's not at its best, it's pretty dumb. It's pretty dumb much of the time. </p><p>******************</p><p>Roger Ebert = ROTTEN (08/08/08): 300 has one-dimensional caricatures who talk like professional wrestlers plugging their next feud.</p><p>******************</p><p>Tricia Olszewski = ROTTEN (03/04/08): All this bellowing and testosterone gets old fast -- especially since there's not much of a plot outside of the combat scenes, and the not-much-of-a-plot scenes are laden with dialogue worthy of Anakin and Padmé.</p><p>******************</p><p>Jeff Bayer = FRESH (03/03/08): The look and feel of this historic battle is perfect for the virtual backgrounds and obscene violence. Again, trust me on the violence.</p><p>******************</p><p>Brandon Fibbs = FRESH (02/28/08): 300 is an orgy of gore, a blood-letting on a titanic scale, a ballet of butchery in which half-naked men and the torrents of blood they elicit move in perfect, slow-motion choreography to a thunderous soundtrack. And I loved every minute of it.</p><p>******************</p><p>Luke Y. Thompson = FRESH (01/03/08): I still wish David Wenham weren't the narrator, but everything else about the movie is a brutal kind of hypnotic that keeps me coming back.</p><p>******************</p><p>Brian Webster = ROTTEN (08/05/07): While far from sophisticated in its 'I have filled my heart with hate' messaging, it resonates with the 'us versus them' worldview that's wildly popular in some circles.</p><p>******************</p><p>John J. Puccio = FRESH (07/17/07 ): ...relentless in its attempt to recreate the graphic novel's vision of the carnage at Thermopylae.</p><p>******************</p><p></p><p>IMDb Summary:</p><p>-------------------</p><p>(this is where existing summary of IMDb could go, but can also be placed first)</p><p>[/code][/collapse]</p><p></p><p>PS: If images are possible inside Summary field, then I could replace FRESH and ROTTEN with the same images that RottenTomatoes.com uses.</p><p></p><p>Adding it to the existing Summary like the above example, is something I could make work without any skin modifications, but not sure if you like that idea. And do you prefer IMDb summary, or the RottenTomatoes synopsis. Also what you see is what you get, I've taken '300' as an example, but that's a lousy sampling rate to see if they all have photo reviews. So there is the risk of ending up with an empty addition because no positive match was found. I'm not going to waste much time making this fool proof for every possible combination (at least not at first), so you might want to manually check on RT to see if it would work.</p><p></p><p>And you have to be expecting added delays, because all this extra scraper processing will end up costing CPU cycles.</p><p></p><p>Let me know.</p></blockquote><p></p>
[QUOTE="RoChess, post: 556022, member: 18896"] [b]Re: IMDb Scraper with RottenTomatoes rating[/b] That's exactly the procedure. So let's take '300' as an example. [list] [*]The existing imdb.com scraper will locate: h**p://www.imdb.com/title/tt0416449/ and get all the information from it. [b]So please verify when you go into your Movie Details, that '300' has tt0416449 in the imdb_id field.[/b] [*]At the very end of the imdb scraper, my little bit of code kicks in and uses that number, strips it of the 'tt', so it ends up with 0416449. [*]Then it loads up: h**p://www.rottentomatoes.com/alias?type=imdbid&s=0416449 [*]Which then in turn ends up giving: h**p://www.rottentomatoes.com/m/300/ [*]And from this page, I extract the Average rating (or the TomatoMeter one with the other version). [/list] Since the entire HTML page from "h**p://www.rottentomatoes.com/m/300/" is obtained, it would not be difficult at all to use any other information as well. For example you might prefer the "Synopsis" from RottenTomatoes (click on [More] first) compared to the 'Summary' that IMDb provides. Or use the "Consensus" for the 'Tagline'. There is one big problem with the reviews, and that is that the full reviews are on a seperate page, adding even more URL requests to be scraped, so I prefer to avoid that (would also make code more complex). However if you scroll down to the bottom, there is a wide range of short review burps. It would be possible to filter out only the real critic reviews (the one who have a photo) and join them together in a text format (don't think MovingPictures supports images inside the 'Summary'). Infact it would even be possible to join/merge these RottenTomatoes reviews into the IMDb summary. So you get for example, something like this: [collapse][code] RottenTomatoes Reviews (124 total with 106 fresh and 18 rotten): ---------------------------------------------------------------------------- Tony Macklin = ROTTEN (02/02/09): 300 is a fun-fest of blood, mayhem, and absurdity. At its best, it is entertaining; when it's not at its best, it's pretty dumb. It's pretty dumb much of the time. ****************** Roger Ebert = ROTTEN (08/08/08): 300 has one-dimensional caricatures who talk like professional wrestlers plugging their next feud. ****************** Tricia Olszewski = ROTTEN (03/04/08): All this bellowing and testosterone gets old fast -- especially since there's not much of a plot outside of the combat scenes, and the not-much-of-a-plot scenes are laden with dialogue worthy of Anakin and Padmé. ****************** Jeff Bayer = FRESH (03/03/08): The look and feel of this historic battle is perfect for the virtual backgrounds and obscene violence. Again, trust me on the violence. ****************** Brandon Fibbs = FRESH (02/28/08): 300 is an orgy of gore, a blood-letting on a titanic scale, a ballet of butchery in which half-naked men and the torrents of blood they elicit move in perfect, slow-motion choreography to a thunderous soundtrack. And I loved every minute of it. ****************** Luke Y. Thompson = FRESH (01/03/08): I still wish David Wenham weren't the narrator, but everything else about the movie is a brutal kind of hypnotic that keeps me coming back. ****************** Brian Webster = ROTTEN (08/05/07): While far from sophisticated in its 'I have filled my heart with hate' messaging, it resonates with the 'us versus them' worldview that's wildly popular in some circles. ****************** John J. Puccio = FRESH (07/17/07 ): ...relentless in its attempt to recreate the graphic novel's vision of the carnage at Thermopylae. ****************** IMDb Summary: ------------------- (this is where existing summary of IMDb could go, but can also be placed first) [/code][/collapse] PS: If images are possible inside Summary field, then I could replace FRESH and ROTTEN with the same images that RottenTomatoes.com uses. Adding it to the existing Summary like the above example, is something I could make work without any skin modifications, but not sure if you like that idea. And do you prefer IMDb summary, or the RottenTomatoes synopsis. Also what you see is what you get, I've taken '300' as an example, but that's a lousy sampling rate to see if they all have photo reviews. So there is the risk of ending up with an empty addition because no positive match was found. I'm not going to waste much time making this fool proof for every possible combination (at least not at first), so you might want to manually check on RT to see if it would work. And you have to be expecting added delays, because all this extra scraper processing will end up costing CPU cycles. Let me know. [/QUOTE]
Insert quotes…
Verification
Post reply
Forums
MediaPortal 1
MediaPortal 1 Plugins
Popular Plugins
Moving Pictures
IMDb Scraper with RottenTomatoes rating (check end of thread for final versions)
Contact us
RSS
Top
Bottom