Automated Artist / Album Scraping

Status
Not open for further replies.

jameson_uk

Retired Team Member
  • Premium Supporter
  • January 27, 2005
    7,258
    2,528
    Birmingham
    Home Country
    United Kingdom
    OK, v8 is in the first post. I guess this is nearly as far as this can go (and it has made me realise that the allmusic data is not actually as good quality as I thought...)

    I have added some more checking on albums (it messes around with punctuation marks, as I was having issues matching some albums due to extra . or ! characters). I have also fixed the brackets now (I believe...) and cleaned up the code a fair bit.
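As a rough illustration of the punctuation handling described above, here is a minimal Python sketch. It is not the plugin's code; `normalise_title` and `titles_match` are hypothetical names, and replacing every ASCII punctuation character with a space is an assumption about what "messing around with punctuation" might look like.

```python
import string

# Translation table mapping every ASCII punctuation character to a space.
# Assumption: the real plugin may treat some characters differently.
_PUNCT_TO_SPACE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def normalise_title(title: str) -> str:
    """Lower-case a title, replace punctuation with spaces, collapse whitespace."""
    without_punct = title.translate(_PUNCT_TO_SPACE)
    return " ".join(without_punct.lower().split())


def titles_match(tag_title: str, site_title: str) -> bool:
    """Compare the tag title and the site title after normalising both."""
    return normalise_title(tag_title) == normalise_title(site_title)


if __name__ == "__main__":
    print(titles_match("Help!", "Help"))            # extra ! no longer matters
    print(titles_match("Mr. Bungle", "Mr Bungle"))  # extra . no longer matters
```

Normalising both sides the same way means a stray . or ! on either the tag or the site title no longer blocks the match, at the cost of treating titles that differ only in punctuation as identical.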

    I guess there are a couple of questions really...

    Do people actually want all artists scraping or just album artists?
    We are currently just scraping main albums and compilations. Do people want to scrape singles? The quality seems to go downhill the further you get away from main albums... Currently we pick up the first match, but I am noticing that some albums have multiple entries in the compilation tab. Singles are even worse: there are masses of entries (some of which look rubbish). Most singles also have very few details anyway, so I am not sure of the value in scraping these.

    The only improvement I was thinking of was to look through the artists where there are duplicates and if there is only a single entry with years active set then pick that. This however could lead to lots more false positives.

    Any feedback on what is missing is still appreciated.
     

    ysmp

    Design Group
  • Team MediaPortal
  • May 17, 2008
    1,863
    744
    Seoul.
    Home Country
    South Korea
    Hi jameson_uk ,
    I tried v8 and still get errors (before it was error 22 and 23, now it is error 26), but for sure much more data is picked up and it downloads faster with version 8 ...
     

    jameson_uk

    OK, I have made the change so that, as a last resort, it will go through the list of artists and, if there is only a single entry with years active populated, it will choose that one. My limited testing seems to suggest this works quite well.
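The last-resort rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the candidate records are plain dictionaries, and the `years_active` field name and the `pick_last_resort` function are hypothetical, not taken from the plugin.

```python
from typing import Dict, List, Optional


def pick_last_resort(candidates: List[Dict[str, str]]) -> Optional[Dict[str, str]]:
    """Return the only candidate with years active populated, else None.

    Assumption: each candidate is a dict and an empty or missing
    'years_active' value means the site had no years active for it.
    """
    with_years = [c for c in candidates if c.get("years_active")]
    # Only choose when exactly one entry qualifies; anything else stays unmatched.
    return with_years[0] if len(with_years) == 1 else None


if __name__ == "__main__":
    duplicates = [
        {"name": "Biosphere", "years_active": ""},
        {"name": "Biosphere", "years_active": "1990s - 2010s"},
    ]
    print(pick_last_resort(duplicates))
```

Returning nothing when zero or several entries have years active keeps the rule conservative, which matters given the earlier concern about false positives.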

    I have now uploaded this to MPEI. You can install it via the MPEI installer, the MPEI plugin or directly from the project page (musicinfohandler - Artist / Album Info Scraper for MediaPortal - Google Project Hosting). It is listed as beta, however, so you need to untick the "only show stable releases" option.

    What I will probably do next is create a debug version that produces a list of everything not picked up. Then I can start working through those to see if there is anything we can do.
     

    Pog

    Retired Team Member
  • Premium Supporter
  • September 7, 2009
    401
    315
    Wicklow
    Home Country
    Ireland
    Using V8, brackets are still causing problems.

    Looks like they get processed correctly but don't show up in MP, not added to the DB?

    2011-09-04 13:59:03.497476 [Debug][(32)]: MusicInfoHandler: Album: Processing Album: beatles, the - abbey road (remastered)

    There is no "not able to process" error.

    Will add some more feedback to your other post in a bit...
     

    jameson_uk

    Pog said:
        Using V8, brackets are still causing problems.

        Looks like they get processed correctly but don't show up in MP, not added to the DB?

        2011-09-04 13:59:03.497476 [Debug][(32)]: MusicInfoHandler: Album: Processing Album: beatles, the - abbey road (remastered)

        There is no "not able to process" error.

        Will add some more feedback to your other post in a bit...

    I think this is fixed in the MPEI version I have just uploaded. Basically it was the same problem as with the artists: it was storing the album under the name from allmusic.com but looking it up by the album name you have stored in your tags. It is now storing album info against the name you have stored in your tags.
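To make the fix concrete, here is a small Python sketch of storing scraped info against the tag name rather than the site name. Everything in it is an assumption for illustration: the in-memory dictionary stands in for the MediaPortal database, and `store_album`, `lookup_album` and the lower-cased key are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Stand-in for the music database: (artist, album) from the tags -> scraped info.
album_info: Dict[Tuple[str, str], Dict[str, str]] = {}


def _key(tag_artist: str, tag_album: str) -> Tuple[str, str]:
    """Build the lookup key from the names stored in the file tags."""
    return (tag_artist.lower(), tag_album.lower())


def store_album(tag_artist: str, tag_album: str, scraped: Dict[str, str]) -> None:
    """Store scraped data under the tag names, not the site's own title."""
    album_info[_key(tag_artist, tag_album)] = scraped


def lookup_album(tag_artist: str, tag_album: str) -> Optional[Dict[str, str]]:
    """Look up by the same tag names that were used when storing."""
    return album_info.get(_key(tag_artist, tag_album))


if __name__ == "__main__":
    scraped = {"site_title": "Abbey Road", "review": "..."}
    store_album("Beatles, The", "Abbey Road (Remastered)", scraped)
    print(lookup_album("beatles, the", "abbey road (remastered)"))
```

Because the store and the lookup both use the tag names, an album tagged "abbey road (remastered)" is found even though the site calls it "Abbey Road"; the site title can still be kept inside the stored record.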
     

    jameson_uk

    ysmp said:
        Hi jameson_uk,
        I tried v8 and still get errors (before it was error 22 and 23, now it is error 26), but for sure much more data is picked up and it downloads faster with version 8 ...

    Not sure what you mean by the numbers but if you are talking about
    Code:
    [ERROR][(25)]
    then ignore that, as it is just an ID.

    The errors in your file this time are timeouts (3 seconds). There is little I can do about this except increase the timeout. When you start MP again, I think those that timed out should be picked up again.
     

    emphatic

    Design Group
  • Team MediaPortal
  • August 25, 2006
    3,736
    1,196
    Alingsås
    Home Country
    Sweden
    I get some error about dependencies when I try the MPEI installer. I'm running RC1.

    Emph
     

    jameson_uk

    emphatic said:
        I get some error about dependencies when I try the MPEI installer. I'm running RC1.

        Emph

    Google Code is playing up at the minute so I cannot upload an update. I will do this as soon as I can.

    In the meantime, there is an installer attached to the first post, which I think should work.
     

    Pog

    Looking good, installs ok and deals with prefixes.

    Now I've found something very interesting, and maybe a way to solve finding artists/albums. Google is your friend: a site search with "I'm Feeling Lucky", to be precise:

    Sinead O'Connor:
    www.google.com/search?q=site:www.allmusic.com+sinead%20o%27connor&btnI

    Al Green:
    www.google.com/search?q=site:www.allmusic.com+al%20green&btnI

    Biosphere:
    www.google.com/search?q=site:www.allmusic.com+Biosphere&btnI

    I have not tried everything, but it seems to work great for artists that are not currently being picked up.
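The three links above follow one pattern, so the URL could be built from the artist name. The Python sketch below only constructs the string and does not fetch it; `lucky_url` is a hypothetical helper, the `https://` scheme is an assumption (the links in the post have none), and whether Google actually redirects to the first hit is not checked here.

```python
from urllib.parse import quote


def lucky_url(artist: str, site: str = "www.allmusic.com") -> str:
    """Build a site-restricted "I'm Feeling Lucky" search URL for an artist.

    Mirrors the pattern in the post: q=site:<site>+<percent-encoded name>&btnI.
    """
    encoded = quote(artist, safe="")  # space -> %20, apostrophe -> %27
    return "https://www.google.com/search?q=site:" + site + "+" + encoded + "&btnI"


if __name__ == "__main__":
    for name in ("sinead o'connor", "al green", "Biosphere"):
        print(lucky_url(name))
```

Percent-encoding with `quote` reproduces the `%20` and `%27` seen in the Sinead O'Connor link, so names with spaces or apostrophes do not break the query string.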
     