Wikipedia Scraper 13.08.08
I need to scrape music information from “es.wikipedia.org”. I need four scripts
1. Singer Wikipedia page location Script
Given a list singer names, like (”Leona Lewis”, “Cliff Richard”,…. )this script must locate the correct Wikipedia URL for each artist, and output a list of the wikipedia URLs for each artist.
2. Artist details scraper
Given an artists URL on “es.Wikipedia.org”, e.g.
http://es.wikipedia.org/wiki/Michael_Jackson,
http://es.wikipedia.org/wiki/Leona_Lewis,
http://es.wikipedia.org/wiki/12twelve
the script must scrape the biography and discography and store it in a mysql table. The script must be robust and be able to handle a wide range of artist pages.
The script must extract the song list of the each album in the discography, where available.
3. Simple Edit Application
The script generates a web page with a drop down list of the scraped artists, and allows the content to be reviewed, with a simple edit option
4. Artist List Macro Script
Given a list of artists, the macro script must call each of the above script 1 and 2, to generate and store all the information on the list of artists



