Freelance Projects

All freelance projects at One Location


Wikipedia Scraper  13.08.08

I need to scrape music information from “es.wikipedia.org”. I need four scripts

1. Singer Wikipedia page location Script
Given a list singer names, like (”Leona Lewis”, “Cliff Richard”,…. )this script must locate the correct Wikipedia URL for each artist, and output a list of the wikipedia URLs for each artist.

2. Artist details scraper

Given an artists URL on “es.Wikipedia.org”, e.g.

http://es.wikipedia.org/wiki/Michael_Jackson,
http://es.wikipedia.org/wiki/Leona_Lewis,
http://es.wikipedia.org/wiki/12twelve

the script must scrape the biography and discography and store it in a mysql table. The script must be robust and be able to handle a wide range of artist pages.

The script must extract the song list of the each album in the discography, where available.

3. Simple Edit Application
The script generates a web page with a drop down list of the scraped artists, and allows the content to be reviewed, with a simple edit option

4. Artist List Macro Script

Given a list of artists, the macro script must call each of the above script 1 and 2, to generate and store all the information on the list of artists



If you liked this project, make sure you
Subscribe to Freelance Projects RSS feed!



    • Your Ads Here
    • Your Ads Here