I am looking at developing a project involving YouTube, the video hosting site, and the video section of MySpace ([login to view URL]). I need someone to scrape both the sites to find 1) The name of each video 2) It's length 3) When it was added (as that info is listed on the site, e.g. "1 year ago") 4) The UserID of the uploader 5) The number of views it's had 6) The number of stars it's been given 7) The number of ratings it has 8) Its YouTube/MySpace category; there are around 12 on each site. NOTE I DO NOT NEED THE VIDEO ITSELF If you start at the YouTube Home page, and then go to Categories, you will find all of the offerings, grouped together by category. On MySpace you can page through this [login to view URL] There are approximately 3.7 million videos on MySpace and perhaps double that on YouTube. Each is growing by about 30,000 per day. What I am looking for is to crawl each web site twice, so two crawls of MySpace and two crawls of YouTube. Each crawl of the two sites will be approximately a month apart. I've attached a picture of the proposed schema of the application that might help clarify the above. The deliverable would be a MySQL Database file.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
MySQL Database Can have scraper on any platform.