I need data from several Catalogues.
**I will provide about 40 URLs, *each of which* contains anywhere from 20 to 50 Catalogues.**? The data to be scraped from? ***each***? Catalogue is estimated at 30mB, so a very fast connection is necessary.
In essence:
-? The coder must *write an application* which fetches the URLs I will? provide and perform a scan/scrape of the different catalogues, parsing and downloading all relevant data. (Images, Price, Description)
- The said application should be able to store the parsed data from each catalog into an excel 97 file (.xls) and the images (.jpg)? in a separate folder
If interesed, please let me know and I will send you a sample of a URL for you to do in order to demonstrate your ability.
Thanks!
## Deliverables
I do not need the actual script or scraping algorithm.? All I will need is the actual data.? You may use any tricks or methods you wish to accomplish this task