We have an overview website with blogs, we want to scrape this website and get the blogs. For each blog get the id, title, data (basic info) of the articles.
This should be converted to a format like xml/csv so we can process the data.
Knowledge of object oriented development is required, as we plan to include more website in the future and want the code to be easily extendible to serve more websites.
Also checks must be implemented to see if the crawl was done successfully. (eg in case the website changes)
[url removed, login to view] looks like a good and easy tool to do so. Suggestions however are welcome.
- Preference goes to software engineers / computer science engineers (students welcome)
- Preference to eastern European engineers (time zone difference)
- This is a really small project. A VPS with login will be given.
- We are looking for reliable partners to outsource different tasks of our software company. We are working very flexible, but require real geeks as quality is important.
Please post "CRAWL-ENGINEER" in your jobreply, so we know you have at least taken the time to read this...
18 freelancer đang chào giá trung bình €142 cho công việc này
CRAWL-ENGINEER. Hi, I have more than a year of experience in web scraping using python and I always deliver high-quality software. Look forward to hearing from You.