Đã Đóng

Build app to scrape data from a web page

When bidding for this project, please include the sentence: "I have read the specs. This is no automated reply".

The project involves building an application for Mac OS X (as well as being compatible for Windows, if possible) to scrape information of a password protected member directory on a web page. We have legal access to the site and the forums in question, which is located on LinkedIn. We basically need to convert personal information of fellow members on different forums, to excel.

Ouput in excel:

1. Name

2. Picture

3. Current (title)

4. Current (place of work)

5. Phone

6. Birthday

7. Marital status

8. Summary

9. Specialties

10. Experience (multiple listings) --> Title, company, time period

11. Education (multiple listings) --> Place of study, field of study, time period

12. Twitter profile

13. Company website (possibly multiple listings)

14. Group affiliations

For more information on what type of information is available through a LinkedIn profile, and how it is organized and in what format, plz refer to LinkedIn directly. Please describe specifically, in your bid, how you will deal with the complexity of multiple layers of information in the conversion to excel (experience, education and company website). I also wish to be able to download a copy of pictures. Please also describe how you suggest to solve this with regard to creating the output in excel.

The application provided, must have an easily understood user interphase, so it can be managed without any need for programming experience, neither in html or other programming languages.

The application should be easily modified for use for scraping other websites as well. The application should, therefore, have programming options, where LinkedIn is already provided as a default setting, but where one has the opportunity to modify settings with respect to what "categories" to convert both from and to.

As well, the developed app cannot be part of any license restricted platform. The app must utilize open source solutions with regard to the final product transferred to buyer.

The application must have an "update feature" - where it can be set to do automatic updates of the different scrapes, on a regular basis. Both updates and the scraping itself must use proxies, so our IPs do not get banned. Duplicate information filtering also required with regard to update feature.

When bidding for this project, the project winner transfers - without reservation - all ownership and rights to the application, including source codes. These rights are transferred to the buyer, when the agreed compensation is transferred from buyer to developer.

Kĩ năng: Bất kì công việc gì, Khai thác dữ liệu, Xử lí dữ liệu, Excel, Web Scraping

Xem nhiều hơn: build app scrape website, what is time complexity in c, what is time complexity, what are programming languages, websites programming languages, websites build, website for c programming solve, web scraping solutions, web scraping part time, web scraping legal, web scraping application, web programming solutions, web programming license, web page programming languages, web page format, web page build, web developer with codes, web developer website question, web developer profile summary, web developer personal page, web developer os, web developer options, web developer for mac, web developer directory, web developer bidding site

Về Bên Thuê:
( 0 nhận xét ) Oslo, Norway

ID dự án: #1042081