We need the following web scraping software:
The basis is an Excel list of company website domains (it can also be provided in a
different format if necessary).
Certain company data needs to be extracted from these company websites including
The extracted structured data has to be saved in an Excel file.
This is the company data that needs to be extracted:
- Company name
- Street and street number
- Zip code
- Phone number
- Fax number
- E-mail address
- Number of employees
- Founding year
- Certificates (For example "DIN EN ISO 9001")
(These data are also required, but maybe it's not possible to extract them easily:
- Correspondence language(s)
- Type of company (either service provider, manufacturer or distributor)
- Product portfolio
- Export countries)
We are looking for freelancers with experience in this area and would like to see a
sample of similar work.
Payment is executed once a software is 100% finished. Please don't bid if you don't
agree with this.
Thank you in advance for your offers.
There were a few questions about which websites need to be scraped.
We want the software to collect those website URLs from an Excel sheet.
So that we can repeat the scraping over and over again with new URLs.
Not only the starting page but also the subdomains need to be scraped!
Enclosed you will find an example for a list of websites that need to be scraped.
The other Excel sheet shows you the results we would like to have.