Break down HTML pages by Header and associated text

Đã hoàn thành Đã đăng vào 3 năm trước Thanh toán khi bàn giao
Đã hoàn thành Thanh toán khi bàn giao

I need a program that:

1) Accepts a list of one or more (up to 1000) top level domains from a .txt file (one TLD per line) list the

2) Spiders the home page fromt he TLD in the list to generate a partial sitemap for that domain

a) nothing that requires a login or transaction

b) just the pages that link directly to the hope page

3) Creates a report in .csv format that contains the following

a) URL

b) Header Level

c) Header Text

d) Text immediately before header. Must allow null because the first Header won't have a paragraph before

e) Text immediately after header. Must allow null because the last Header on a page won't have a paragraph after

The program should be triggerable from a web page, where the .txt file containing the list of top level domains can be specified through a "choose file" mechanism, and a "download results" button should appear when the program is finished running

Don't care what behind the scenes programming language is used to perform the spidering or sound check. DO care that the results are accurate

Input file

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

etrade

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

[login to view URL]

Output example:

[login to view URL], H2, "Stay at home", null, "only go outside for food, health reasons or work (but only if you cannot work from home"

[login to view URL], H1, "Welcome to [login to view URL]", null, "best place to find government services and information"

...

HTML JavaScript Kiến trúc phần mềm Python

ID dự án: #24629356

Về dự án

28 đề xuất Dự án từ xa 3 năm trước đang mở

Được trao cho:

w4po

⭐ Hello, Sheri⭐ I can do this easily, I'll automate this job, Cost is $100 in Less than 3 days (maybe ~12 hours), My previous work below: ///// This is a video to demonstrate Process automation, programmed way Of a m Thêm

$100 USD trong 3 ngày
(19 Đánh Giá)
4.7

28 freelancer chào giá trung bình$166 cho công việc này

liveexperts123

Hi there,I'm biddin on your project "Break down HTML pages by Header and associated text" I have read your project description and i'm an expert in Python and machine learning therefore i can do this project for you pe Thêm

$250 USD trong 4 ngày
(81 Nhận xét)
7.5
schoudhary1553

Hello, I can help you with your project - Break down HTML pages by Header and associated text I have gone through your job posting and become very much interested to work with you. I am an expert in this field. I ha Thêm

$220 USD trong 4 ngày
(134 Nhận xét)
7.2
umg536

Hi there, I'm biddin on your project "Break down HTML pages by Header and associated text" Being an expert in Python and matlab programming I can do this project for you. please leave a message on my chat so we can di Thêm

$250 USD trong 4 ngày
(29 Nhận xét)
6.6
ZnDevelopers

Hi, I am ready to start it. But i have some questions for you. So kindly leave message for me, then ill discuss it in depth.

$150 USD trong 3 ngày
(172 Nhận xét)
7.0
nicrosoft

Dear Prospective Employer, I hope you are doing well and thank you for sharing your project requirements, I'm willing to work on your project immediately. Please send me a message in chat so we can further discuss your Thêm

$200 USD trong 5 ngày
(49 Nhận xét)
6.3
BelisaG

Hey dear I am very interested in your project. I have understood fully your requirement As for me, I am a full-stack developer with 5+ years experiences Specially, I have good talent about web scrapping and html an Thêm

$200 USD trong 3 ngày
(5 Nhận xét)
5.5
shantanupython

Very interesting concept, i like it ! would much enjoy developing it, for the web interface, do you have a server ? or want me to suggest a small linux server which will handle the web app and the scraper ?

$187 USD trong 4 ngày
(77 Nhận xét)
5.7
writingapp

Hi. I am ready to write your project. I have written many automation projects. Will complete within 3 days

$190 USD trong 3 ngày
(54 Nhận xét)
5.4
privatecaptain

Hi, I can make this program for you by tomorrow. Shoot me msg & let's get started tonight. Thanks

$250 USD trong 1 ngày
(25 Nhận xét)
5.4
umairkaramat24

Hi! How are you doing? I have read the project description and really interested in this job, I have 4 years’ experience doing similar jobs regarding to these skills Software Architecture, Javascript, HTML and Python. Thêm

$155 USD trong 10 ngày
(15 Nhận xét)
4.5
saadtariq329

Hello there. I have seen your job posting. I will like to ask some questions. Please come over the chat so we can discuss things. Some intro about me. I am an enthusiastic developer/implementer who does not stop until Thêm

$140 USD trong 7 ngày
(8 Nhận xét)
4.4
umartechboy

Its completely doable. I've done similar automation things many times before for my own convenience. i can additionally deploy the application of an amazon EC2 machine where it can sit for one year without any charg Thêm

$178 USD trong 1 ngày
(1 Nhận xét)
3.3
christian010

Hello Sir, This project could be done in 48 hours, the budget you are setting is generous and since I am an honest person I am requesting only the real value for this project. I could start inmediatly and develope the Thêm

$100 USD trong 2 ngày
(8 Nhận xét)
3.2
AVKor

Hello, I'm a software/web developer. I can create a web app in ruby/Sinatra that does read your text input file, get all required data for all TLD in it with showing you a progress of that process and then output the r Thêm

$140 USD trong 7 ngày
(5 Nhận xét)
2.9
dark

Dear Sirs, I have extensive software development experience and would be happy to drive this project to successful completion. Best Regards, Igor D

$200 USD trong 7 ngày
(1 Nhận xét)
1.9
alpaboghara901

Hi, My name is Alpa Patel, - 7 Years of Professional work experience in creative Website Design, Design Landing Page, Blog Install, logo design and Graphic Design as per current UI/UX market trends. - I have careful Thêm

$200 USD trong 5 ngày
(3 Nhận xét)
1.0
kuetedonald

Experienced(6 Years), Android and Ios(Native or Hybrid), PHP, Delphi , Python, Vb, C# and Java developer and web designer with the depth knowledge related to MySQL,Codeigniter,Laravel,WordPress,MSSQL,JSON,JAVASCRIPT,AJ Thêm

$100 USD trong 7 ngày
(0 Nhận xét)
0.0
pankajsamaspuria

Work with me is great deal for you I have done your job with my experience and give you a soft touch Relevant Skills and Experience I have done so much project like that and have 5 years experience of tha field

$78 USD trong 3 ngày
(0 Nhận xét)
0.0
coolrock991

I will make python web based application wich will accept accept domain from text file. And then will provide with csv file with the format provided by you Relevant Skills and Experience My expertise are in python , H Thêm

$156 USD trong 3 ngày
(0 Nhận xét)
0.0
yassineelkasmy

Im Python Developer since 2015 , Advanced in Web Scraping and i have done similar projects before ... I can do this quickly for you.

$180 USD trong 2 ngày
(0 Nhận xét)
0.0