Break down HTML pages by Header and associated text
$30-250 USD
Thanh toán khi bàn giao
I need a program that:
1) Accepts a list of one or more (up to 1000) top level domains from a .txt file (one TLD per line) list the
2) Spiders the home page fromt he TLD in the list to generate a partial sitemap for that domain
a) nothing that requires a login or transaction
b) just the pages that link directly to the hope page
3) Creates a report in .csv format that contains the following
a) URL
b) Header Level
c) Header Text
d) Text immediately before header. Must allow null because the first Header won't have a paragraph before
e) Text immediately after header. Must allow null because the last Header on a page won't have a paragraph after
The program should be triggerable from a web page, where the .txt file containing the list of top level domains can be specified through a "choose file" mechanism, and a "download results" button should appear when the program is finished running
Don't care what behind the scenes programming language is used to perform the spidering or sound check. DO care that the results are accurate
Input file
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
etrade
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
Output example:
[login to view URL], H2, "Stay at home", null, "only go outside for food, health reasons or work (but only if you cannot work from home"
[login to view URL], H1, "Welcome to [login to view URL]", null, "best place to find government services and information"
...
ID dự án: #24629356
Về dự án
28 freelancer chào giá trung bình$166 cho công việc này
Hi there,I'm biddin on your project "Break down HTML pages by Header and associated text" I have read your project description and i'm an expert in Python and machine learning therefore i can do this project for you pe Thêm
Hello, I can help you with your project - Break down HTML pages by Header and associated text I have gone through your job posting and become very much interested to work with you. I am an expert in this field. I ha Thêm
Hi, I am ready to start it. But i have some questions for you. So kindly leave message for me, then ill discuss it in depth.
Very interesting concept, i like it ! would much enjoy developing it, for the web interface, do you have a server ? or want me to suggest a small linux server which will handle the web app and the scraper ?
Hi. I am ready to write your project. I have written many automation projects. Will complete within 3 days
Hi, I can make this program for you by tomorrow. Shoot me msg & let's get started tonight. Thanks
Hi! How are you doing? I have read the project description and really interested in this job, I have 4 years’ experience doing similar jobs regarding to these skills Software Architecture, Javascript, HTML and Python. Thêm
Hello there. I have seen your job posting. I will like to ask some questions. Please come over the chat so we can discuss things. Some intro about me. I am an enthusiastic developer/implementer who does not stop until Thêm
Its completely doable. I've done similar automation things many times before for my own convenience. i can additionally deploy the application of an amazon EC2 machine where it can sit for one year without any charg Thêm
Hello Sir, This project could be done in 48 hours, the budget you are setting is generous and since I am an honest person I am requesting only the real value for this project. I could start inmediatly and develope the Thêm
Dear Sirs, I have extensive software development experience and would be happy to drive this project to successful completion. Best Regards, Igor D
Hi, My name is Alpa Patel, - 7 Years of Professional work experience in creative Website Design, Design Landing Page, Blog Install, logo design and Graphic Design as per current UI/UX market trends. - I have careful Thêm
Experienced(6 Years), Android and Ios(Native or Hybrid), PHP, Delphi , Python, Vb, C# and Java developer and web designer with the depth knowledge related to MySQL,Codeigniter,Laravel,WordPress,MSSQL,JSON,JAVASCRIPT,AJ Thêm
Work with me is great deal for you I have done your job with my experience and give you a soft touch Relevant Skills and Experience I have done so much project like that and have 5 years experience of tha field
I will make python web based application wich will accept accept domain from text file. And then will provide with csv file with the format provided by you Relevant Skills and Experience My expertise are in python , H Thêm
Im Python Developer since 2015 , Advanced in Web Scraping and i have done similar projects before ... I can do this quickly for you.