I need a Python program that extracts information from text files (.rtf). Each .rtf file is a collection of newspaper articles published on a certain date, and each file is named yymmdd_#.rtf. The newspaper articles within a file are separated by page breaks.
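As a rough sketch of the first step described above, the file name can be parsed for its date and the raw RTF source split at page breaks. This assumes the page breaks are stored as the standard RTF control word `\page`, and that two-digit years of 50 or above belong to the 1900s; neither is confirmed by the posting. The function names `parse_filename` and `split_articles` are hypothetical, and converting the RTF markup to plain text is deliberately left out.

```python
import re
from pathlib import Path

# Matches names such as 920415_1.rtf (yymmdd, underscore, running number).
FILENAME_RE = re.compile(r"^(?P<date>\d{6})_(?P<seq>\d+)\.rtf$")

# \page is the RTF page-break control word. The lookahead keeps longer
# control words such as \pagebb from being treated as page breaks.
PAGE_BREAK_RE = re.compile(r"\\page(?![a-zA-Z])\s?")


def parse_filename(path):
    """Return year, full_date and sequence number taken from the file name."""
    match = FILENAME_RE.match(Path(path).name)
    if match is None:
        raise ValueError(f"unexpected file name: {path}")
    full_date = match.group("date")
    yy = int(full_date[:2])
    # Assumption: yy >= 50 means 19yy, otherwise 20yy.
    year = 1900 + yy if yy >= 50 else 2000 + yy
    return {"year": year, "full_date": full_date, "sequence": int(match.group("seq"))}


def split_articles(rtf_source):
    """Split raw RTF source into per-article chunks at each page break."""
    chunks = PAGE_BREAK_RE.split(rtf_source)
    return [chunk.strip() for chunk in chunks if chunk.strip()]


info = parse_filename("920415_1.rtf")
chunks = split_articles(r"first article\page second article\page third article")
```

Each chunk still contains RTF markup, so the publisher, type, title and body would have to be pulled out in a later step that depends on how the articles are laid out.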
For each newspaper article in the text file I am looking for the following information to be organized into a .csv format: (1) year, (2) full_date (yymmdd), (3) publisher, (4) type, (5) title, (6) body of newspaper article. Please see the attached sample_csv_output files for illustration; the information of the first newspaper article appearing in [url removed, login to view] (also attached) is added for illustrative purposes.
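The six requested columns could be written with the standard `csv` module, roughly as sketched below. The column name `body` for item (6), the function name `write_articles_csv`, and the sample row (including the publisher "Example Gazette") are placeholders, not values taken from the attached sample files.

```python
import csv
import io

# Column order follows the list in the posting; "body" is an assumed name.
FIELDNAMES = ["year", "full_date", "publisher", "type", "title", "body"]


def write_articles_csv(articles, stream):
    """Write one CSV row per article dict to an open text stream."""
    writer = csv.DictWriter(stream, fieldnames=FIELDNAMES)
    writer.writeheader()
    for article in articles:
        writer.writerow(article)


# Hypothetical sample row, used only to show the call.
sample = [
    {
        "year": 1992,
        "full_date": "920415",
        "publisher": "Example Gazette",
        "type": "News",
        "title": "Sample headline, with a comma",
        "body": "First line.\nSecond line.",
    }
]

buffer = io.StringIO(newline="")
write_articles_csv(sample, buffer)
csv_text = buffer.getvalue()
```

When writing to disk instead of a buffer, the file would be opened with `open(path, "w", newline="", encoding="utf-8")` so that article bodies containing line breaks and commas are quoted correctly.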
In addition to the .csv file, the python program would also need to save the body of each newspaper article into separate .txt files by the following naming scheme: yymmdd_publishername.txt. An example .txt output file is attached (920415_The New York [url removed, login to view]) for the first newspaper article appearing in 920415_1.rtf.
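A minimal sketch of the .txt output, assuming the publisher name is used as-is apart from characters that are not allowed in file names on common systems. The helper names and the sample publisher are hypothetical. Note that two articles from the same publisher on the same date would map to the same file name under this scheme; the posting does not say how that case should be handled, so the sketch simply overwrites.

```python
import re
import tempfile
from pathlib import Path

# Characters that are invalid in file names on Windows (and / on POSIX).
UNSAFE_CHARS = re.compile(r'[\\/:*?"<>|]')


def article_txt_name(full_date, publisher):
    """Build the yymmdd_publishername.txt file name."""
    safe_publisher = UNSAFE_CHARS.sub("_", publisher).strip()
    return f"{full_date}_{safe_publisher}.txt"


def save_article_body(out_dir, full_date, publisher, body):
    """Write the article body to its own .txt file and return the path."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    target = out_dir / article_txt_name(full_date, publisher)
    target.write_text(body, encoding="utf-8")
    return target


# Hypothetical usage with a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    saved = save_article_body(tmp, "920415", "Example Gazette", "Body text.")
    saved_name = saved.name
    saved_text = saved.read_text(encoding="utf-8")
```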
Please note that I am looking for the Python program (.py) that performs the above tasks, NOT the output files.
53 freelancers are bidding an average of $135 for this job
Hello, I'd be glad to develop the Python script that processes the text files for you. Looking forward to chatting with you soon about the details. Best regards,
Dear sir, I am a Python developer with 8 years of professional experience. I am interested in this job and confident I can do it. I am ready to start work. Best, Yongtao J
I just found a way to dump the info of an RTF file into a Python 3 script. Contact me so we can discuss the project in more detail. Thanks. Proof: [url removed, login to view]