Split File into Largest Even Multiple Given Number of Records
$30-250 USD
Thanh toán khi bàn giao
I need help with a task. Details below.
Given file A with x number of records, split the file into y number of smaller files containing equal number of records.
Notes:
Records are can only be identified by the line starting with "GROUP"
ALL text is surrounded by quotes ("")
All text is delimited by semicolon (;)
All text ends in a new line
All files end in a newline
All quotes and delimiters must remain intact.
Just splitting the file as is, no other changes. Keeping order.
File may contain any number of records - primarily used for files with >10k records
Records lengths vary by number of (new)lines
All files will not have the same order of body text between records and may end with different text; the only marker of a new record is a line beginning with "GROUP"
[login to view URL] holds the sample text. It holds 20050 'records' (Lines that start with group - all text after until the next 'Group' is part of the same 'record')
The primary issue I was having with this project was identifying blocks of records in order to manipulate them (see line 234-238 in [login to view URL] - tried to use a 'pointer', really want to use a map function?). My work so far is in [login to view URL] located at [login to view URL] This is just to show my thought process. Can discuss at end of project.
Input: path to file
Output: N number of files each with y number of records
Deliverables
Rnotebook that splits a file with contents following the structure outlined above
Preferably solved with a Tidy solution or Base R solution
Please include comments throughout code
Suggestions on next steps to make distributable
Next steps for me are to make into a shiny app and host on AWS or Azure for users to select their file and receive split files in return
Opportunity for ongoing codementor help
EXAMPLE:
[login to view URL] holds 20050 records. I add the path to the sample text in the R script or Notebook. I enter the number of resulting files I want. The script determines how many records should approximately be in each file, some left over in the last file is okay. If I enter 5 for the number of output files I want, the script should return 5 files, each with 4010 records. If I enter 6 for the number of output files, the script should return 5 files with 3340 records and the 6th would hold the remainder.
Summary and 'pseudocode'
Count number of records
Identify records
Find the number of records (y) that would split closest to evenly to result in user defined number of output files with y records in each
For line in notice_line:
For the number of lines in notice_line
If a line starts with "Group", Create an empty file
Name the empty file File_n , n for line number in notice_line
Put the line in the empty record
If the line is not group
Add the line to the existing file
Until the file has the number of records that would make all resulting files have approximately y number of records, where y is the number of resulting files the user would like to have outputted
Deliverables
Deliverables
Rnotebook that splits a file with contents following the structure outlined above
Preferably solved with a Tidy solution or Base R solution
Please include comments throughout code
Suggestions on next steps to make distributable
ID dự án: #27434677
Về dự án
10 freelancer chào giá trung bình$154 cho công việc này
Hi, Greetings! ✅checked your project details: Split File into Largest Even Multiple Given Number of Records ✅Completed Time: In project deadline We have worked on 600 + Projects. I have 6 + years of the experien Thêm
Hi. I have written many automation apps for formatting files. Can format to any templates. Ready to write your project. Will complete within 3 days
Hello, please contact me for more details.... .....................................................
Hi I am 100% sure this project . I am ready to start. I am expert in Microsoft Office specially Excel, Word and Access Database. Data Entry and Processing is my passion with years of experiences. I am talented and very Thêm
Hello I am a powershell, perl and shell script expert and did similar split file scripting in my past and i can help you to split the file via any one of scripting Kindly confirm are you comfortable with Powershell o Thêm
__________________I am available right now______________________ Hi there, Quality and time is my commitment. I have done this many times, I want to say that, I will start right now and funds will be after your satisf Thêm
Hi, I can help you as your instruction. Please contact with me. I am interestedly to do the job. Thank you.
Hi, I have been working a global company as computer engineer. I had worked a lot of excel/data entry projects. I can work on your jobs and can finish as soon as possible.