Split File into Largest Even Multiple Given Number of Records

Đã Đóng Đã đăng vào 3 năm trước Thanh toán khi bàn giao
Đã Đóng Thanh toán khi bàn giao

I need help with a task. Details below.

Given file A with x number of records, split the file into y number of smaller files containing equal number of records.

Notes:

Records are can only be identified by the line starting with "GROUP"

ALL text is surrounded by quotes ("")

All text is delimited by semicolon (;)

All text ends in a new line

All files end in a newline

All quotes and delimiters must remain intact.

Just splitting the file as is, no other changes. Keeping order.

File may contain any number of records - primarily used for files with >10k records

Records lengths vary by number of (new)lines

All files will not have the same order of body text between records and may end with different text; the only marker of a new record is a line beginning with "GROUP"

[login to view URL] holds the sample text. It holds 20050 'records' (Lines that start with group - all text after until the next 'Group' is part of the same 'record')

The primary issue I was having with this project was identifying blocks of records in order to manipulate them (see line 234-238 in [login to view URL] - tried to use a 'pointer', really want to use a map function?). My work so far is in [login to view URL] located at [login to view URL] This is just to show my thought process. Can discuss at end of project.

Input: path to file

Output: N number of files each with y number of records

Deliverables

Rnotebook that splits a file with contents following the structure outlined above

Preferably solved with a Tidy solution or Base R solution

Please include comments throughout code

Suggestions on next steps to make distributable

Next steps for me are to make into a shiny app and host on AWS or Azure for users to select their file and receive split files in return

Opportunity for ongoing codementor help

EXAMPLE:

[login to view URL] holds 20050 records. I add the path to the sample text in the R script or Notebook. I enter the number of resulting files I want. The script determines how many records should approximately be in each file, some left over in the last file is okay. If I enter 5 for the number of output files I want, the script should return 5 files, each with 4010 records. If I enter 6 for the number of output files, the script should return 5 files with 3340 records and the 6th would hold the remainder.

Summary and 'pseudocode'

Count number of records

Identify records

Find the number of records (y) that would split closest to evenly to result in user defined number of output files with y records in each

For line in notice_line:

For the number of lines in notice_line

If a line starts with "Group", Create an empty file

Name the empty file File_n , n for line number in notice_line

Put the line in the empty record

If the line is not group

Add the line to the existing file

Until the file has the number of records that would make all resulting files have approximately y number of records, where y is the number of resulting files the user would like to have outputted

Deliverables

Deliverables

Rnotebook that splits a file with contents following the structure outlined above

Preferably solved with a Tidy solution or Base R solution

Please include comments throughout code

Suggestions on next steps to make distributable

Excel Nhập liệu Xử lí dữ liệu Khai thác dữ liệu Tìm kiếm web

ID dự án: #27434677

Về dự án

10 đề xuất Dự án từ xa 3 năm trước đang mở

10 freelancer chào giá trung bình$154 cho công việc này

schoudhary1553

Hi, Greetings! ✅checked your project details: Split File into Largest Even Multiple Given Number of Records ✅Completed Time: In project deadline We have worked on 600 + Projects. I have 6 + years of the experien Thêm

$220 USD trong 4 ngày
(415 Nhận xét)
8.2
writingapp

Hi. I have written many automation apps for formatting files. Can format to any templates. Ready to write your project. Will complete within 3 days

$90 USD trong 3 ngày
(68 Nhận xét)
5.5
Valuesolutions

Hello, please contact me for more details.... .....................................................

$140 USD trong 7 ngày
(10 Nhận xét)
4.6
Banze

I can I help you in Split File into Largest Even Multiple Given Number of Records. I have read and understood all your initial requirements, and I feel,I am producing quality data entry for my clients including; Web Re Thêm

$200 USD trong 7 ngày
(32 Nhận xét)
4.5
azfatima512

Hi I am 100% sure this project . I am ready to start. I am expert in Microsoft Office specially Excel, Word and Access Database. Data Entry and Processing is my passion with years of experiences. I am talented and very Thêm

$30 USD trong 1 ngày
(21 Nhận xét)
4.1
goodwork20k

Hello I am a powershell, perl and shell script expert and did similar split file scripting in my past and i can help you to split the file via any one of scripting Kindly confirm are you comfortable with Powershell o Thêm

$200 USD trong 7 ngày
(3 Nhận xét)
3.0
hoorulain7586

__________________I am available right now______________________ Hi there, Quality and time is my commitment. I have done this many times, I want to say that, I will start right now and funds will be after your satisf Thêm

$120 USD trong 1 ngày
(2 Nhận xét)
1.5
sojibahammedwd

Hi, I can help you as your instruction. Please contact with me. I am interestedly to do the job. Thank you.

$278 USD trong 6 ngày
(0 Nhận xét)
0.0
Yco89

Dear client, my name is Yesi Cortes, I have read your project and I have a lot of experience in handling Excel, I can make your spreadsheet the way you request it and in the time required and in total I can carry out t Thêm

$140 USD trong 7 ngày
(0 Nhận xét)
0.0
Aliistek

Hi, I have been working a global company as computer engineer. I had worked a lot of excel/data entry projects. I can work on your jobs and can finish as soon as possible.

$120 USD trong 7 ngày
(0 Nhận xét)
0.0