Split File into Largest Even Multiple Given Number of Records
$30-250 USD
Closed
Posted over 3 years ago
$30-250 USD
Paid on delivery
I need help with a task. Details below.
Given file A with x number of records, split the file into y number of smaller files containing equal number of records.
Notes:
Records are can only be identified by the line starting with "GROUP"
ALL text is surrounded by quotes ("")
All text is delimited by semicolon (;)
All text ends in a new line
All files end in a newline
All quotes and delimiters must remain intact.
Just splitting the file as is, no other changes. Keeping order.
File may contain any number of records - primarily used for files with >10k records
Records lengths vary by number of (new)lines
All files will not have the same order of body text between records and may end with different text; the only marker of a new record is a line beginning with "GROUP"
[login to view URL] holds the sample text. It holds 20050 'records' (Lines that start with group - all text after until the next 'Group' is part of the same 'record')
The primary issue I was having with this project was identifying blocks of records in order to manipulate them (see line 234-238 in [login to view URL] - tried to use a 'pointer', really want to use a map function?). My work so far is in [login to view URL] located at [login to view URL] This is just to show my thought process. Can discuss at end of project.
Input: path to file
Output: N number of files each with y number of records
Deliverables
Rnotebook that splits a file with contents following the structure outlined above
Preferably solved with a Tidy solution or Base R solution
Please include comments throughout code
Suggestions on next steps to make distributable
Next steps for me are to make into a shiny app and host on AWS or Azure for users to select their file and receive split files in return
Opportunity for ongoing codementor help
EXAMPLE:
[login to view URL] holds 20050 records. I add the path to the sample text in the R script or Notebook. I enter the number of resulting files I want. The script determines how many records should approximately be in each file, some left over in the last file is okay. If I enter 5 for the number of output files I want, the script should return 5 files, each with 4010 records. If I enter 6 for the number of output files, the script should return 5 files with 3340 records and the 6th would hold the remainder.
Summary and 'pseudocode'
Count number of records
Identify records
Find the number of records (y) that would split closest to evenly to result in user defined number of output files with y records in each
For line in notice_line:
For the number of lines in notice_line
If a line starts with "Group", Create an empty file
Name the empty file File_n , n for line number in notice_line
Put the line in the empty record
If the line is not group
Add the line to the existing file
Until the file has the number of records that would make all resulting files have approximately y number of records, where y is the number of resulting files the user would like to have outputted
Deliverables
Deliverables
Rnotebook that splits a file with contents following the structure outlined above
Preferably solved with a Tidy solution or Base R solution
Please include comments throughout code
Suggestions on next steps to make distributable
Hi, Greetings!
✅checked your project details: Split File into Largest Even Multiple Given Number of Records
✅Completed Time: In project deadline
We have worked on 600 + Projects. I have 6 + years of the experience in same kind of projects. If you are looking for a true Freelancer, I am the Right person for you. I am available almost 24-7 and am very responsive. I feel proud that I am a trusted Freelancer who pleases almost every single client. You can rest assure, your work will be delivered well in advance of others, with passion and accuracy. I guarantee you instant communication & responses when you need me. Why choose me? I think every client is the reason for my success. I only take projects which I am sure I can do quickly.
My Portfolio Items: https://www.freelancer.com/u/schoudhary1553
I would really like to work with you on this project. If interested, Kindly contact me via chat for further details and discussion..
Thank you
Sandeep
Digital screencast
I can I help you in Split File into Largest Even Multiple Given Number of Records. I have read and understood all your initial requirements, and I feel,I am producing quality data entry for my clients including; Web Research, Data Mining, Internet Research. I can provide you 100% quality even in a short period. I can assure you. Waiting for your message.!!
Hi I am 100% sure this project . I am ready to start. I am expert in Microsoft Office specially Excel, Word and Access Database. Data Entry and Processing is my passion with years of experiences. I am talented and very hard working also know the value of time so, always try to deliver work on time.
Hello I am a powershell, perl and shell script expert and did similar split file scripting in my past and i can help you to split the file via any one of scripting
Kindly confirm are you comfortable with Powershell or perl? or you are looking for solution form some other scripting language
Ping me for further discussion
Thanks
__________________I am available right now______________________
Hi there, Quality and time is my commitment. I have done this many times, I want to say that, I will start right now and funds will be after your satisfaction.
Consider!
Dear client, my name is Yesi Cortes, I have read your project and I have a lot of experience in handling Excel, I can make your spreadsheet the way you request it and in the time required and in total I can carry out the project for you.
if you want write me!
Hi,
I have been working a global company as computer engineer. I had worked a lot of excel/data entry projects.
I can work on your jobs and can finish as soon as possible.