File parser to extract information and create report

In Progress Posted Mar 4, 2013 Paid on delivery
In Progress Paid on delivery

Below is the description of the program I want to be implemented.

############### The algorithm ##########################

Initialization phase: Load the attributes from the “[login to view URL]” to the memory.

Repeat every “parsing_interval” seconds

Copy the next “oldest” file (say “[login to view URL]”) from the “inbox_dir” directory to the “work_dir” directory (the oldest file can be determined from the file name; the file name convention is described below)

Unzip the [login to view URL] file (located in the “work_dir” directory). It will result in a text file, say “[login to view URL]”

Parse the “[login to view URL]” as described in the section “Parsing Procedure below” (and add parsing results to the report file).

Delete both “[login to view URL]” and “[login to view URL]” from the “work” directory.

############## The Parsing Procedure #####################

While not EOF do

Read next line from the file “[login to view URL]”

If the value of the “Primary attribute” is equal to ANY of the possible values listed in the “[login to view URL]” then do the following

If the value of of each secondary attribute is equal to ANY of the possible values listed in the "[login to view URL]" then

Get ALL the values associated with the secondary attributes (loaded from the [login to view URL] )

Append the primary attribute name, primary attribute value, and all secondary attribute names and values to the “report_file” (If the report_file is not present, then create a new one)

############## The Config File ##########################

# time interval for parsing the next file in seconds

parsing_interval: 60

# Report lifespan in days

Report_life: 30

# Directory paths

Inbox_dir: C:\Users\jsmith\Documents\inbox_dir

Work_dir: C:\Users\jsmith\Documents\work_dir

Report_dir: C:\Users\jsmith\Documents\report_dir

# Primary Attribute

Attribute2: value2_1, value2_2, value2_3

# Secondary Attribute Lists

Attribute1: Value1_1, Value1_2

Attribute3: Value3_1, Value3_2, Value3_3

############### Important notes ##########################

1) The “gzip” files are very huge (It could be 400MB) so the code should be scalable

2) The code should run on both Windows and Unix/Linus environments.

3) The files in the “inbox_dir” will not be deleted or moved.

4) The files in the “inbox_dir” have the following naming convention: YYYYMMDD-SEQNO_*.gzip.

############ Sample contents of the “[login to view URL]” file ########################

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

mestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

##########################################################

1) Please let me know if you have any questions. I’ll send you a sample gzip file via private message so that you can test the application on real data.

2) If you respond to this request, please tell me the language you'll use and the approximate timeframe you'll need to do this and the approximate cost and I'll send you more details.

Please let me know if you have any questions.

C Programming Data Processing Java Perl Python

Project ID: #4302168

About the project

34 proposals Remote project Active Mar 8, 2013

Awarded to:

ShawarmaMan

Hi, I have over 10 years of experience with Perl and I'd be happy to help you with this. Please see my PM for more details.

$200 USD in 3 days
(0 Reviews)
0.0

34 freelancers are bidding on average $186 for this job

gangabass

I can do this for you. See PMB for details.

$100 USD in 2 days
(568 Reviews)
7.8
dobreiiita

Hi, I am JAVA expert and can surely help you here, Please share sample gzip file. Thanks

$400 USD in 10 days
(460 Reviews)
7.6
mantislin

Hi sir, please check PM, thx Kimi.

$317 USD in 6 days
(120 Reviews)
7.0
hbxfnzwpf

I am c++ expert, I can handle this, please check you private message.

$125 USD in 2 days
(176 Reviews)
7.1
samitXI

Please check your inbox. Thanks

$315 USD in 7 days
(184 Reviews)
7.3
wbslivera

hello, I can help you. Thanks

$150 USD in 7 days
(261 Reviews)
6.5
idleswell

Hello, I am a Perl scripting expert interested in your project. I will provide details of my bid in a PMB. A IDLER

$189 USD in 6 days
(185 Reviews)
6.3
buzzcoder

Hi I can help you, please check your pm,thanks.

$400 USD in 4 days
(56 Reviews)
6.3
atchayagandhi

Hi, The project is easily realisable with java. We have expertise with the same. Request you to kindly refer your private message board for further details.

$60 USD in 2 days
(46 Reviews)
6.4
fejs

Hi Sir. I'm expert in Python programming. I've done similar project recently, so this task shouldn't be problem. Can You send me zip file to test application? Best regards, Fejs.

$50 USD in 2 days
(139 Reviews)
6.3
shupantha

I have over 10 years of experience in applications development in C++. Please check your PM for my comments on your project.

$250 USD in 7 days
(42 Reviews)
6.2
chinbochen

Hi sir, I'm senior java developer, I'm interested. Thanks.

$350 USD in 20 days
(20 Reviews)
6.0
dipakjadhav08

I have 4 years experience in Java. Also I am Oracle certified for java 6. I can help you.

$100 USD in 1 day
(99 Reviews)
5.5
mjef

I have more than 7 years of professional experience in cross-platform C++ programming. I can help you.

$200 USD in 7 days
(16 Reviews)
5.3
j2eesoafree

Hi I can deliver this ensuring high quality and flexibility using Java. Please let me know any questions, and share a sample file. regards, kD

$400 USD in 20 days
(26 Reviews)
5.5
russianitgroup

Hello, Will be glad to discuss the project. Regards, IT-Group

$170 USD in 5 days
(23 Reviews)
4.6
DenialWang

can do in both java and c#, while c# code can't be used in unix/linux; we may discuss this later. thanks/denial

$100 USD in 3 days
(6 Reviews)
4.3
charleslewisjose

I can help you out with this project.

$250 USD in 7 days
(12 Reviews)
4.2
compositesoft

Hi, I have over 15 years of experience in the software industry and have worked on a wide variety of applications, technologies, databases and languages. I am an expert in java having worked in java for around 10 More

$350 USD in 10 days
(2 Reviews)
4.0
cahbou

This job could easily be done in Python or Java, but given the size of your dataset I would consider Java a better choice. I would most likely complete the project over night (US time), especially if you were to cut bi More

$80 USD in 2 days
(3 Reviews)
3.7