Hi,
I have gone through the project description and have few question to ask before we proceed further.
How data would be compared and on which basis, what is the size of the data, If spark and storm need to be installed then how many node cluster would be required for comparison of data. what is the content of the files structured or unstructured . Please let me know these details. i will wait for your response.
Thanks and Regards,
Manoj