This project is to extract text using Java RegEx. The attached sample file contains entries (one entry per line) which correspond to name, address, etc. I want another CSV file with the following columns:
Original entry, name, address, city, state, zip, phone number, ssn, tax id, alias_1,…, alias_18.
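As a rough illustration of the output layout, here is a minimal sketch of building one CSV row. The class and method names (CsvRow, quote, toCsvLine) are hypothetical, and the quoting convention (double quotes around fields that contain commas, quotes, or line breaks) is an assumption, not a stated requirement. The alias columns are always padded to 18 so that every row has the same width.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: formats one output row with a fixed number of alias columns.
final class CsvRow {
    static final int MAX_ALIASES = 18; // alias_1 .. alias_18, per the column list

    // Quotes a field only when it contains a comma, a quote, or a line break
    // (assumed convention; embedded quotes are doubled).
    static String quote(String field) {
        if (field == null) {
            return "";
        }
        boolean needsQuotes = field.contains(",") || field.contains("\"")
                || field.contains("\n") || field.contains("\r");
        return needsQuotes ? "\"" + field.replace("\"", "\"\"") + "\"" : field;
    }

    // original: the raw input line; fields: name, address, city, ... in column order.
    // Aliases beyond MAX_ALIASES are dropped here; a real implementation should log them.
    static String toCsvLine(String original, List<String> fields, List<String> aliases) {
        List<String> cells = new ArrayList<>();
        cells.add(quote(original));
        for (String f : fields) {
            cells.add(quote(f));
        }
        for (int i = 0; i < MAX_ALIASES; i++) {
            cells.add(i < aliases.size() ? quote(aliases.get(i)) : "");
        }
        return String.join(",", cells);
    }
}
```

Quoting matters most for the first column, since the original entry is likely to contain commas itself.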
Each entry has the following structure. Most of the elements are optional.
Person or business name
Address
City
State
Zip
Phone number
SSN
Tax ID
And a repeating block of
Alias type which may be one of dba, fdba, pdba, aka, fka, ow, …
Alias name
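To make the repeating alias block concrete, here is a minimal sketch of one way to scan for it. It assumes, without having checked the sample, that alias types appear as standalone words and that each alias name runs until the next alias type or the end of the line. The class name AliasScanner and the short type list are placeholders; the list must be extended from the sample file.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical scanner for the repeating "alias type + alias name" block.
final class AliasScanner {
    // Assumed, incomplete list of alias types; matched as whole words, ignoring case.
    private static final Pattern ALIAS_TYPE =
            Pattern.compile("\\b(dba|fdba|pdba|aka|fka|ow)\\b", Pattern.CASE_INSENSITIVE);

    // Returns {type, name} pairs in the order they occur in the line.
    static List<String[]> scan(String line) {
        List<String[]> result = new ArrayList<>();
        Matcher m = ALIAS_TYPE.matcher(line);
        String type = null;
        int nameStart = -1;
        while (m.find()) {
            if (type != null) {
                // The previous alias name ends where this alias type begins.
                result.add(new String[] {type, line.substring(nameStart, m.start()).trim()});
            }
            type = m.group(1).toLowerCase(Locale.ROOT);
            nameStart = m.end();
        }
        if (type != null) {
            // The last alias name runs to the end of the line.
            result.add(new String[] {type, line.substring(nameStart).trim()});
        }
        return result;
    }
}
```

Under these assumptions, scanning "ACME CORP dba Acme Tools aka Acme" yields the pairs (dba, Acme Tools) and (aka, Acme). Short tokens such as "ow" can also occur inside ordinary names, so matches of this kind are worth logging for review.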
Although a large sample file is attached, the implementation should anticipate variations in the input and log unexpected conditions so as to help diagnose problems and tune the code. For example, the alias types mentioned above are not exhaustive, and we should examine the sample to construct a complete list.
This implementation will be used within a larger system. Hence, the core should be robust and well designed for reuse. At a minimum, there should be one method:
public Record extract (String in)
that operates on each input line and populates a Record object. The Record would have properties name, address, city, state, zip, phone number, ssn, tax_id, List<String> aliases.
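A minimal skeleton of that method might look like the sketch below. It is only a starting point under stated assumptions: the Extractor class, the field names, and the two patterns are hypothetical, and the patterns assume the common US formats for an SSN (NNN-NN-NNNN) and a federal tax ID (NN-NNNNNNN), which may not match the sample file. Name, address, city, state, zip, phone number, and aliases are left unpopulated because their layout depends on the sample.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Logger;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Plain data holder. A class named Record declared in your own package takes
// precedence over java.lang.Record (Java 16+) within that package.
class Record {
    String original, name, address, city, state, zip, phoneNumber, ssn, taxId;
    final List<String> aliases = new ArrayList<>();
}

// Hypothetical extractor: one instance can be reused for every input line.
class Extractor {
    private static final Logger LOG = Logger.getLogger(Extractor.class.getName());

    // Assumed formats; verify against the sample file before relying on them.
    private static final Pattern SSN = Pattern.compile("\\b\\d{3}-\\d{2}-\\d{4}\\b");
    private static final Pattern TAX_ID = Pattern.compile("\\b\\d{2}-\\d{7}\\b");

    public Record extract(String in) {
        Record r = new Record();
        r.original = in;
        if (in == null || in.trim().isEmpty()) {
            LOG.warning("Empty input line");
            return r;
        }
        r.ssn = firstMatch(SSN, in);
        r.taxId = firstMatch(TAX_ID, in);
        if (r.ssn == null && r.taxId == null) {
            // Both elements are optional, so this is informational, not an error.
            LOG.fine("No SSN or tax ID found in: " + in);
        }
        // Remaining fields would be filled in here once their layout is known.
        return r;
    }

    // Returns the first match of the pattern in the text, or null if there is none.
    private static String firstMatch(Pattern p, String s) {
        Matcher m = p.matcher(s);
        return m.find() ? m.group() : null;
    }
}
```

Keeping the compiled patterns in static fields avoids recompiling them for every line, which matters when the method is called once per entry in a large file.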