I need code or program that extracts 20-F forms from SEC's edgar site and allows me to do textual analysis (frequency of specific terms). For example : i want to be able to search for the frequency of the following terms: Leas*, pension, revenu*, IAS 17. I have a list of terms I am looking for. I previously had someone write me a program in which I input the webpage of the 20-F form and then type each term into a search box that returns the count for that term. However, this takes too long as I have to type each search word in individually for each 20-F form. I want something in which I can load all of my search terms at once and for a 20-F form and have it return the count for each word. I have found information on how to extract 20-F forms from edgar using Perl or Python but would rather have an experienced person do it. I can send you a excel spreadsheet with the search terms and one with the ID code for all the 20-F forms I need if this will help clarify my request.
25 freelancers are bidding on average $428 for this job
OK lets start discussion , we have great team for that development . Thanks Relevant Skills and Experience python Proposed Milestones $736 USD - latter we will discuss
extract and query 20-F edgar forms Relevant Skills and Experience Python Data mining DataBase - SQL, MONGODB, POSTGRESQL web automation NLP Proposed Milestones $500 USD - final