openedgar

What's expected as outcome of processing?

Open abolotnov opened this issue 7 years ago • 3 comments

Hi,

I installed it in local mode per the instructions, downloaded the filing index for 2018, and ran process_all_filing_index(year=2018, form_type_list=["10-Q"])

Celery picked up the tasks and, after some time, I ended up with a lot of txt content (it looks like a mixture of plain text and HTML) in the edgar/data folder, plus records in _companyinfo, _filing, and _filingdata. But there is no actual content broken down into sections or individual pieces. Is this the expected outcome? Do I need to do additional processing to extract the actual content?
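To make the "mixture of txt and HTML" observation concrete, here is a minimal sketch that walks a download folder and counts how many files look like HTML versus plain text. It is not part of openedgar: the function names and the tag-sniffing heuristic are hypothetical, and it assumes the files sit under a local directory such as edgar/data.

```python
from collections import Counter
from pathlib import Path

# Hypothetical heuristic: a file counts as HTML if one of these tags
# appears in its first few kilobytes. This is an assumption, not an
# openedgar rule.
HTML_MARKERS = (b"<html", b"<body", b"<table", b"<div")


def classify_file(path: Path, sniff_bytes: int = 4096) -> str:
    """Return 'html' if the start of the file contains a known tag, else 'text'."""
    with open(path, "rb") as handle:
        head = handle.read(sniff_bytes).lower()
    return "html" if any(marker in head for marker in HTML_MARKERS) else "text"


def summarize_downloads(root: str) -> Counter:
    """Count files under root (recursively) by the label from classify_file."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[classify_file(path)] += 1
    return counts
```

Calling summarize_downloads("edgar/data") returns a Counter keyed by "html" and "text". It only describes what was downloaded; it says nothing about whether the filings were parsed into sections.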

Also, the django app - is this just a skeleton and not supposed to do anything other than user registration and login/logout?

thanks!

abolotnov, Dec 17 '18 22:12

Last I checked, this script was written for 10-Ks. Try "10-K" instead. I know C++, but I am learning Python so I can figure out how to get this to work. Do you use Visual Studio for coding?

johllmichael, Mar 13 '19 13:03

I gave up on this one. It does work, but I don't understand how to validate that it completed everything properly, because I don't understand what its outcome should ultimately be. Tika keeps dying with out-of-memory errors regardless of the configurations I tried, including large instances. Besides, it looks like the developers abandoned the project or something.

abolotnov, Mar 13 '19 20:03

@abolotnov Did you find anything better?

jcrben, Dec 20 '19 23:12