Đã Hủy

Extract financial statement data from 10-Q & 10-K filings in SEC EDGAR Database formatted in HTML and XBRL

SEC 10-Q and 10-K filings are filed as text SGML HTML or XBRL documents, with variations in the layout of the tables containing financial data (with the names of tables, order of columns, and the labeling of rows being different filing to filing). The goal of this project is to write a program to extract the basic financial information ("fundamentals") from 10-Q filings and import said information into a MySQL database. The program needs to be able to parse and extract the information from the document at a specified URL.

To make this project doable, the Freelancer needs only to demonstrate the program accomplishing the assigned task for all 3 statements referenced above converting from both HTML and XBRL format for 5 different companies. All of the tests must be correctly converted in order for this project to be considered complete.

The URL below presents the documents filed with the SEC for a specific company. As you will see, the filings are made in both HTML and XBRL formats. The program needs to be able to work with both and automatically accomplish the task with minimal user input. Being able to input a company's stock ticker (such as "AAPL" for apple) into the program and have it look up the statements on EDGAR would be a big bonus and shouldn't be overly difficult to achieve given the simplistic nature of EDGAR's structure.

[url removed, login to view]

Description Continued:

Often there are all kinds of supporting tables in a 10-Q and 10-K, however you are only interested in the following three tables: Balance Sheet, Statement of Income/Operations, and Statement of Cash Flows. Within each table at the start of each row is an accounting term that is the label for that row. You will need to have enough basic accounting knowledge to be able to guess which XBRL tags most closely match the row labels used in the filings you are converting.

Further, for each quarter's or year's filing, you are only interested in that quarter's or year's numbers (normally other quarters are included for comparison), which means you are only interested in a single column of numbers in each table. This means you are really only interested in parsing out about 70 numbers at most from the whole document (in addition to downloading the line item names, such as "Net Revenue") as well.

This realization (that you are only trying to output list of about 70 line item names and corresponding numbers in XML format) should help you get a much better grip on the project as XBRL and 10-Qs initially seem daunting due to the flexibility of XBRL and all the extra text and tables (that you don't care about) in 10-Qs.

As we expect the conversion process to be heuristic, don't worry if you end up producing a "hacked up" script as long as it works with all the test URLs and all data output comes from the document being parsed (in other words, no hardcoding the data into the script).

Consult the SEC XBRL page and the XBRL site to get the correct XML schemas for XBRL for 10-Q and 10-K data.

[url removed, login to view]

[url removed, login to view]


1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition: i.e. A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

Required Language: Your choice.

Required Platform: Windows Server 2008 r2

Kĩ năng: MySQL, PHP, Kiến trúc phần mềm

Xem nhiều hơn: filings, edgar database, your legal consult, year end process in accounting, www freelancer org in, www freelancer in hire buyer, write to documents freelancer, words that end in bid, windows 10 freelancer install, will freelancer run on windows 10, user test freelancer, ticker line, the freelancer archives, tables in data structure, table data structure, source documents required accounting, source document in accounting, software in accounting, software freelancer hire, run freelancer on windows 10, qs freelancer, q net freelancer, project required from freelancer, project names in freelancer, process data structure

Về Bên Thuê:
( 0 nhận xét ) Lexington, United States

ID dự án: #4546628