Đã Hủy

Extract financial statement data from public company filings in SEC EDGAR Database formatted in HTML and XBRL

SEC 10-Q and 10-K filings are filed as text SGML HTML or XBRL documents, with variations in the layout of the tables containing financial data (with the names of tables, order of columns, and the labeling of rows being different filing to filing). The goal of this project is to write a program to extract the basic financial information ("fundamentals") from 10-Q filings and import said information into a MySQL database. The program needs to be able to parse and extract the information from the document at a specified URL.

To make this project doable, the Freelancer needs only to demonstrate the program accomplishing the assigned task for all 3 statements referenced above converting from both HTML and XBRL format for 5 different companies. All of the tests must be correctly converted in order for this project to be considered complete.

The URL below presents the documents filed with the SEC for a specific company. As you will see, the filings are made in both HTML and XBRL formats. The program needs to be able to work with both and automatically accomplish the task with minimal user input. Being able to input a company's stock ticker (such as "AAPL" for apple) into the program and have it look up the statements on EDGAR would be a big bonus and shouldn't be overly difficult to achieve given the simplistic nature of EDGAR's structure. Willing to pay extra for software capable of downloading all company filings in EDGAR database and able to monitor the database for new filings and download them when they occur.

[url removed, login to view]

Description Continued:

Often there are all kinds of supporting tables in a 10-Q and 10-K, however you are only interested in the following three tables: Balance Sheet, Statement of Income/Operations, and Statement of Cash Flows. Within each table at the start of each row is an accounting term that is the label for that row. You will need to have enough basic accounting knowledge to be able to guess which XBRL tags most closely match the row labels used in the filings you are converting.

Further, for each quarter's or year's filing, you are only interested in that quarter's or year's numbers (normally other quarters are included for comparison), which means you are only interested in a single column of numbers in each table. This means you are really only interested in parsing out about 70 numbers at most from the whole document (in addition to downloading the line item names, such as "Net Revenue") as well.

This realization (that you are only trying to output list of about 70 line item names and corresponding numbers in XML format) should help you get a much better grip on the project as XBRL and 10-Qs initially seem daunting due to the flexibility of XBRL and all the extra text and tables (that you don't care about) in 10-Qs.

As we expect the conversion process to be heuristic, don't worry if you end up producing a hacked up script as long as it works with all the test URLs and all data output comes from the document being parsed.

Consult the SEC XBRL page and the XBRL site to get the correct XML schemas for XBRL for 10-Q and 10-K data.

[url removed, login to view]

[url removed, login to view]

Deliverables:

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition: i.e. A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

Kỹ năng: Lập trình C, HTML, MySQL, Kiến trúc phần mềm, Màn hình Windows

Xem thêm: automatically download data edgar database, company financial statement database, parse sec xbrl statements, extract company financial sec filing, sec edgar url explained, extract sec, sec filing extract, mysql financial database, extract edgar tables, financial statement names, year end process in accounting, www freelancer org in, www freelancer in hire buyer, write to documents freelancer, windows 10 freelancer install, will freelancer run on windows 10, user test freelancer, ticker line, the layout company, the freelancer company, the freelancer archives, tables in data structure, table data structure, source document in accounting, software in accounting

Về Bên Thuê:
( 0 nhận xét ) Lexington, United States

Mã Dự Án: #4546655

1 freelancer đang chào giá trung bình $222 cho công việc này

GeorgeKazi

See your inbox.

$222 USD trong 14 ngày
(0 Đánh Giá)
3.9