
Securities and Exchange Commission (SEC) EDGAR database. EDGAR contains regulatory filings from publicly-traded US corporations including their annual and quarterly reports:

All companies, foreign and domestic, are required to file registration statements, periodic reports, and other forms electronically through EDGAR. Anyone can access and download this information for free. [from the SEC website]

See http://www.sec.gov/edgar/searchedgar/companysearch.html


EDGAR provides bulk access via FTP: ftp://ftp.sec.gov/ – [official documentation][ftp-doc]. We summarize here the main points.

Each company in EDGAR gets an identifier known as the CIK, which is a 10-digit number. You can find the CIK by searching EDGAR using a company name or stock market ticker.

For example, searching for IBM by ticker shows us that the CIK is 0000051143.

Note that leading zeroes are often omitted (e.g. in the ftp access) so this would become 51143.
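As a small illustration of the two CIK spellings described above, here is a minimal Python sketch. The helper names `pad_cik` and `strip_cik` are made up for this example and are not part of any EDGAR tooling.

```python
def pad_cik(cik):
    """Return the 10-digit, zero-padded form of a CIK (e.g. for display)."""
    digits = str(cik).strip()
    if not digits.isdigit() or len(digits) > 10:
        raise ValueError(f"not a valid CIK: {cik!r}")
    return digits.zfill(10)


def strip_cik(cik):
    """Return the CIK without leading zeroes (the form used in file paths)."""
    digits = str(cik).strip()
    if not digits.isdigit():
        raise ValueError(f"not a valid CIK: {cik!r}")
    return str(int(digits))


print(pad_cik(51143))
print(strip_cik("0000051143"))
```

Keeping the CIK as a string rather than an integer avoids losing the padding by accident when it is written back out.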


Next, each submission receives an ‘Accession Number’ (acc-no). For example, IBM’s quarterly financial filing (form 10-Q) in October 2013 had accession number: 0000051143-13-000007.

Given a company with CIK (company ID) XXX (omitting leading zeroes) and document accession number YYY (acc-no on search results), file paths are of the form edgar/data/XXX/YYY.txt, relative to the FTP root or to /Archives/ on the HTTPS site.

For example, for the IBM data above it would be:

ftp://ftp.sec.gov/edgar/data/51143/0000051143-13-000007.txt

https://www.sec.gov/Archives/edgar/data/51143/0000051143-13-000007.txt

EDGAR has since retired its FTP service, so use the HTTPS equivalent instead:

https://www.sec.gov/Archives/edgar/data/51143/0000051143-13-000007.txt
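The path rule above can be sketched in a few lines of Python. This is only an illustration: the function name `filing_url` and the constant `ARCHIVES_BASE` are invented for this example, and the URL layout is simply the one shown in the IBM example.

```python
ARCHIVES_BASE = "https://www.sec.gov/Archives"


def filing_url(cik, accession_number):
    """Build the HTTPS URL of a filing's full-text (.txt) submission.

    cik may be given with or without leading zeroes; they are dropped
    because the path uses the unpadded form.
    """
    unpadded_cik = str(int(str(cik)))
    return f"{ARCHIVES_BASE}/edgar/data/{unpadded_cik}/{accession_number}.txt"


print(filing_url("0000051143", "0000051143-13-000007"))
```

Note that the accession number keeps its leading zeroes and dashes in the file name; only the CIK is shortened.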

Note that if you are looking for a nice HTML version, you can find it in the Archives section at a similar URL: insert a folder named after the accession number with the dashes removed, and replace .txt with -index.htm:

http://www.sec.gov/Archives/edgar/data/51143/000005114313000007/0000051143-13-000007-index.htm
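Judging from the example URL above, the index page lives in a folder named after the accession number with its dashes removed. A hedged Python sketch of that pattern follows; `index_page_url` is a made-up name, and the HTTPS scheme is assumed rather than the HTTP one in the example.

```python
def index_page_url(cik, accession_number):
    """Build the URL of the HTML index page for one filing.

    The folder name is the accession number without dashes; the page
    itself is the dashed accession number followed by "-index.htm".
    This mirrors the IBM example and is not an official specification.
    """
    unpadded_cik = str(int(str(cik)))
    folder = accession_number.replace("-", "")
    return (
        "https://www.sec.gov/Archives/edgar/data/"
        f"{unpadded_cik}/{folder}/{accession_number}-index.htm"
    )


print(index_page_url(51143, "0000051143-13-000007"))
```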

If you want to get a list of all filings you’ll want to grab an Index. As the help page explains:

The EDGAR indices are a helpful resource for HTTPS retrieval, listing the following information for each filing: Company Name, Form Type, CIK, Date Filed, and File Name (including folder path).

Four types of indexes are available:

  • company — sorted by company name
  • form — sorted by form type
  • master — sorted by CIK number
  • XBRL — list of submissions containing XBRL financial files, sorted by CIK number; these include Voluntary Filer Program submissions
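To give a feel for how such an index might be consumed, here is a minimal Python sketch that parses the master index. It assumes the decompressed file is pipe-delimited, with a short text preamble followed by rows of CIK, company name, form type, date filed and file name; verify this against a real index before relying on it. The sample rows and the names `IndexEntry` and `parse_master_index` are invented for this example.

```python
from typing import Iterator, NamedTuple


class IndexEntry(NamedTuple):
    cik: str
    company_name: str
    form_type: str
    date_filed: str
    file_name: str


def parse_master_index(text: str) -> Iterator[IndexEntry]:
    """Yield one IndexEntry per data row of a master index."""
    for line in text.splitlines():
        parts = line.strip().split("|")
        # Data rows have exactly five fields and start with a numeric CIK;
        # this skips the preamble, the column header and the dashed rule.
        if len(parts) == 5 and parts[0].isdigit():
            yield IndexEntry(*parts)


# Hypothetical sample, not copied from a real index file.
SAMPLE = """\
Description: Master Index of EDGAR Dissemination Feed
CIK|Company Name|Form Type|Date Filed|Filename
--------------------------------------------------------------------------------
1234567|EXAMPLE CORP|10-K|2008-11-14|edgar/data/1234567/0001234567-08-000001.txt
1234567|EXAMPLE CORP|8-K|2008-12-01|edgar/data/1234567/0001234567-08-000002.txt
"""

for entry in parse_master_index(SAMPLE):
    print(entry.form_type, entry.date_filed, entry.file_name)
```

The file name column already contains the folder path, so it can be appended to the Archives base URL to fetch the filing.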

URLs are like:


ftp://ftp.sec.gov/edgar/full-index/2008/QTR4/master.gz

https://www.sec.gov/Archives/edgar/full-index/2008/QTR4/master.gz

That is, they have the following general form: edgar/full-index/{YYYY}/QTR{1-4}/{index-name}.gz

So for XBRL in the 3rd quarter of 2010 we’d do:

ftp://ftp.sec.gov/edgar/full-index/2010/QTR3/xbrl.gz

https://www.sec.gov/Archives/edgar/full-index/2010/QTR3/xbrl.gz
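The index URL pattern from the two examples above could be generated like this. It is a sketch only: `index_url` and `INDEX_NAMES` are names chosen for illustration, and only the .gz variant seen in the examples is covered.

```python
INDEX_NAMES = {"company", "form", "master", "xbrl"}


def index_url(year, quarter, name="master"):
    """Build the HTTPS URL of a quarterly full index (gzip-compressed)."""
    if quarter not in (1, 2, 3, 4):
        raise ValueError("quarter must be 1, 2, 3 or 4")
    if name not in INDEX_NAMES:
        raise ValueError(f"unknown index type: {name!r}")
    return (
        "https://www.sec.gov/Archives/edgar/full-index/"
        f"{year}/QTR{quarter}/{name}.gz"
    )


print(index_url(2008, 4, "master"))
print(index_url(2010, 3, "xbrl"))
```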

[ftp-doc]: https://www.sec.gov/edgar/searchedgar/ftpusers.htm

There’s a full list of all companies along with their CIK code here: http://www.sec.gov/edgar/NYU/cik.coleft.c

If you want to look up a CIK or company by its ticker you can do the following query against the normal search system:

http://www.sec.gov/cgi-bin/browse-edgar?CIK=ibm&Find=Search&owner=exclude&action=getcompany&output=atom

Then parse the Atom feed to grab the CIK. (If you prefer HTML output, just omit output=atom.)
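A rough Python sketch of that lookup is below. The query parameters are the ones from the URL above, with HTTPS assumed. The structure of the returned feed is an assumption: the code simply looks for the first element whose local name is `cik`, in any namespace, and the sample feed is a hand-written stand-in rather than real EDGAR output. `lookup_url` and `extract_cik` are hypothetical helper names.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def lookup_url(ticker):
    """Build the company search URL that returns Atom output."""
    params = {
        "CIK": ticker,
        "Find": "Search",
        "owner": "exclude",
        "action": "getcompany",
        "output": "atom",
    }
    return "https://www.sec.gov/cgi-bin/browse-edgar?" + urlencode(params)


def extract_cik(atom_xml):
    """Return the text of the first <cik> element found, or None."""
    root = ET.fromstring(atom_xml)
    for element in root.iter():
        if not isinstance(element.tag, str):
            continue
        # Tags look like "{namespace}name"; keep only the local name.
        local_name = element.tag.rsplit("}", 1)[-1]
        if local_name.lower() == "cik" and element.text:
            return element.text.strip()
    return None


# Hand-written stand-in for a response; the real feed has more fields.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <company-info>
    <cik>0000051143</cik>
  </company-info>
</feed>"""

print(lookup_url("ibm"))
print(extract_cik(SAMPLE_FEED))
```

Fetching the URL is left out on purpose; any HTTP client will do, and the response body can be passed straight to `extract_cik`.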

There is also a full-text company name to CIK lookup here:

http://www.sec.gov/edgar/searchedgar/cik.htmL

(Note this does a POST to a ‘text’ API at http://www.sec.gov/cgi-bin/cik.pl.c)

See scripts and README file there.

  • CorpWatch have an excellent API and DB dump covering a lot of EDGAR info – see the CorpWatch DataHub Entry


Phạm Văn Sỹ

I am Phạm Văn Sỹ, a trusted expert in economics and business and a student of Foreign Trade University (Đại học Ngoại Thương). With broad knowledge gained over 12 years in the marketplace, I hope to share useful, in-depth knowledge with everyone.
