Skip to main content
Financial Datasets is a primary-source data provider of SEC filing data. We collect, normalize, and serve SEC-sourced datasets ourselves - giving you a direct, auditable path from regulatory filings to API response. For market data such as stock prices, we partner with a leading licensed, institutional-grade data provider to ensure the same level of quality and reliability. This page details exactly where each category of data originates and how it reaches your application.

Why Provenance Matters

Institutional workflows demand auditability. When a model output or trading signal is questioned, you need to trace every input back to its authoritative source. Opaque data supply chains introduce risk - you inherit unknown collection methodologies, latency, and error surfaces without visibility into any of it. Our architecture minimizes that risk. For SEC data, you get a direct, auditable path from source to API response. For market data, we vet and partner with institutional-grade providers so you always know where your data comes from.

Data Sources

Financial Statements & Fundamentals

All financial statements - income statements, balance sheets, and cash flow statements - are sourced directly from SEC filings (10-K, 10-Q, 8-K, and related exhibits). We parse the original XBRL and HTML submissions, normalize line items across reporting standards, and deliver a clean, structured dataset spanning 30+ years. Source: U.S. Securities and Exchange Commission (SEC) EDGAR system

SEC Filings

Our SEC filings endpoints serve filing metadata, full-text content, and structured exhibits pulled directly from EDGAR. There is no intermediary between the SEC’s public dissemination system and our API. Source: SEC EDGAR

Insider Transactions

Form 3, Form 4, and Form 5 filings are ingested directly from SEC EDGAR as they become available. We parse each filing into structured records with standardized fields for transaction type, share quantity, price, and ownership details. Source: SEC EDGAR

Institutional Ownership

13F filings are sourced and parsed directly from SEC EDGAR, providing quarterly snapshots of institutional holders, position sizes, and portfolio changes. Source: SEC EDGAR

Analyst Estimates & Earnings

Earnings data, including actuals, is derived directly from SEC filings and issuer disclosures. Source: SEC EDGAR and issuer publications

Stock Prices

Equity price data - including open, high, low, and close - is sourced from official exchange feeds via Databento, our market data partner. Databento delivers real-time and historical market data direct from colocation sites and is trusted by 3,000+ leading firms and high-growth startups. Sources: NYSE, NASDAQ, ARCA (via Databento)

News

Company news is sourced from publicly available news feeds. Articles are collected close to the point of publication and indexed against our company universe for precise ticker attribution. Source: Public news feeds

Provenance Guarantees

We maintain full transparency over every data source in our pipeline. This means:
  • Full auditability - every record traces to a named primary source or licensed data partner.
  • Primary-source SEC data - all SEC-derived datasets are collected and parsed by us directly from EDGAR, with no intermediary.
  • Vetted market data partners - where we use licensed data providers, we select institutional-grade partners and maintain full visibility into the data supply chain.
  • Controlled latency - we own the ingestion and processing path, so delays are measurable and within our SLA.
  • Single point of accountability - if something is wrong with the data, we own the fix end-to-end.

Questions

For due-diligence inquiries or detailed methodology documentation, contact us at [email protected].