Why Provenance Matters
Institutional workflows demand auditability. When a model output or trading signal is questioned, you need to trace every input back to its authoritative source. Opaque data supply chains introduce risk - you inherit unknown collection methodologies, latency, and error surfaces without visibility into any of it. Our architecture minimizes that risk. For SEC data, you get a direct, auditable path from source to API response. For market data, we vet and partner with institutional-grade providers so you always know where your data comes from.Data Sources
Financial Statements & Fundamentals
All financial statements - income statements, balance sheets, and cash flow statements - are sourced directly from SEC filings (10-K, 10-Q, 8-K, and related exhibits). We parse the original XBRL and HTML submissions, normalize line items across reporting standards, and deliver a clean, structured dataset spanning 30+ years. Source: U.S. Securities and Exchange Commission (SEC) EDGAR systemSEC Filings
Our SEC filings endpoints serve filing metadata, full-text content, and structured exhibits pulled directly from EDGAR. There is no intermediary between the SEC’s public dissemination system and our API. Source: SEC EDGARInsider Transactions
Form 3, Form 4, and Form 5 filings are ingested directly from SEC EDGAR as they become available. We parse each filing into structured records with standardized fields for transaction type, share quantity, price, and ownership details. Source: SEC EDGARInstitutional Ownership
13F filings are sourced and parsed directly from SEC EDGAR, providing quarterly snapshots of institutional holders, position sizes, and portfolio changes. Source: SEC EDGARAnalyst Estimates & Earnings
Earnings data, including actuals, is derived directly from SEC filings and issuer disclosures. Source: SEC EDGAR and issuer publicationsStock Prices
Equity price data - including open, high, low, and close - is sourced from official exchange feeds via Databento, our market data partner. Databento delivers real-time and historical market data direct from colocation sites and is trusted by 3,000+ leading firms and high-growth startups. Sources: NYSE, NASDAQ, ARCA (via Databento)News
Company news is sourced from publicly available news feeds. Articles are collected close to the point of publication and indexed against our company universe for precise ticker attribution. Source: Public news feedsProvenance Guarantees
We maintain full transparency over every data source in our pipeline. This means:- Full auditability - every record traces to a named primary source or licensed data partner.
- Primary-source SEC data - all SEC-derived datasets are collected and parsed by us directly from EDGAR, with no intermediary.
- Vetted market data partners - where we use licensed data providers, we select institutional-grade partners and maintain full visibility into the data supply chain.
- Controlled latency - we own the ingestion and processing path, so delays are measurable and within our SLA.
- Single point of accountability - if something is wrong with the data, we own the fix end-to-end.