BLOG

Behind the numbers.

Code examples, market analysis, and data quality deep-dives.

Is MSTR a Leveraged Bitcoin Proxy? Rolling Beta Analysis in Python
Is Micron's Memory Cycle Recovering? Inventory and Margin Forecasting in Python
Which Sectors Work When Bonds Rally? Rate-Sensitive Rotation in Python
Do One-Month Price Extremes Reverse? Signal Evaluation in Python
Do Low-Volatility S&P 500 Stocks Reduce Drawdowns? Factor Test in Python
Is AI Capex Paying Back Fast Enough? Revenue Hurdle Forecasting in Python
Could Shorter AI Asset Lives Hit Earnings? Depreciation Stress Test in Python
How Much AI Capex Risk Can a Portfolio Remove? Constrained Optimization in Python
Is the AI Capex Trade Crowded? Rolling Volatility and Sector Rotation in Python
Did the AI Boom Come From Existing S&P 500 Members? Point-in-Time Momentum Test in Python
Is AI Revenue Circular? Customer-Vendor Capex Loop Analysis in Python
Is the AI Trade Connected to Private Credit? Rolling Correlation Network in Python
Is Apollo More Balance-Sheet Sensitive Than Peers? Leverage Screen in Python
Are AI Earnings Supported by Cash Flow? Accrual and Capex Screen in Python
Can Defensive Stocks Hedge AI Drawdowns? Basket Regime Test in Python
How Fast Does the Market Price In Fed Decisions? FOMC Event Study in Python
How Much Are Options Sellers Overpaid? The Variance Risk Premium in Python
Which Companies Have the Worst Earnings Quality? Sloan Accrual Screen with Geographic Revenue Data in Python
Does the Oil-to-Gold Ratio Signal Recessions? XLE/GLD Backtest in Python
Is AI Spending Crowding Out Free Cash Flow? Capex Sustainability Across the Mag 7 in Python
Does a Long Energy / Short Bonds Portfolio Capture Inflation Surprises? Factor Construction in Python
Can a Hidden Markov Model Detect Oil Market Regimes? HMM Analysis in Python
Do Grain Prices Predict Food Inflation? Granger Causality Test in Python
Does the Corporate Credit Spread Predict Stock Market Crashes? BAA-AAA Spread Analysis in Python
Do Oil Stocks Hedge Inflation? Rolling Beta Analysis in Python
Which Stocks Are Most Rate-Sensitive? Equity Duration via Bond Beta in Python
Which Companies Have the Highest Accrual Ratios? Earnings Quality Screening in Python
Is Alpha Persistent or Decaying? Rolling Sharpe Ratio Analysis in Python
Are Markets Trending or Mean-Reverting? Hurst Exponent Analysis in Python
Is Consumer Discretionary vs Staples a Leading Indicator? XLY/XLP Ratio Analysis in Python
Does Heavy Capex Predict Future Stock Returns? Capital Expenditure Analysis in Python
How to Estimate Cost of Equity Using CAPM in Python
Is Volatility Predictable? Testing for Volatility Clustering in Python
Which Industrials Are Overleveraged? Net Debt to EBITDA Screening in Python
GM Before and After Bankruptcy: Why Entity Resolution Matters for Financial Data
What Is Adjusted Beta? Merrill Lynch Beta Shrinkage in Python
How Good Is a Stock Pick? Information Ratio and Tracking Error in Python
Do Stock Returns Follow a Normal Distribution? Testing for Fat Tails in Python
Which Large Caps Have the Highest Free Cash Flow Yield? FCF Screening in Python
Which Sectors Won Over 5 Years? Sector Rotation Analysis in Python
How to Forecast Stock Volatility with GARCH Models in Python
Are Stock Prices Mean-Reverting? Augmented Dickey-Fuller Test in Python
How to Calculate CAPM Alpha and Beta with Regression in Python
How to Compare Sector Sharpe Ratios and Sortino Ratios in Python
DELL: Why Stitching Historical Price Data Together Is Wrong
How to Analyze Drawdown and Recovery for Bank Stocks in Python
How to Screen SaaS Stocks by Revenue Growth and Cash Flow in Python
How to Screen REITs by Dividend Yield and Valuation in Python
How Correlated Are the Magnificent 7? Intra-Group Correlation in Python
AAPL vs XOM: Do Individual Stocks Have Seasonal Patterns?
How to Rank Large-Cap Stocks by Momentum in Python
How to Build a Multi-Endpoint Financial Dashboard in Python
How to Compare Volatility Across Energy Stocks in Python
How to Screen Healthcare Stocks by Valuation in Python
How to Build a Sector Correlation Matrix for Portfolio Diversification in Python
How to Find Oversold and Overbought Stocks Using Z-Scores in Python
How to Measure Earnings Quality: Cash Flow vs Net Income in Python
How to Build a Multi-Factor Stock Screen in Python (Value + Momentum + Quality)
How to Build a Simple DCF Model for Any Stock in Python
How to Screen Tech Stocks by Revenue Growth in Python
How to Screen Stocks by Balance Sheet Health in Python
Is "Sell in May" Real? SPY Monthly Seasonality Over 10 Years
How to Compare Sector Performance YTD Using Python
How to Track S&P 500 Additions and Removals Over Time in Python
How to Screen Dividend Stocks by Yield and Quality in Python
How to Calculate Max Drawdown and Recovery Time for Any Stock in Python
How to Compare Profitability Across Mega-Cap Tech Stocks in Python
Why Ticker Symbols Are Unreliable: The Recycling Problem Every Quant Should Know
How to Calculate and Compare Stock Volatility in Python
How to Screen Blue-Chip Stocks by P/E Ratio in Python
How to Track Companies Through Ticker Changes, Bankruptcies, and Renames in Python
S&P 500 Turnover: How Much the Index Has Changed Since 2010
How to Calculate Stock Beta and Correlation in Python
← All articles

Why Ticker Symbols Are Unreliable: The Recycling Problem Every Quant Should Know

What’s the question?

Ticker symbols are the most visible identifiers in financial markets, but they are also among the least reliable. A ticker is not a permanent label for a company; it is a temporary assignment by a stock exchange that can be revoked, reassigned, or recycled at any time. The symbol “FB” has been assigned to four completely different companies since 1971. “GM” maps to two separate legal entities on either side of a bankruptcy filing. If a quantitative analysis uses ticker symbols as the primary key for joining or filtering data, it risks silently combining price, fundamental, and reference data from unrelated companies into a single record — and no error will be raised because the join key matches. How widespread is this problem, and what does a correct solution look like?

The approach

We examine four tickers with documented recycling histories: FB (four companies across five decades), DELL (two entities separated by a privatization), GM (two entities separated by a bankruptcy), and TWX (two entities with overlapping usage). For each ticker, the resolve endpoint returns every company that has ever used that symbol, along with the exact date range of each assignment and a permanent entity_id.

To demonstrate the practical consequence, we show what happens when a naive data pull for “FB” is issued without entity resolution, then contrast it with a scoped query that uses date-range filtering to retrieve only Meta Platforms’ price data.

import xfinlink as xfl

xfl.set_api_key("your_key")  # free at https://xfinlink.com/signup

# Tickers that have been genuinely recycled between different companies
recycled = ["FB", "DELL", "GM", "TWX"]

print("=== Ticker Recycling: One Symbol, Multiple Companies ===")
print()

for ticker in recycled:
    info = xfl.resolve(ticker)
    entities = info["data"][ticker]["entities"]
    print(f'{ticker} — {len(entities)} entit{"y" if len(entities) == 1 else "ies"}')
    for e in entities:
        valid_from = e.get("ticker_valid_from", "?")
        valid_to = e.get("ticker_valid_to") or "present"
        print(f'  {e["name"]}')
        print(f'    {valid_from} → {valid_to} (entity_id={e["entity_id"]})')
    print()

print('=== The Danger: Naive Data Pulls ===')
print('If you query 30 years of "FB" price data without entity resolution,')
print("you could get a time series mixing:")
print("  1971-1983: BankBoston Corp (banking)")
print("  1994-1997: Falcon Building Products (manufacturing)")
print("  1999-2003: FBR Asset Investment (finance)")
print("  2012-2022: Meta Platforms (social media)")
print()
print("These are 4 completely unrelated companies.")
print("Backtesting on this data would be meaningless.")
print()

print('=== Safe Pull: FB prices scoped to Meta only (2021-2022) ===')
fb_prices = xfl.prices("FB", start="2021-01-04", end="2021-01-08", fields=["close"])
if len(fb_prices) > 0:
    print(fb_prices[["ticker", "entity_name", "date", "close"]].to_string(index=False))
else:
    print("(No data)")

Output:

=== Ticker Recycling: One Symbol, Multiple Companies ===

FB — 4 entities
  BANKBOSTON CORP
    1971-01-07 → 1983-04-03 (entity_id=809)
  FALCON BUILDING PRODUCTS INC
    1994-11-03 → 1997-06-17 (entity_id=17829)
  F B R ASSET INVESTMENT CORP
    1999-09-29 → 2003-03-28 (entity_id=28677)
  Meta Platforms Inc
    2012-05-18 → 2022-06-08 (entity_id=2)

DELL — 2 entities
  DELL INC
    1988-06-22 → 2013-10-29 (entity_id=10408)
  DELL TECHNOLOGIES INC
    2018-12-28 → present (entity_id=65047)

GM — 2 entities
  General Motors Corporation (pre-2009 bankruptcy)
    1962-07-02 → 2009-06-01 (entity_id=4)
  General Motors Company
    2010-11-18 → present (entity_id=5)

TWX — 2 entities
  TIME WARNER INC
    1989-12-11 → present (entity_id=14682)
  A O L INC
    2003-10-16 → 2018-06-14 (entity_id=33763)

=== The Danger: Naive Data Pulls ===
If you query 30 years of "FB" price data without entity resolution,
you could get a time series mixing:
  1971-1983: BankBoston Corp (banking)
  1994-1997: Falcon Building Products (manufacturing)
  1999-2003: FBR Asset Investment (finance)
  2012-2022: Meta Platforms (social media)

These are 4 completely unrelated companies.
Backtesting on this data would be meaningless.

=== Safe Pull: FB prices scoped to Meta only (2021-2022) ===
ticker        entity_name       date     close
    FB Meta Platforms Inc 2021-01-04 268.94000
    FB Meta Platforms Inc 2021-01-05 270.97000
    FB Meta Platforms Inc 2021-01-06 263.31000
    FB Meta Platforms Inc 2021-01-07 268.73999
    FB Meta Platforms Inc 2021-01-08 267.57001

What this tells us

The four tickers examined expose distinct categories of recycling risk. FB represents pure recycling: four companies in four different industries (banking, manufacturing, financial services, and social media) shared the same two-character symbol across non-overlapping time periods. A naive data query would stitch together price data from a 1970s bank and a 2010s social media company into a single time series, producing a return calculation that is arithmetically correct but economically meaningless.

GM illustrates the bankruptcy discontinuity problem. General Motors Corporation (entity_id 4) was liquidated in 2009. General Motors Company (entity_id 5) is a different legal entity with different shareholders, a different capital structure, and a different balance sheet. The ticker is the same, but the underlying company is not. Treating pre-2009 and post-2010 GM data as a continuous series introduces a structural break that distorts any long-horizon return or risk calculation.

DELL shows the privatization gap. Dell Inc was delisted in October 2013 and Dell Technologies re-listed in December 2018. The five-year gap during which no “DELL” traded means any data vendor that fills gaps by interpolation or carries forward stale data will produce fabricated returns for a period when the stock did not exist in public markets.

TWX presents an overlapping usage case, where Time Warner and AOL both used the ticker during partially overlapping periods following their merger and subsequent separation. This creates ambiguity that can only be resolved by examining the entity_id and validity dates.

So what?

Ticker recycling is not an edge case affecting obscure securities. It occurs regularly in the large-cap universe, affecting household names like Facebook, General Motors, and Dell. Any data pipeline, backtest, or analytical workflow that uses ticker symbols as merge keys without entity resolution is vulnerable to data contamination. The entity_id provides the solution: a permanent identifier assigned to each distinct legal entity that persists through ticker changes, name changes, and exchange transfers. When building historical analyses, the correct pattern is to resolve each ticker to its entity_id first, then scope all subsequent data queries to the entity rather than the symbol. If a data source does not support entity resolution, every long-horizon query should be audited for recycling contamination.

Built with xfinlink — free financial data API for Python. pip install xfinlink
← All articles