Lexical Density Filings Strategy (Python). StockSharp

Author: StockSharp
N: 2023
v5.0.0 (8/7/2025)
Downloads: 0

This factor strategy examines the language used in regulatory filings to gauge
future stock performance. Lexical density is measured as the fraction of unique
terms in the most recent report. Dense filings suggest rich, information-heavy
disclosures that often precede stronger returns, while sparse wording can mask
weakness.
Each quarter the universe is sorted by lexical density. The highest quintile is
held long and the lowest quintile is shorted, with positions equally weighted.
Rebalancing occurs during the first three trading days of February, May, August
and November and positions remain open between reviews without stop-losses.
Backtests on broad U.S. equities show the factor provides a steady premium with
moderate turnover, making it a useful component in multi-factor portfolios.

  • Entry Criteria: Quarterly sort by lexical density; long top quintile,

short bottom quintile

  • Long/Short: Both
  • Exit Criteria: Next rebalance
  • Stops: No
  • Default Values:

    • Quintile = 5
    • MinTradeUsd = 200
    • CandleType = TimeSpan.FromDays(1)

  • Filters:

    • Category: Fundamental
    • Direction: Both
    • Indicators: Text analytics
    • Stops: No
    • Complexity: Intermediate
    • Timeframe: Multi-month
    • Seasonality: Yes
    • Neural networks: No
    • Divergence: No
    • Risk level: Medium