Live regime intelligence for systematic crypto traders. See the free tools
Systematic Trading

Crypto Backtesting: What General Guides Don't Cover

Crypto backtesting has unique challenges that general guides miss. Here is how to handle them and avoid the most common errors specific to crypto.

8
 mins read
Intermediate
Technical
18 June 2026
TL;DR

Crypto backtesting shares the same core methodology as backtesting any trading strategy, but introduces several challenges that general guides do not address. A 24/7 market with no natural session close, pronounced multi-year bull and bear cycles, exchange-specific data fragmentation, funding rates on perpetual futures, and very short market history all affect how crypto backtests should be designed, run, and interpreted.

15
Years of tradeable Bitcoin history, most altcoins have far less
4
Distinct crypto market phases any valid backtest should span
8
Hours between perpetual futures funding rate settlements

What Makes Crypto Backtesting Different

Three properties of crypto markets make backtesting harder than backtesting traditional instruments.

Very short market history. Bitcoin has approximately 15 years of tradeable data. Most altcoins have 3 to 7 years. A strategy requiring 3 years of historical data to validate is consuming 20 to 50% of the available history. There is no good solution, only the constraint to acknowledge. More frequent live validation via shadow data tracking compensates for the limited historical baseline.

Pronounced macro cycles. Crypto has clearly distinct bull and bear phases: 2017 bull, 2018 bear, 2020-2021 bull, 2022 bear, 2023-2024 recovery. A backtest covering only 2020-2021 covers one of the strongest bull markets in asset history. Any valid crypto backtest must span at least one complete cycle, including a meaningful bear phase, to produce results applicable to the full range of market conditions.

No natural session boundaries. Crypto trades 24/7. Daily bars close at midnight UTC by convention, not by market structure. For daily strategies, results can vary depending on which timezone's midnight is used. For 4-hour and shorter timeframes, this matters less. But the absence of overnight gaps, opening auctions, and institutional flows that structure traditional market sessions means some traditional technical patterns apply differently in crypto.

Data Quality Challenges in Crypto Backtesting

Exchange-specific data. A backtest on Binance data reflects Binance prices and Binance liquidity. A live trading account on a different exchange faces different fills. Price differences across exchanges create basis risk that single-exchange backtests cannot capture. For liquid assets (BTC, ETH) the difference is small. For less liquid altcoins, it can be significant.

Missing data and gap events. Most exchanges experienced outages during high-volatility events: the LUNA collapse, the FTX collapse, multiple flash crashes. Historical data from these periods may show prices that were not tradeable at the time. Strategies that appear to handle these events in backtesting may have been executing on prices unavailable in live conditions. Review the backtest period's major events and verify the data quality around those dates.

Survivorship bias. Backtesting on current top-20 assets selects for assets that survived and appreciated. Many assets in the top 20 by market cap in 2018 or 2021 have since lost 90%+ of value or ceased to trade. A portfolio manager in 2019 would have included those assets. A backtest on today's top 20 going back to 2019 is a pre-filtered set of survivors. The survivorship bias inflates returns for any diversified crypto strategy tested on current rankings.

Volume data reliability. Crypto exchange-reported volume includes wash trading, fee rebate incentives, and inflated reporting across many smaller exchanges. Volume-based indicators that work reliably on equity exchanges are less reliable in crypto. ATR-based measures of market activity are more robust than volume-based measures in crypto backtesting.

Regime Shifts Across Crypto Market Cycles

Crypto strategy performance depends on two regime levels: the tactical regime (trending vs ranging within a cycle, measured by ADX) and the macro regime (where the market sits in its multi-year cycle). A backtest that only segments by tactical regime misses the macro context that changes strategy performance fundamentally.

A mean-reversion strategy that works well during a ranging accumulation phase produces losses during the directional leg of a bull market. A trend-following strategy producing strong returns during 2020-2021 may produce nothing in the 2022 bear market if it only handles long-side signals.

The four macro phases to test across: the bull market trending phase (high ADX, directional momentum, trend-following performs well), the bull market topping phase (high volatility, regime uncertainty, SAR-based strategies may whipsaw), the bear market (requires short-side capability or flat market exposure), and the ranging/recovery phase (low ADX, mean-reversion opportunities, trend-following largely inactive).

A strategy that performs similarly across all four phases is genuinely robust. A strategy that outperforms in one phase is phase-specific, useful in that environment and unreliable outside it. Most crypto strategies are implicitly phase-specific, optimized for the macro environment during which most of their backtesting data was collected.

For the tactical regime classification framework, see What Is a Market Regime? and Ranging vs Trending Crypto.

LIVE SYSTEM
{{tool}}

Crypto-Specific Cost Considerations

Funding rates on perpetual futures. Perpetual contracts settle funding every 8 hours. When the market is bullish and most traders are long, funding rates are positive: longs pay shorts. During 2020-2021, funding rates on BTC perpetuals often ran at 0.01% to 0.03% every 8 hours, which compounds to 2% to 9% per month in holding costs for long positions. A backtest that ignores funding rates materially overstates the profitability of long futures strategies during exactly the periods when those strategies produce their best-looking results.

Liquidation risk at leverage. Leveraged futures positions can be liquidated if the position moves against the account. This risk exists in live trading but most backtesting frameworks do not model it. A backtest showing strong overall performance may include drawdown periods where a leveraged version would have been liquidated. Position sizing should ensure drawdowns never approach liquidation thresholds, and this constraint must be validated in the backtest.

Spread and slippage during volatility events. Spreads on crypto assets widen significantly during high-volatility events. Backtests using mid-price fills overstate profitability. Using conservative spread estimates, particularly around the major event dates in the backtest period, produces more realistic results.

For the general backtesting methodology including cost handling, see How to Backtest a Trading Strategy.

Crypto Backtesting in a Systematic Framework

The shadow data system records every signal's conditions, entry price, and exit outcome in real time, a continuous out-of-sample backtest that runs alongside the live system. Historical shadow data is analyzed by macro regime phase as well as by tactical ADX regime.

One finding that emerged specifically from multi-phase analysis: the expectancy of RANGING long signals in a ranging/recovery phase showed materially higher positive expectancy than RANGING long signals produced during the high-volatility period following the prior bull run's peak. Same tactical ADX conditions, different macro context, different outcomes. The rolling live backtest revealed this pattern only after sufficient data accumulated across both macro contexts. A single historical backtest over one phase would not have shown the difference.

For the futures component, funding rate costs are estimated based on each position's expected duration and recent funding rate averages. Signals with expected holding periods exceeding 12 hours in strongly positive funding conditions receive an adjusted expected value incorporating the estimated funding cost. This is not standard in generic backtesting frameworks but is necessary for accurate futures expectancy calculation in crypto's high-funding environments.

The shadow data also tracks regime distribution over time, what percentage of bars were RANGING, TRENDING_BULLISH, and TRENDING_BEARISH. Shifts in this distribution between the backtest period and the live period are flagged as potential evidence that the market's macro character has changed, warranting a review of the strategy's phase-specific calibration.

Crypto Backtesting Mistakes to Avoid

Backtesting on bull market data only. 2020-2021 was an extreme bull market. Any long-only strategy with trend-following logic produced strong results. This does not indicate sustainable edge. It indicates the strategy captured the macro tailwind of one of the strongest bull runs in market history. Include at least 2018 or 2022 in the test period to evaluate bear market behavior.

Ignoring funding rates for futures. A perpetual futures strategy holding consistent long positions during a bullish period pays significant funding costs that never appear in the spot price. A strategy showing 15% annual return without funding costs may show 8% after realistic funding estimates. Always include funding rate assumptions in futures backtests, especially for any strategy with long average holding periods in trending markets.

Asset selection from current market cap rankings. Backtesting on today's top 10 or top 20 selects for survivors. The backtest shows how these specific assets performed, not how a real portfolio would have performed. Include assets that were historically prominent but have since declined or failed, or at minimum acknowledge that the results reflect a survivor-biased sample.

Insufficient out-of-sample periods. Given crypto's limited history, a standard 70/30 split on 3 years of data produces 10 months of out-of-sample testing. This is marginal. Compensate with continuous live tracking using shadow data from day one of deployment. The live data is the only true out-of-sample test for crypto strategies where the historical baseline is inherently limited.

PRODUCT RESEARCH
What crypto market period does most of your backtesting cover?
Bull market period only (2020-2021 or similar)
Bear market period only
A complete cycle including bull and bear
I haven't done systematic backtesting yet
FREQUENTLY ASKED
No items found.

Get early access

Get the live read before launch.

RegimeLab launches Q4 2026. Live regime across all 8 pairs, flip alerts, and the full dashboard. Free tier available.

Free tier always · Pro and API at launch