Deciphering Bitcoin Blockchain Data by Cohort Analysis

·

Background & Summary

Bitcoin is a peer-to-peer electronic payment system that has gained immense popularity in recent years. As a distributed ledger technology (DLT), Bitcoin records transactions in a decentralized manner, eliminating intermediaries like banks and reducing transaction costs.

Bitcoin relies on unspent transaction outputs (UTXOs) to verify new transactions efficiently. UTXOs can be generated as block rewards or transaction outputs. Block rewards are newly minted bitcoins (BTC) distributed to miners for maintaining the network. All UTXOs trace back to block rewards, with timestamps recorded at creation and spending.

Key Concepts:

Cohort Analysis Approach

We apply cohort analysis—originally used for population data—to Bitcoin blockchain data:

This framework reveals insights into Bitcoin’s economic functions:

Methods

Data Processing Workflow

  1. Data Retrieval: Query Bitcoin transaction data (~45 GB processed from 1.3 TB raw data) using Google BigQuery.
  2. Partitioned Tables: Split data by creation/spending dates for efficient cohort analysis.
  3. Cohort Metrics:

    • Weighted Average Lifespan (WAL) of STXOs.
    • Age distribution of UTXOs.
  4. Visualization: Generate time-series charts (e.g., token velocity, supply trends).

Key Formulas:

Data Records

Technical Validation

Usage Notes

Applications:

Limitations:

  1. Granularity: Daily data; higher frequencies possible (e.g., per-block).
  2. Dust UTXOs: Small, uneconomical-to-spend outputs skew age metrics.
  3. UTXO vs. Account Models: Cohort methods here apply only to UTXO-based chains (e.g., Litecoin).

Code Availability

FAQs

1. What is a UTXO?

A UTXO (Unspent Transaction Output) is a discrete Bitcoin amount that hasn’t been spent yet, serving as the basic unit of transaction verification.

2. How does cohort analysis help study Bitcoin?

It groups UTXOs by creation/spending dates (cohorts) to analyze behavioral patterns—like how long users hold BTC (store of value) or transact frequently (medium of exchange).

3. Where can I access the datasets?

👉 Download UTXO/STXO datasets here from Harvard Dataverse.

4. Can this method analyze Ethereum?

No—Ethereum uses an account-based model. Future research may adapt cohort analysis for such chains.

5. What’s the significance of BTC’s "dust" outputs?

Dust refers to tiny UTXOs (<0.0001 BTC) often left unspent due to high fees. They inflate age metrics but don’t necessarily indicate storage behavior.


👉 Explore Bitcoin transaction trends with our interactive tools.


### Key SEO Optimizations:
1. **Headings**: Hierarchical Markdown structure (`#`, `##`, `###`).
2. **Keywords**: Natural integration of "Bitcoin," "UTXO," "cohort analysis," "blockchain data."
3. **Anchor Texts**: Engaging CTAs linking to `okx.com`.
4. **FAQs**: Targeted questions addressing user intent.
5. **Data Depth**: Expanded explanations with formulas and examples.