Expected Value of Number of Attempts to Get Pattern

In this post, I talk about how to find the expected value of the number of coin tosses needed to find a specific pattern for the first time.

The Setup

You toss a fair coin until you reach a specific pattern. Example HH. How many tosses would you need on an average to get to this pattern for the first time.

Solution

To tackle this problem, I thought about the solution in terms of conditional probability.

Given that I have not seen the pattern yet, if the last toss was an H, how many more tosses can I expect before I see the pattern? Call this H. Given that I have not seen the pattern yet, if the last toss was a T, how many more tosses can I expect before I see the pattern? Call this T.

\[ H = 0.5*1 + 0.5*(1+T) \]

The explanation of the equation above is, as follows: Given that we have not seen the pattern yet, and we tossed an H, with 0.5 probability we can get the pattern. With another 0.5 probability we need (1+T). 1 because of the T we just rolled, and T is the expected number given the last toss was T.

\[ T = 0.5(1+H) + 0.5(1+T) \]

We now have a system of linear equations. On solving this, we get \[ \begin{aligned} H = 4\\ T = 6\\ E = 6 \end{aligned} \]

Simulate the Results

import numpy as np
import random


def simulate_game(p=0.5):
    last_res = None
    last_but_one_res = None
    n_trials = 0
    while not ((last_res == "H") & (last_but_one_res == "H")):
        last_but_one_res = last_res
        if random.uniform(0, 1) > p:
            last_res = "H"
        else:
            last_res = "T"
        n_trials += 1
    return n_trials


def simulate_expected_value(n, p):
    trials_list = []
    for i in range(n):
        trials_list.append(simulate_game(p))

    return np.mean(trials_list)


def find_conf(conf_n, n, p, conf_per=95):
    res_list = []
    for i in range(conf_n):
        res_list.append(simulate_expected_value(n, p))
    lower = np.quantile(res_list, (100 - conf_per) / 200)
    upper = np.quantile(res_list, (conf_per + (100 - conf_per) / 2) / 100)
    return (lower, upper)

Expanding Further

What if we have

A pattern HHH?
Unfair dice?
Is the expected value to get HHH the same as HTH? What about TTH?
If the pattern is HH OR HT?
- What is the probability we get HH before HT?

--- title: "Expected Value of Number of Attempts to Get Pattern" author: "Jacob Mathew" date: Sat May 24 23:24:55 CDT 2025 categories: [simulation, probability] format: html: theme: flatly toc: true toc-depth: 3 toc-location: right code-fold: true code-tools: true code-copy: true code-overflow: wrap smooth-scroll: true anchor-sections: true link-external-newwindow: true title-block-banner: true page-layout: article include-in-header: text: | <style> :root { --rs-accent: #0f4c81; --rs-accent-soft: #eaf3fb; --rs-border: #d9e4ef; --rs-ink: #1f2937; --rs-muted: #5f6b7a; --rs-code-bg: #0b1220; --rs-warm: #fff8ea; --rs-warm-border: #f3d28b; } body { text-rendering: optimizeLegibility; } .quarto-title-block .quarto-title-banner { background: linear-gradient(135deg, rgba(15,76,129,.95), rgba(31,111,235,.78)), radial-gradient(circle at top right, rgba(255,255,255,.22), transparent 32%); padding-top: 3.25rem; padding-bottom: 3rem; border-bottom: 1px solid rgba(255,255,255,.12); } .quarto-title-block .title { font-weight: 800; letter-spacing: -0.03em; max-width: 12ch; } .quarto-title-meta { font-size: .95rem; } main.content { max-width: 900px; } .content p, .content li { color: var(--rs-ink); font-size: 1.04rem; line-height: 1.8; } .content p { margin-bottom: 1rem; } .content h1, .content h2, .content h3, .content h4 { color: #10273f; font-weight: 750; letter-spacing: -0.02em; } .content h1 { margin-top: 2.75rem; margin-bottom: 1rem; padding-bottom: .45rem; border-bottom: 2px solid var(--rs-border); } .content h2 { margin-top: 2.35rem; margin-bottom: .75rem; } .content h3 { margin-top: 1.75rem; } .content a { color: var(--rs-accent); text-decoration-thickness: .08em; text-underline-offset: .14em; } .content ul, .content ol { padding-left: 1.35rem; } .content li + li { margin-top: .35rem; } pre, div.sourceCode { border-radius: 18px; border: 1px solid #172033; box-shadow: 0 14px 36px rgba(15, 23, 42, 0.16); } pre code, div.sourceCode code { font-size: .92rem; } code:not(pre code) { background: var(--rs-accent-soft); color: #17476d; border-radius: .45rem; padding: .14rem .38rem; } .cell { margin-top: 1.2rem; margin-bottom: 1.5rem; } table { width: 100%; border-collapse: separate; border-spacing: 0; margin: 1.5rem 0 2rem; overflow: hidden; border: 1px solid var(--rs-border); border-radius: 16px; box-shadow: 0 10px 30px rgba(15, 23, 42, 0.06); } thead th { background: #f4f8fc; color: #17324d; font-weight: 700; border-bottom: 1px solid var(--rs-border); } th, td { padding: .9rem .95rem; vertical-align: top; } tbody tr:nth-child(even) { background: #fbfdff; } blockquote { border-left: 4px solid var(--rs-accent); background: #f7fbff; border-radius: 0 14px 14px 0; padding: .9rem 1rem; color: var(--rs-muted); } hr { border-top: 1px solid var(--rs-border); opacity: 1; } .sidebar nav[role="doc-toc"] { border-left: 1px solid var(--rs-border); padding-left: 1rem; } .sidebar nav[role="doc-toc"] .active { color: var(--rs-accent) !important; font-weight: 700; } span[style*="color: red"] { display: inline-block; background: var(--rs-warm); color: #8a5a00 !important; border: 1px solid var(--rs-warm-border); border-radius: 999px; padding: .12rem .5rem; font-weight: 700; margin-right: .35rem; } img, .quarto-figure, .figure { border-radius: 18px; } @media (max-width: 991px) { .quarto-title-block .title { max-width: none; } main.content { max-width: 100%; } } </style> --- In this post, I talk about how to find the expected value of the number of coin tosses needed to find a specific pattern for the first time. ## The Setup You toss a fair coin until you reach a specific pattern. Example HH. How many tosses would you need on an average to get to this pattern for the first time. --- ## Solution To tackle this problem, I thought about the solution in terms of conditional probability. Given that I have not seen the pattern yet, if the last toss was an H, how many more tosses can I expect before I see the pattern? Call this H. Given that I have not seen the pattern yet, if the last toss was a T, how many more tosses can I expect before I see the pattern? Call this T. $$ H = 0.5*1 + 0.5*(1+T) $$ The explanation of the equation above is, as follows: Given that we have not seen the pattern yet, and we tossed an H, with 0.5 probability we can get the pattern. With another 0.5 probability we need (1+T). 1 because of the T we just rolled, and T is the expected number given the last toss was T. $$ T = 0.5(1+H) + 0.5(1+T) $$ We now have a system of linear equations. On solving this, we get $$ \begin{aligned} H = 4\\ T = 6\\ E = 6 \end{aligned} $$ ## Simulate the Results ```python import numpy as np import random def simulate_game(p=0.5): last_res = None last_but_one_res = None n_trials = 0 while not ((last_res == "H") & (last_but_one_res == "H")): last_but_one_res = last_res if random.uniform(0, 1) > p: last_res = "H" else: last_res = "T" n_trials += 1 return n_trials def simulate_expected_value(n, p): trials_list = [] for i in range(n): trials_list.append(simulate_game(p)) return np.mean(trials_list) def find_conf(conf_n, n, p, conf_per=95): res_list = [] for i in range(conf_n): res_list.append(simulate_expected_value(n, p)) lower = np.quantile(res_list, (100 - conf_per) / 200) upper = np.quantile(res_list, (conf_per + (100 - conf_per) / 2) / 100) return (lower, upper) ``` ## Expanding Further What if we have - A pattern HHH? - Unfair dice? - Is the expected value to get HHH the same as HTH? What about TTH? - If the pattern is HH OR HT? - What is the probability we get HH before HT?