Optimal Strategy in a Dice Game – Jacob Mathew’s Blog

In this post, I talk about my attempt to find the optimal strategy to maximize the score in a dice game.

The Game

You roll a fair 10-sided die.

If you roll a 10, the game ends, and you win nothing.
If you roll any other number, that number is added to your score.
You are not allowed to stop until your total reaches at least 10.
Once your score reaches 10, you are given a choice and you can choose to stop or continue.

How do you play this game optimally?

Strategy 1: Random Play

I first tried a randomized strategy — once I reached 10, I continued with 50% probability:


def strategy(res):
   return random.uniform(0,1) < 0.5

def roll_die(choices=[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]):
    return np.random.choice(choices)

def game():
    res = 0
    while res < 10:
        roll = roll_die()
        if roll == 10:
            return 0
        else:
            res += roll

    play_again = strategy(res)
    while play_again:
        roll = roll_die()
        if roll == 10:
            return 0
        else:
        play_again = strategy(res)
            res += roll
    return res

After simulating thousands of games, the average score hovered around 11.6 to 12.0.

Strategy 2: Optimized Play

At this point, I asked myself can I calculate at each turn (after reaching a score of 10) what my score could be after the next roll.

Say, my score is 10, what possible values can my score take if I roll the die again?

Dice Throw Outcome	New Score	Probability
1	11	$1/10$
2	12	$1/10$
3	13	$1/10$
4	14	$1/10$
5	15	$1/10$
6	16	$1/10$
7	17	$1/10$
8	18	$1/10$
9	19	$1/10$
10	0	$1/10$

This means, my expected value at the end of the throw is \[ \sum_{i=11}^{19} \frac{i}{10} = 13.5 \]

This is a value greater than my current score. So, it makes sense for me to play the game once more.

So I updated my strategy as such.

def strategy(res):
    e = 0
    for i in range(1, 10):
        e += (res + i) / 10

    return e > res

What this does is ask the following question. Given my current score, do I expect a higher score if I play again?

At first, this confused me — res + i increases as res increases, so why does the gain shrink?

Turns out, the math reveals:

\[E(res)=0.9res+4.5⇒E(res)−res=−0.1res+4.5\]

So as score increases, the value of rolling decreases linearly. Once score reaches 45, the expected gain is 0. Beyond that, rolling again is actually worse than stopping.

I simulated this strategy using the code below and got a higher range for the confidence of the score (17.25, 17.95)

def simulate_game(n=10000):
    res_list = []
    for i in range(n):
        res_list.append(game())
    return res_list


def find_conf(n=10000, n_conf=10000, conf_per=95):
    mean_list = []
    for i in tqdm(range(n_conf), desc="num_simulations"):
        mean_list.append(np.mean(simulate_game(n)))
    lower = np.quantile(mean_list, (100 - conf_per) / 200)
    upper = np.quantile(mean_list, (conf_per + (100 - conf_per) / 2) / 100)
    return (lower, upper)

--- title: "Optimal Strategy in a Dice Game" author: "Jacob Mathew" date: Sat May 24 23:14:42 CDT 2025 categories: [simulation, probability, decision-theory] format: html: theme: flatly toc: true toc-depth: 3 toc-location: right code-fold: true code-tools: true code-copy: true code-overflow: wrap smooth-scroll: true anchor-sections: true link-external-newwindow: true title-block-banner: true page-layout: article include-in-header: text: | <style> :root { --rs-accent: #0f4c81; --rs-accent-soft: #eaf3fb; --rs-border: #d9e4ef; --rs-ink: #1f2937; --rs-muted: #5f6b7a; --rs-code-bg: #0b1220; --rs-warm: #fff8ea; --rs-warm-border: #f3d28b; } body { text-rendering: optimizeLegibility; } .quarto-title-block .quarto-title-banner { background: linear-gradient(135deg, rgba(15,76,129,.95), rgba(31,111,235,.78)), radial-gradient(circle at top right, rgba(255,255,255,.22), transparent 32%); padding-top: 3.25rem; padding-bottom: 3rem; border-bottom: 1px solid rgba(255,255,255,.12); } .quarto-title-block .title { font-weight: 800; letter-spacing: -0.03em; max-width: 12ch; } .quarto-title-meta { font-size: .95rem; } main.content { max-width: 900px; } .content p, .content li { color: var(--rs-ink); font-size: 1.04rem; line-height: 1.8; } .content p { margin-bottom: 1rem; } .content h1, .content h2, .content h3, .content h4 { color: #10273f; font-weight: 750; letter-spacing: -0.02em; } .content h1 { margin-top: 2.75rem; margin-bottom: 1rem; padding-bottom: .45rem; border-bottom: 2px solid var(--rs-border); } .content h2 { margin-top: 2.35rem; margin-bottom: .75rem; } .content h3 { margin-top: 1.75rem; } .content a { color: var(--rs-accent); text-decoration-thickness: .08em; text-underline-offset: .14em; } .content ul, .content ol { padding-left: 1.35rem; } .content li + li { margin-top: .35rem; } pre, div.sourceCode { border-radius: 18px; border: 1px solid #172033; box-shadow: 0 14px 36px rgba(15, 23, 42, 0.16); } pre code, div.sourceCode code { font-size: .92rem; } code:not(pre code) { background: var(--rs-accent-soft); color: #17476d; border-radius: .45rem; padding: .14rem .38rem; } .cell { margin-top: 1.2rem; margin-bottom: 1.5rem; } table { width: 100%; border-collapse: separate; border-spacing: 0; margin: 1.5rem 0 2rem; overflow: hidden; border: 1px solid var(--rs-border); border-radius: 16px; box-shadow: 0 10px 30px rgba(15, 23, 42, 0.06); } thead th { background: #f4f8fc; color: #17324d; font-weight: 700; border-bottom: 1px solid var(--rs-border); } th, td { padding: .9rem .95rem; vertical-align: top; } tbody tr:nth-child(even) { background: #fbfdff; } blockquote { border-left: 4px solid var(--rs-accent); background: #f7fbff; border-radius: 0 14px 14px 0; padding: .9rem 1rem; color: var(--rs-muted); } hr { border-top: 1px solid var(--rs-border); opacity: 1; } .sidebar nav[role="doc-toc"] { border-left: 1px solid var(--rs-border); padding-left: 1rem; } .sidebar nav[role="doc-toc"] .active { color: var(--rs-accent) !important; font-weight: 700; } span[style*="color: red"] { display: inline-block; background: var(--rs-warm); color: #8a5a00 !important; border: 1px solid var(--rs-warm-border); border-radius: 999px; padding: .12rem .5rem; font-weight: 700; margin-right: .35rem; } img, .quarto-figure, .figure { border-radius: 18px; } @media (max-width: 991px) { .quarto-title-block .title { max-width: none; } main.content { max-width: 100%; } } </style> --- In this post, I talk about my attempt to find the optimal strategy to maximize the score in a dice game. ## The Game You roll a fair 10-sided die. - If you roll a **10**, the game ends, and you win **nothing**. - If you roll any other number, that number is added to your score. - You are **not allowed to stop** until your total reaches at least **10**. - Once your score reaches 10, you are given a choice and you can **choose to stop or continue**. How do you play this game optimally? --- ## Strategy 1: Random Play I first tried a randomized strategy — once I reached 10, I continued with 50% probability: ```python def strategy(res): return random.uniform(0,1) < 0.5 def roll_die(choices=[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]): return np.random.choice(choices) def game(): res = 0 while res < 10: roll = roll_die() if roll == 10: return 0 else: res += roll play_again = strategy(res) while play_again: roll = roll_die() if roll == 10: return 0 else: play_again = strategy(res) res += roll return res ``` After simulating thousands of games, the average score hovered around 11.6 to 12.0. ## Strategy 2: Optimized Play At this point, I asked myself can I calculate at each turn (after reaching a score of 10) what my score could be after the next roll. Say, my score is 10, what possible values can my score take if I roll the die again? | Dice Throw Outcome | New Score | Probability | |--------------------|-----------|-------------| | 1 | 11 | $1/10$ | | 2 | 12 | $1/10$ | | 3 | 13 | $1/10$ | | 4 | 14 | $1/10$ | | 5 | 15 | $1/10$ | | 6 | 16 | $1/10$ | | 7 | 17 | $1/10$ | | 8 | 18 | $1/10$ | | 9 | 19 | $1/10$ | | 10 | 0 | $1/10$ | This means, my expected value at the end of the throw is $$ \sum_{i=11}^{19} \frac{i}{10} = 13.5 $$ This is a value greater than my current score. So, it makes sense for me to play the game once more. So I updated my strategy as such. ```python def strategy(res): e = 0 for i in range(1, 10): e += (res + i) / 10 return e > res ``` What this does is ask the following question. Given my current score, do I expect a higher score if I play again? At first, this confused me — res + i increases as res increases, so why does the gain shrink? Turns out, the math reveals: $$E(res)=0.9res+4.5⇒E(res)−res=−0.1res+4.5$$ So as score increases, the value of rolling decreases linearly. Once score reaches 45, the expected gain is 0. Beyond that, rolling again is actually worse than stopping. I simulated this strategy using the code below and got a higher range for the confidence of the score (17.25, 17.95) ```python def simulate_game(n=10000): res_list = [] for i in range(n): res_list.append(game()) return res_list def find_conf(n=10000, n_conf=10000, conf_per=95): mean_list = [] for i in tqdm(range(n_conf), desc="num_simulations"): mean_list.append(np.mean(simulate_game(n))) lower = np.quantile(mean_list, (100 - conf_per) / 200) upper = np.quantile(mean_list, (conf_per + (100 - conf_per) / 2) / 100) return (lower, upper) ```

Dice Throw Outcome	New Score	Probability
1	11	\(1/10\)
2	12	\(1/10\)
3	13	\(1/10\)
4	14	\(1/10\)
5	15	\(1/10\)
6	16	\(1/10\)
7	17	\(1/10\)
8	18	\(1/10\)
9	19	\(1/10\)
10	0	\(1/10\)