도박사의 파산

개요

도박사의 파산(Gambler's ruin)은 유한한 자금을 가진 도박사가 도박을 계속할 경우 파산할 수밖에 없음을 보여주는 문제다. 아래 문제와 풀이를 잘 보고 도박의 늪에 빠지지 않도록 하자.

역사

1656년에 블레즈 파스칼이 피에르 드 페르마에게 다음 문제를 낸 것이 시초라고 한다. 피에르 드 카르카비(Pierre de Carcavi)는 크리스티안 하위헌스에게 보내는 편지에서 문제를 다음과 같이 요약했다.^[1]

두 사람이 주사위 세 개를 굴려서 첫 번째 사람은 11을 굴리면 득점하고, 두 번째 사람은 14를 굴리면 득점한다고 하자. 그런데 득점이 누적되는 기존 방식 대신에 상대방의 점수가 0이면 1점을 얻고, 그렇지 않으면 상대방의 점수를 빼도록 하자. 그것은 마치 두 점수가 짝을 지어 서로를 상쇄하여 지고 있는 쪽이 항상 0점인 것처럼 보인다. 12점을 먼저 얻는 사람이 승자가 된다. 그러면 각 참여자가 이길 확률은 얼마일까?

크리스티안 하위헌스는 1657년에 De Ratiociniis in Ludo Aleae라는 책을 출판하면서 이 문제를 다루었다.^[2]

문제 5. 사람 A, B는 자금 12를 가지고 시작하여 주사위 세 개를 굴려서 A는 11이 나올 때, B는 14가 나올 때 상대에게서 자금을 1 가져온다. 먼저 전액을 획득하는 사람이 승리한다. 이때 A와 B가 이길 확률의 비는 244,140,625 : 282,429,536,481이다.

문제

사람 [math]\displaystyle{ A,B }[/math]가 각각 자금 [math]\displaystyle{ i, N-i }[/math]를 가지고 있고, 앞면이 나올 확률이 p인 동전을 던져서 앞면이 나오면 A가 B에게 자금 1을 받고, 반대로 뒷면이 나오면 B가 A에게 자금 1을 받는다. 동전 던지기를 반복했을 때 A가 자금 N을 독차지할 확률은 얼마일까?

풀이

A가 자금 N을 독차지하는 사건을 E로 두고, 자금 i를 가지고 있을 때 자금 N을 독차지할 확률을 [math]\displaystyle{ P_i }[/math]라 하자. 그러면

[math]\displaystyle{ P_i=P(E) }[/math]

이고, 첫 번째 게임에서 앞면이 나오는 사건을 F라 하면 전확률의 법칙(law of total probability)에 의해

[math]\displaystyle{ P(E)=P(E|F)P(F)+P(E|F^c)P(F^c) }[/math]

이다. 만약 F가 발생하면 A는 자금 [math]\displaystyle{ i+1 }[/math]을 가지고, 반대로 [math]\displaystyle{ F^c }[/math]가 발생하면 A는 자금 [math]\displaystyle{ i-1 }[/math]을 가진다. 따라서

[math]\displaystyle{ P(E|F)=P_{i+1} }[/math]

[math]\displaystyle{ P(E|F^c)=P_{i-1} }[/math]

이다. 따라서

[math]\displaystyle{ P_i=pP_{i+1}+(1-p)P_{i-1} }[/math]

이다. 그러면

[math]\displaystyle{ pP_i+(1-p)P_i=pP_{i+1}+(1-p)P_{i-1} }[/math]

이므로

[math]\displaystyle{ P_{i+1}-P_i=\frac{1-p}{p}(P_i-P_{i-1}) }[/math]

이다. 그러면

[math]\displaystyle{ \begin{align} P_2-P_1&=\frac{1-p}{p}(P_1-P_0)\\ P_3-P_2&=\left(\frac{1-p}{p}\right)^2(P_1-P_0)\\ &\vdots\\ P_i-P_{i-1}&=\left(\frac{1-p}{p}\right)^{i-1}(P_1-P_0) \end{align} }[/math]

이고 [math]\displaystyle{ P_0=0 }[/math]이므로 양변을 더하면

[math]\displaystyle{ P_i=\left(1+\frac{1-p}{p}+\left(\frac{1-p}{p}\right)^2+\cdots+\left(\frac{1-p}{p}\right)^{i-1}\right)P_1 }[/math]

이므로 등비수열의 합의 공식에 의해

[math]\displaystyle{ P_i=\begin{cases} \frac{1-\left(\frac{1-p}{p}\right)^i}{1-\frac{1-p}{p}}P_1,&p\ne\frac{1}{2}\\ iP_1,&p=\frac{1}{2} \end{cases} }[/math]

이다. 한편 [math]\displaystyle{ P_N=1 }[/math]이므로

[math]\displaystyle{ P_N=\begin{cases} \frac{1-\left(\frac{1-p}{p}\right)^N}{1-\frac{1-p}{p}}P_1,&p\ne\frac{1}{2}\\ NP_1,&p=\frac{1}{2} \end{cases} }[/math]

에서

[math]\displaystyle{ P_1=\begin{cases} \frac{1-\frac{1-p}{p}}{1-\left(\frac{1-p}{p}\right)^N},&p\ne\frac{1}{2}\\ \frac{1}{N},&p=\frac{1}{2} \end{cases} }[/math]

을 얻는다. 따라서

[math]\displaystyle{ P_i=\begin{cases} \frac{1-\left(\frac{1-p}{p}\right)^i}{1-\left(\frac{1-p}{p}\right)^N},&p\ne\frac{1}{2}\\ \frac{i}{N},&p=\frac{1}{2} \end{cases} }[/math]

을 얻는다.

도박사가 카지노에서 도박을 한다고 가정하자. 그러면 카지노는 자금을 많이 가지고 있으므로 [math]\displaystyle{ N\to\infty }[/math]라고 가정할 수 있다. 만약 [math]\displaystyle{ p\lt \frac{1}{2} }[/math]라면, [math]\displaystyle{ \frac{1-p}{p}\gt 1 }[/math]이므로 [math]\displaystyle{ N\to\infty }[/math]일 때 [math]\displaystyle{ P_i\to 0 }[/math]이다. 즉 A는 파산할 것이 확실하다. 심지어 [math]\displaystyle{ p=\frac{1}{2} }[/math]일 때도 파산하는 건 기정사실인데, 왜냐 하면 [math]\displaystyle{ N\to\infty }[/math]이면 [math]\displaystyle{ \frac{i}{N}\to 0 }[/math]이기 때문이다. 그나마 [math]\displaystyle{ p\gt \frac{1}{2} }[/math]일 때 [math]\displaystyle{ P_i }[/math]는 0이 아닌 유한한 값으로 수렴한다. 그러나 이런 게임을 카지노가 마련해둘 리는 없다.

전략 세우기

외부 링크

같이 보기

각주

↑ Mat Willmott (2002). Variations on the Gambler’s Ruin Problem. 2015년 7월 7일에 확인.
↑ Huygens (1657). De Ratiociniis in Ludo Aleae. 2015년 7월 7일에 확인.

[1] Mat Willmott (2002). Variations on the Gambler’s Ruin Problem. 2015년 7월 7일에 확인.

[2] Huygens (1657). De Ratiociniis in Ludo Aleae. 2015년 7월 7일에 확인.

[1]

[2]