역함수 정리

개요

역함수 정리(Inverse function theorem)

일변수함수에서

진술

\(f:A\to \mathbb{R}\)이 \(a\)에서 미분가능하고 \(f'(a)\ne 0\)이면

[math]\displaystyle{ (f^{-1})'(a)=\frac{1}{f'(f^{-1}(a))} }[/math]

증명

다변수함수에서

진술

\(A\subseteq \mathbb{R}^n\)를 열린 집합, \(f:A\to \mathbb{R}^n\)을 미분가능하고 도함수가 연속인 함수라고 하자. \(Df(\mathbf{x})\)가 \(\mathbf{a}\)에서 가역이면, \(a\)의 근방 \(U\)가 존재해, \(f\)를 \(U\)에서 열린 집합 \(V=f(U)\)로 옮기는 함수로 간주하면 \(f\)가 일대일 대응이고, 역함수 또한 미분가능하고 도함수가 연속이다.

증명

영어 위키백과엔 증명이 나와 있지 않고, 증명에 이용될 수 있는 다른 정리를 소개한다. 바나흐 고정점 정리, 최대·최소 정리, 뉴턴의 방법 등을 이용할 수 있다고 하는데 하나같이 만만한 게 없다. 아래에 소개할 증명도 보조정리가 덕지덕지 붙는 등 간단하지 않으니 무리하지 말고 찬찬히 읽어보거나 패스하도록 하자.

보조정리 1

“
[math]\displaystyle{ \|\cdot\|_V,\|\cdot\|_M }[/math]를 각각

[math]\displaystyle{ \|\mathbf{x}\|_V=\sqrt{\sum_{i=1}^n x_i^2} }[/math]

[math]\displaystyle{ \|A\|_M=\sqrt{\sum_{i=1}^n\sum_{j=1}^n a_{ij}^2} }[/math]

로 정의된 노름 및 행렬 노름이라고 하자. 그러면 실수 성분을 가지는 n차 정사각행렬 \(A\)와 \(x\in \mathbb{R}^n\)에 대해

[math]\displaystyle{ \|A\mathbf{x}\|_V\le \|A\|_M\|x\|_V }[/math]
이다. “

[math]\displaystyle{ A=\begin{bmatrix} a_{ij} \end{bmatrix}, \mathbf{x}=\begin{bmatrix} x_1\\ x_2\\ \vdots\\ x_n \end{bmatrix} }[/math]

로 두면

[math]\displaystyle{ A\mathbf{x}=\begin{bmatrix} \sum_{j=1}^n a_{ij}x_j \end{bmatrix} }[/math]

이고 따라서

[math]\displaystyle{ \|A\mathbf{x}\|_V=\sqrt{\sum_{i=1}^n \left(\sum_{j=1}^n a_{ij}x_j\right)^2} }[/math]

이다. 코시-슈바르츠 부등식에 의해

[math]\displaystyle{ \begin{align} (a_{11}x_1+a_{12}x_2+\cdots+a_{1n}x_n)^2&\le (a_{11}^2+a_{12}^2+\cdots+a_{1n}^2)(x_1^2+x_2^2+\cdots+x_n^2)\\(a_{21}x_1+a_{22}x_2+\cdots+a_{2n}x_n)^2&\le (a_{21}^2+a_{22}^2+\cdots+a_{2n}^2)(x_1^2+x_2^2+\cdots+x_n^2)\\ &\vdots\\ (a_{n1}x_1+a_{n2}x_2+\cdots+a_{nn}x_n)^2&\le (a_{n1}^2+a_{n2}^2+\cdots+a_{nn}^2)(x_1^2+x_2^2+\cdots+x_n^2) \end{align} }[/math]

이므로 양변끼리 모두 더해주면

[math]\displaystyle{ \|A\mathbf{x}\|_V^2=\|A\|_M^2\|\mathbf{x}\|_V^2 }[/math]

이므로 원하는 결론을 얻는다.

보조정리 2

“
\(A\subseteq \mathbb{R}^n\)를 열린 집합, \(f:A\to \mathbb{R}^n\)을 한 번 미분가능하고 도함수가 연속인 함수라고 하자. 만약 \(Df(\mathbf{a})\)가 가역이면 어떤 공 \(B(\mathbf{a},\epsilon)\)에서 임의의 \(\mathbf{x}_0,\mathbf{x}_1\in B(\mathbf{a},\epsilon)\)에 대해 부등식

[math]\displaystyle{ \|f(\mathbf{x}_1)-f(\mathbf{x}_0)\| \ge \alpha\|\mathbf{x}_1-\mathbf{x_0}\| }[/math]
을 만족하는 \(\alpha>0\)가 존재한다. 더욱이 \(f\)는 \(B(\mathbf{a},\epsilon)\)에서 일대일 함수이다. “

\(Df(\mathbf{a})\)가 가역이므로 \([Df(\mathbf{a})]^{-1}\)이 존재한다. 임의의 \(\mathbf{x}_0,\mathbf{x}_1\)에 대해 보조정리 1에 의해

[math]\displaystyle{ \|\mathbf{x}_1-\mathbf{x}_0\| = \|[Df(\mathbf{a})]^{-1}(Df(\mathbf{a})\mathbf{x}_1-Df(\mathbf{a})\mathbf{x}_0)\|\le \|[Df(\mathbf{a})]^{-1}\|\|Df(\mathbf{a})\mathbf{x}_1-Df(\mathbf{a})\mathbf{x}_0\| }[/math]

이다. 이때

[math]\displaystyle{ \alpha=\frac{1}{2\|[Df(\mathbf{a})]^{-1}\|} }[/math]

로 두자. 이제 함수 \(g\)를

[math]\displaystyle{ g(\mathbf{x})=f(\mathbf{x})-Df(\mathbf{a})\mathbf{x} }[/math]

로 정의하자. 그러면

[math]\displaystyle{ Dg(\mathbf{x})=Df(\mathbf{x})-Df(\mathbf{a}) }[/math]

이므로

[math]\displaystyle{ Dg(\mathbf{a})=0 }[/math]

이다. \(Dg\)가 연속이므로 어떤 공 \(B(\mathbf{a},\epsilon)\)에서

[math]\displaystyle{ \|Dg(\mathbf{x})\| \lt \frac{\alpha}{\sqrt{n}} }[/math]

인 \(\epsilon>0\)이 존재한다. \(g=(g_1,g_2,\cdots,g_n)\)라 하면 평균값의 정리에 의해

[math]\displaystyle{ g_i(\mathbf{x}_1)-g_i(\mathbf{x}_0)=Dg_i(\mathbf{c})(\mathbf{x}_1-\mathbf{x}_0) }[/math]

인 \(\mathbf{c}\in B(\mathbf{a},\epsilon)\)가 존재한다. 따라서 보조정리 1에 의해

[math]\displaystyle{ |g_i(\mathbf{x}_1)-g_i(\mathbf{x}_0)| \le \frac{\alpha}{\sqrt{n}} \|\mathbf{x}_1-\mathbf{x}_0\| }[/math]

이고

[math]\displaystyle{ \|g(\mathbf{x}_1)-g(\mathbf{x}_0)\|=\sqrt{\sum_{i=1}^n |g_i(\mathbf{x}_1)-g_i(\mathbf{x}_0)|^2}\le \sqrt{\sum_{i=1}^n \frac{\alpha^2}{n} \|\mathbf{x}_1-\mathbf{x}_0\|^2} \le \alpha\|\mathbf{x}_1-\mathbf{x}_0\| }[/math]

이다. 따라서

[math]\displaystyle{ \begin{align} \alpha\|\mathbf{x}_1-\mathbf{x}_0\| &\ge \|g(\mathbf{x}_1)-g(\mathbf{x}_0)\|\\ &=\|f(\mathbf{x}_1)-Df(\mathbf{a})\mathbf{x}_1-f(\mathbf{x}_0)+Df(\mathbf{a})\mathbf{x}_0\|\\ &\ge \|Df(\mathbf{a})\mathbf{x}_1-Df(\mathbf{a})\mathbf{x}_0\|-\|f(\mathbf{x}_1)-f(\mathbf{x}_0)\|\\ &\ge 2\alpha\|\mathbf{x}_1-\mathbf{x}_0\|-\|f(\mathbf{x}_1)-f(\mathbf{x}_0)\| \end{align} }[/math]

이므로 원하는 결론을 얻는다.

보조정리 3

“
함수 \(f:A\to\mathbb{R}\)이 미분가능하고 \(f\)가 \(\mathbf{x}_0\in A\)에서 극소점을 가지면

[math]\displaystyle{ Df(\mathbf{x}_0)=\mathbf{0} }[/math]
이다. “

\(f\)가 \(\mathbf{x}_0\)에서 극솟값을 가지면 \(\mathbf{x}_0\)의 근방에 속한 임의의 \(\mathbf{x}\)에 대해

[math]\displaystyle{ f(\mathbf{x})\ge f(\mathbf{x}_0) }[/math]

이다. 그러면 영이 아닌 \(\mathbf{u}\)가 주어졌을 때, 충분히 작은 \(t\in \mathbb{R}\)에 대해

[math]\displaystyle{ f(\mathbf{x}_0+t\mathbf{u})-f(\mathbf{x}_0)\ge 0 }[/math]

이다. 이때 \(\mathbf{x}_0\)에서 \(\mathbf{u}\)에 대한 \(f\)의 방향도함수 [math]\displaystyle{ f(\mathbf{x}_0;\mathbf{u}) }[/math]는

[math]\displaystyle{ f(\mathbf{x}_0;\mathbf{u})=\lim_{t\to 0}\frac{f(\mathbf{x}_0+t\mathbf{u})-f(\mathbf{x}_0)}{t} }[/math]

인데,

[math]\displaystyle{ \lim_{t\to +0}\frac{f(\mathbf{x}_0+t\mathbf{u})-f(\mathbf{x}_0)}{t}\ge 0 }[/math]

[math]\displaystyle{ \lim_{t\to -0}\frac{f(\mathbf{x}_0+t\mathbf{u})-f(\mathbf{x}_0)}{t}\le 0 }[/math]

이고 \(f(\mathbf{x}_0;\mathbf{u})\)가 존재하므로

[math]\displaystyle{ f(\mathbf{x}_0;\mathbf{u})=0 }[/math]

이다. \(\mathbf{u}\)가 임의의 값을 가지므로

[math]\displaystyle{ D_jf(\mathbf{x}_0)=\mathbf{0} }[/math]

이고 따라서 원하는 결론을 얻는다.

진술 1

이제 드디어 본 증명으로 넘어갈 차례가 되었다. 보조정리를 보는 동안 원래 진술이 뭐였는지 까먹었다면 진술 문단으로 가서 다시 보고 오자. 보조정리 1에서 \(f\)가 일대일 함수가 되도록 하는 \(\mathbf{a}\)의 근방\(U_0\)이 존재한다. 한편 \(\det Df(\mathbf{x}\)이 연속함수이므로 \(Df(\mathbf{x})\ne 0\)인 \(\mathbf{a}\)의 근방 \(U_1\)이 존재한다.

같이 보기

음함수 정리

참고문헌

Munkres, J. (1991). Analysis on Manifolds. Redwood City, Calif.: Addison-Wesley Pub., Advanced Book Program. ISBN 0201510359