Probability Theory
From Theory of Measurements Wiki
In statistical inverse theory, the probability theory plays a crucial role. The knowledge of stochastic processes and probability laws is therefore needed, in order to get some understanding. Traditionally statistical inverse theory belongs to the so-called Bayesian school of probability theory.
In this chapter, we shall run through basic definitions of probability theory.
Contents |
Basic Definitions
First of all, we shall need couple of definitions.
Let
be a set and
be a class of subsets of
. Class
is called a
-algebra, if it fulfills the following three conditions.
- Empty set
- If set
, then also
- Let
, then also
.
Measurable Space
If
is a set and a class
of subsets of
is
-algebra, the pair
is called a measurable space.
Measures
If
is a measurable space, positive mappings (set functions)
are called measures, if the measure of empty set is zero
and they are countably additive
for any pairwise disjoint collections of sets
,
, if
.
Probability Measure, Probability Space
If
,
is called a probability measure. Probability measures are denoted by
. The triplet
is then called a probability space.
Events and Simple Examples
If
is a propability space, the measurable sets in
are often called events. A very simple example of a propability space is for example a finite set
, with
the class of all subsets of
. A simple propability measure is then defined by
for any
and
denoting the number of points in the set
. Another simple example is found by defining
(the unit closed interval in the real axis.) In this case,
is the class of all Lebesque-measurable subsets of
and the propability measure is simply the Lebesque measure. A little more complicated example is the unit two-dimensional square
equipped with the
-algebra of Lebesque-measurable subsets and Lebesgue measure as the probability measure.
Disjoint and Independent Events
Two events
and
are called disjoint, if
. Then
.
Two events
and
are called independent, if
.
A basic example of independent events is the following: Let
equipped with the Lebesgue measure. If
and
are any measurable sets, the events
are independent.
Random variables
Random variables with values in
Let
be a propability space and
a measurable space. Measurable mappings
are called random variables with values in
.
We can also denote the measurable space
with other symbols than
. An often used symbol is
. Random variables with values in
are often used to denote the unknowns in an inverse problem, while random variables with values in
are often used to denote the measurements in an inverse problem.
Conditional probabilities
In the case of simple events, conditional probabilities are simply defined through
This works, if the probability of the condition is greater than zero
Conditional probabilities can also be defined with conditions whose probabilities are zero -- like the having as condition the requirement that some random variable gets a specific value.
In this case, the definition is mathematically complex and we do not show details here. In general, the conditional probabilities are defined through specific
-algebras and they are only almost certainly defined.
We will only need these kinds of conditional probabilities for random variables in Euclidean spaces with distributions defined by densities. The future derivations can be understood, if one understands the conditional probabilities as defined above.
Example of conditional probabilities
We suppose that
is a finite set which consists of the following elements:
-
- A space scientist finds a Gyromitra Esculenta, cooks it and eats it and does not die.
-
- A space scientist finds a Gyromitra Esculenta, does not cook it, eats it and dies.
-
- A space scientist does not find a Gyromitra Esculenta and survives.
Let us suppose that all the probabilities of the points
are equal:
.
Let us define the following events:
- Event A: The space scientist survives.
- Event B: The space scientist finds a Gyromitra Esculenta.
Then
and
with
. The conditional probability of the space scientist surviving after finding a gyromitra esculenta is then
.
Intuitive Approach
Here is an example of a random variable with a finite
set. There are 10000 points in
and the random variable
approximates
a Gaussian distribution with mean in (4,3).
Densities of Random Variables
A very important case of random variables are variables getting their values in Euclidean spaces. These variables are easiest described by their densities. Let us suppose
is a random variable. Moreover, suppose that
We then call
the density of the random variable
.
EXPLANATION: The role of the probability space
is rather obscure when all derivations are based on densities of random variables.
It can be understood as some sort of a lottery
machine: each realization of the random variable
corresponds to a certain
and these
s are drawn from the set
through the propability measure
.
Marginal densities
Let us study a joint density of two random variables
. This is just a density in the space
. The marginal density of
is defined by
The marginal density is just the density of
, with
completely ignored.
Conditional densities
Let us again study two random variables with joint density
. The conditional density is defined by
This formula follows from the general mathematical definition of conditional distributions for random variables. It is also easy to intuitively understand if one thinks
and
are real-valued variables and divides the
and
-axis's into small intervals. The discrete conditional probabilities then approximately equal to the conditional density (provided it is continuous).
Bayes theorem
If we write the previous definition of conditional densities in a two different ways and play around with them a bit, we get the Bayes theorem:
And the Bayes' theorem is:
-
,
where eg.,
can denote a measurement, and
a model, or model parameter.
If we ignore the normalization constant D(y) we can use a more simple form:
-
,
In the picture below we have illustrated a 2-dimensional distribution together with its marginal density for
and an example of a conditional density for
. It can serve as a illustration of the Bayes theorem also: How to find a suitable
if only a density is given.
Suppose we have a random variable
, but we only know it's density
. This very often happens in applications. No probability space is explicitly specified, but in order to be able to use the mathematical formalism, we need the mapping:
. Solution: Let us simply define
to be the identity mapping and the propability measure in
be defined by
for every
in the
-algebra
of all Lebesque-measurable sets in
.
The expectation value of a random variable
The expectation value of a random variable in a linear space is defined to be Ito's average value when all possible realizations are averaged using the propability measure.'
In terms of a density
, this is given by
Note that
because it is completely certain that x gets some value in
.
The moments of a random variable
The moments of a random variable in
are defined as the expectation values of the new random variable defined as products of its components. In particular, the second moments are defined by
and in terms of a density D(x), this is given by
Gaussian random variables
Gaussian random variables are variables with Gaussian densities:
The expectation value of the random variable is given by the center point:
and the central second moments are given by
