Solving Transition Matrix: Octopus Training

In summary: Thinking)The initial distribution is the probability of the state $i$... (Thinking)At the beginning of the workout, the octopus is in level 1 with probability 1. So the initial distribution is $(1, 0, 0)$. (Thinking)In summary, we discussed the training of an octopus to choose object A over object B. The octopus can be in three levels of training, with level 1 being unable to remember which object was rewarded, level 2 briefly remembering but potentially forgetting, and level 3 always remembering. The transition matrix for this Markov chain was set up and the initial distribution was determined to be (1, 0, 0). The probability that the oct
  • #1
mathmari
Gold Member
MHB
5,049
7
Hey! :eek:

An octopus is trained to chosose from two objects A and B always the object A. Repeated training shows the octopus both objects, if the octopus chooses object A, he will be rewarded. The octopus can be in 3 levels of training:
Level 1: He can not remember which object was rewarded.
Level 2: He briefly remembers, chooses object A, but he can forget it again.
Level 3: He remembers, chooses object A and never forgets it.
The probability that the octopus rises from level 1 to level 2 is the same great, as he learns nothing.
When the octopus reaches level 2, the probability of falling back to level 1 is the same as staying in level 2 and rising to level 3 together.
The probability of moving up from level 2 to level 3 is 5 times greater than staying in level 2.
The octopus can not skip any training level and we assume that he is in level 1 at the beginning of the workout.
  1. Set up the associated transition matrix of this markov chain and specify the initial distribution.
  2. What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?
  3. How big are the probabilities of the individual training levels after 3 attempts?
I have done the following:

  1. The stochastic matrix is in the following form: $$P=\begin{pmatrix}P_{1,1} & P_{1,2} & P_{1,3} \\ P_{2,1} & P_{2,2} & P_{2,3} \\ P_{3,1} & P_{3,2} & P_{3,3}\end{pmatrix}$$
    We have the following information:
    • $P_{1,2}=P(\text{Learns nothing})$
    • $P_{2,1}=P_{2,2}+P_{2,3}$
    • $P_{2,3}=5\cdot P_{2,2}$
    At each row of the transition matrix the sum of the probabilities is equal to $1$, so we have that $P_{1,1}+P_{1,2}+P_{1,3}=1$, $P_{2,1}+P_{2,2}+P_{2,3}=1$, $P_{3,1}+P_{3,2}+P_{3,3}=1$.

    Which is the probability $P(\text{Learns nothing})$ ? (Wondering)

    We have that $P_{2,1}+P_{2,2}+P_{2,3}=1$ and $P_{2,1}=P_{2,2}+P_{2,3}$ and $P_{2,3}=5\cdot P_{2,2}$, so we get the following: \begin{align*}P_{2,2}+P_{2,3}+P_{2,2}+P_{2,3}=1&\Rightarrow 2\cdot P_{2,2}+2\cdot P_{2,3}=1 \\ & \Rightarrow 2\cdot P_{2,2}+2\cdot (5\cdot P_{2,2})=1 \\ & \Rightarrow 2\cdot P_{2,2}+10\cdot P_{2,2}=1 \\ & \Rightarrow 12\cdot P_{2,2}=1 \\ & \Rightarrow P_{2,2}=\frac{1}{12}\end{align*}
    So, we get also $\displaystyle{P_{2,3}=5\cdot P_{2,2}=5\cdot \frac{1}{12}=\frac{5}{12}}$ and $\displaystyle{P_{2,1}=P_{2,2}+P_{2,3}=\frac{1}{12}+\frac{5}{12}=\frac{6}{12}=\frac{1}{2}}$.

    So, we get the transition matrix: $$P=\begin{pmatrix}P_{1,1} & P_{1,2} & P_{1,3} \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ P_{3,1} & P_{3,2} & P_{3,3}\end{pmatrix}$$

    How can we complete the remaining matrix? (Wondering)
 
Physics news on Phys.org
  • #2
mathmari said:
The probability that the octopus rises from level 1 to level 2 is the same great, as he learns nothing.

Which is the probability $P(\text{Learns nothing})$ ?

Hey mathmari!

When at level 1, what's the alternative to rising to level 2?
Did the octopus learn anything in that case? (Wondering)
 
  • #3
I like Serena said:
When at level 1, what's the alternative to rising to level 2?
Did the octopus learn anything in that case? (Wondering)

At level 1 the octopus can not remember which object was rewarded and at level 2 he briefly remembers, chooses object A, but he can forget it again. Does that mean that the octopus doesn't learn anything when he rises from level 1 to level 2? (Wondering)
 
  • #4
mathmari said:
At level 1 the octopus can not remember which object was rewarded and at level 2 he briefly remembers, chooses object A, but he can forget it again. Does that mean that the octopus doesn't learn anything when he rises from level 1 to level 2? (Wondering)

From level 1 we can either go to level 2 ($P_{1,2}$), or we can stay at level 1 ($P_{1,1}$) can't we?
And if we stay at level 1, we have 'Level 1: He can not remember which object was rewarded.'.
Isn't that the same as that he didn't learn anything? (Wondering)
 
  • #5
I like Serena said:
From level 1 we can either go to level 2 ($P_{1,2}$), or we can stay at level 1 ($P_{1,1}$) can't we?
And if we stay at level 1, we have 'Level 1: He can not remember which object was rewarded.'.
Isn't that the same as that he didn't learn anything? (Wondering)

Ah yes. So, we have that $P_{1,1}=P(\text{Learns nothing})$. It also given that $P_{1,2}=P(\text{Learns nothing})$. Since we cannot skip a level the probability $P_{1,3}$ is equal to $0$, right?
So we get that $$P_{1,1}+P_{1,2}+P_{1,3}=1 \Rightarrow P(\text{Learns nothing})+P(\text{Learns nothing})+0=1 \Rightarrow 2\cdot P(\text{Learns nothing})=1 \Rightarrow P(\text{Learns nothing})=\frac{1}{2}$$

Therefore, we have that $P_{1,1}=\frac{1}{2}$ and $P_{1,2}=\frac{1}{2}$. Is this correct? (Wondering)
 
  • #6
Yep. (Nod)
 
  • #7
I like Serena said:
Yep. (Nod)

So, we get $$P=\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ P_{3,1} & P_{3,2} & P_{3,3}\end{pmatrix}$$

How can we calculate the last row? Do we have any information for that? (Wondering)
 
  • #8
mathmari said:
How can we calculate the last row? Do we have any information for that? (Wondering)

We have 'Level 3: He remembers, chooses object A and never forgets it.' (Thinking)
 
  • #9
I like Serena said:
We have 'Level 3: He remembers, chooses object A and never forgets it.' (Thinking)

That means that the probability that he falls to level 1 or 2 is equal to 0. Since he will never forget it the probability that he stays in level 3 is equal to 1.

Therefore we get $$P=\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}$$

Is everything correct? (Wondering)
 
  • #10
Yup. (Nod)
 
  • #11
mathmari said:
1. specify the initial distribution.

How can we find the initial distribution?
mathmari said:
2. What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?

The probability that the octopus learns nothing in the first 5 attempts is equal to $5\cdot P_{1,1}=5\cdot \frac{1}{2}=\frac{5}{2}$, or not?

Is the probability that he stays completely in level 1 equal to $(\text{total number of attempts})\cdot P_{1,1}$ ?
 
  • #12
mathmari said:
How can we find the initial distribution?

It says 'we assume that he is in level 1 at the beginning of the workout.' (Thinking)

mathmari said:
The probability that the octopus learns nothing in the first 5 attempts is equal to $5\cdot P_{1,1}=5\cdot \frac{1}{2}=\frac{5}{2}$, or not?

Is the probability that he stays completely in level 1 equal to $(\text{total number of attempts})\cdot P_{1,1}$ ?

A probability cannot be greater than 1 can it?
Shouldn't we multiply the probabilities of events that are independent and that are all true? (Wondering)
 
  • #13
I like Serena said:
It says 'we assume that he is in level 1 at the beginning of the workout.' (Thinking)

The initial distribution is the probability of the state $i$ at time $n=0$. Since we assume that he is in level 1 at the beginning of the workout, we have that the probability that the octopus at time $n=0$ is at level 1 is equal to 1 and the probability that he is at level 2 or 3 is equal to 0, since we know that he is at level 1. So, we have that the initial distribution is equal to $(1,0,0)$. Is this correct? (Wondering)
2. What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?

It holds that the distribution at time n is equal to $P(n)^n\cdot p(0)$, where $p(0)$ is the initial distribution. The $i$th component of that vector is the probability that at time $n$ we are at state $i$.

So, the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1 is equal to the first component of the vector $P(n)^5\cdot p(0)$, right?

We have that $p(0)=(1,0,0)$ and $$P=\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}\Rightarrow P^5=\frac{1}{248832}\begin{pmatrix}69156& 46266 & 133410 \\ 46266 & 30601 & 171965 \\ 0 & 0 & 248832\end{pmatrix}$$

So, we get that $$P(n)^5\cdot p(0)=\begin{pmatrix}\frac{69156}{248832} \\ \frac{46266}{248832} \\ 0 \end{pmatrix}$$

Therefore, the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1 is equal to $\frac{69156}{248832}=27,79\%$.

Is everything correct? (Wondering)
3. How big are the probabilities of the individual training levels after 3 attempts?

We have to caclulate $P(n)^3\cdot p(0)$, or not?

We have that $$P^3=\frac{1}{1728}\begin{pmatrix}648& 474 & 570 \\ 474 & 289 & 965 \\ 0 & 0 & 1728\end{pmatrix}$$

Therefore, we get that $$P(n)^3\cdot p(0)=\begin{pmatrix}\frac{648}{1728} \\ \frac{474}{1728} \\ 0 \end{pmatrix}$$ whoch means that the probability to be at level 1 after 3 attempts is equal to $\frac{648}{1728}$, to be at level 2 the probability is equal to $\frac{474}{1728}$ and to be at level 3 the probability is equal to $0$.

Is this correct? (Wondering)
 
Last edited by a moderator:
  • #14
mathmari said:
The initial distribution is the probability of the state $i$ at time $n=0$. Since we assume that he is in level 1 at the beginning of the workout, we have that the probability that the octopus at time $n=0$ is at level 1 is equal to 1 and the probability that he is at level 2 or 3 is equal to 0, since we know that he is at level 1. So, we have that the initial distribution is equal to $(1,0,0)$. Is this correct?

Yep. (Nod)

mathmari said:
2. What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?

It holds that the distribution at time n is equal to $P(n)^n\cdot p(0)$, where $p(0)$ is the initial distribution. The $i$th component of that vector is the probability that at time $n$ we are at state $i$.

So, the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1 is equal to the first component of the vector $P(n)^5\cdot p(0)$, right?

Nope. That is the probability that the octopus ends up in level 1 after 5 attempts.
In particular it does not mean that he stays completely at level 1. (Shake)

We are looking for $P(\text{1st attempt fails}\land \text{2nd attempt fails}\land ... \land \text{5th attempt fails})$.
Previously you calculated this with the sum rule... but the sum rule only applies if we have OR instead of AND, and moreover, if the events are mutually exclusive, which they are not.
Shouldn't we use the product rule instead? (Thinking)
mathmari said:
3. How big are the probabilities of the individual training levels after 3 attempts?

We have to caclulate $P(n)^3\cdot p(0)$, or not?

We have that $$P^3=\frac{1}{1728}\begin{pmatrix}648& 474 & 570 \\ 474 & 289 & 965 \\ 0 & 0 & 1728\end{pmatrix}$$

Therefore, we get that $$P(n)^3\cdot p(0)=\begin{pmatrix}\frac{648}{1728} \\ \frac{474}{1728} \\ 0 \end{pmatrix}$$ whoch means that the probability to be at level 1 after 3 attempts is equal to $\frac{648}{1728}$, to be at level 2 the probability is equal to $\frac{474}{1728}$ and to be at level 3 the probability is equal to $0$.

Is this correct?

Let's take a step back.
Suppose we do only 1 attempt.
Which distribution do we get?
Is it what we expect? (Wondering)
 
  • #15
I like Serena said:
Nope. That is the probability that the octopus ends up in level 1 after 5 attempts.
In particular it does not mean that he stays completely at level 1. (Shake)

We are looking for $P(\text{1st attempt fails}\land \text{2nd attempt fails}\land ... \land \text{5th attempt fails})$.
Previously you calculated this with the sum rule... but the sum rule only applies if we have OR instead of AND, and moreover, if the events are mutually exclusive, which they are not.
Shouldn't we use the product rule instead? (Thinking)
We have that \begin{align*}P&(\text{1st attempt fails}\land \text{2nd attempt fails}\land \ldots \land \text{5th attempt fails}) \\ & =P(\text{1st attempt fails})\cdot P(\text{2nd attempt fails})\cdot \ldots \cdot P(\text{5th attempt fails}) \\ & =P_{1,1}\cdot P_{1,1}\cdot P_{1,1}\cdot P_{1,1}\cdot P_{1,1} \\ & =P_{1,1}^5 \\ & = \left (\frac{1}{2}\right )^5 \\ & =\frac{1}{32}\end{align*}

So, is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1 equal to $\frac{1}{32}$ ?

But why does he stays completely in level 1 with this probability?

(Wondering)
I like Serena said:
Let's take a step back.
Suppose we do only 1 attempt.
Which distribution do we get?
Is it what we expect? (Wondering)

This distribution is equal to $$P\cdot p(0)=\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}\cdot \begin{pmatrix}1\\ 0 \\ 0\end{pmatrix}=\begin{pmatrix}\frac{1}{2}\\ \frac{1}{2} \\ 0\end{pmatrix}$$ or not?

That means that after the first attempt the octopus can either stay at level 1 or rise to level 2, and each of the possibilities has the probability $\frac{1}{2}$, right?

(Wondering)
 
  • #16
mathmari said:
We have that \begin{align*}P&(\text{1st attempt fails}\land \text{2nd attempt fails}\land \ldots \land \text{5th attempt fails}) \\ & =P(\text{1st attempt fails})\cdot P(\text{2nd attempt fails})\cdot \ldots \cdot P(\text{5th attempt fails}) \\ & =P_{1,1}\cdot P_{1,1}\cdot P_{1,1}\cdot P_{1,1}\cdot P_{1,1} \\ & =P_{1,1}^5 \\ & = \left (\frac{1}{2}\right )^5 \\ & =\frac{1}{32}\end{align*}

So, is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1 equal to $\frac{1}{32}$ ?

Yep. (Nod)
And note that we assume that the attempts are independent, because otherwise we wouldn't be allowed to apply this product rule.

mathmari said:
But why does he stays completely in level 1 with this probability?

It means that after every attempt he is still in level 1. Isn't that what would be intended with 'he stays completely in level 1'? (Wondering)


This distribution is equal to $$P\cdot p(0)=\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}\cdot \begin{pmatrix}1\\ 0 \\ 0\end{pmatrix}=\begin{pmatrix}\frac{1}{2}\\ \frac{1}{2} \\ 0\end{pmatrix}$$ or not?

That means that after the first attempt the octopus can either stay at level 1 or rise to level 2, and each of the possibilities has the probability $\frac{1}{2}$, right?

Heh. The values of those probabilities is a bit unfortunate. (Lipssealed)
We picked the leftmost column that just happened to be the same as the topmost row in this case.
But should we have the leftmost column? Or should we have the topmost row? (Wondering)

Shouldn't we calculate:
$$
\begin{pmatrix}1 & 0 &\ 0\end{pmatrix} \cdot
\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}
=\begin{pmatrix}\frac{1}{2}& \frac{1}{2} & 0\end{pmatrix}$$
? (Wondering)
 
  • #17
I like Serena said:
Yep. (Nod)
And note that we assume that the attempts are independent, because otherwise we wouldn't be allowed to apply this product rule.
To clarify... At the question 2 "What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?" we want to calculate the possibility that in the first 5 attempts the octopus is always in level 1, right?
Or does "...stays completetly in level 1" means not only in the first 5 attempts but in each attempt?

(Wondering)
I like Serena said:
Heh. The values of those probabilities is a bit unfortunate. (Lipssealed)
We picked the leftmost column that just happened to be the same as the topmost row in this case.
But should we have the leftmost column? Or should we have the topmost row? (Wondering)

Shouldn't we calculate:
$$
\begin{pmatrix}1 & 0 &\ 0\end{pmatrix} \cdot
\begin{pmatrix}\frac{1}{2} & \frac{1}{2} & 0 \\ \frac{1}{2} & \frac{1}{12} & \frac{5}{12} \\ 0 & 0 & 1\end{pmatrix}
=\begin{pmatrix}\frac{1}{2}& \frac{1}{2} & 0\end{pmatrix}$$
? (Wondering)

Ah so we have to multiply the initial distribution from the left side with the transition matrix, right? (Wondering)
 
  • #18
mathmari said:
To clarify... At the question 2 "What is the probability that the octopus learns nothing in the first 5 attempts and stays completely in level 1?" we want to calculate the possibility that in the first 5 attempts the octopus is always in level 1, right?
Or does "...stays completetly in level 1" means not only in the first 5 attempts but in each attempt?

I don't really know what was intended with the word 'completely'.
My interpretation is that they mean after every attempt, but they could also mean before and after the 5 attempts combined.
I was hoping that you could clarify. (Worried)

mathmari said:
Ah so we have to multiply the initial distribution from the left side with the transition matrix, right? (Wondering)

When multiplying on the left, we get the topmost row, which gives the probabilities that we either stay in level 1, or that we end up in level 2.
So yes, I think we should get the topmost row, and therefore we have to multiply on the left. (Thinking)
 
  • #19
I like Serena said:
I don't really know what was intended with the word 'completely'.
My interpretation is that they mean after every attempt, but they could also mean before and after the 5 attempts combined.
I was hoping that you could clarify. (Worried)

If it is intended that in each of the first 5 attempts the octopus stays in level 1 then the probability is equal to $P_{1,1}^5=\frac{1}{32}$.

If it is intended that generally he stays in level 1, so not only in the first 5 attempts but in all attempts, then we cannot say something about the probability, can we? (Wondering)



I like Serena said:
When multiplying on the left, we get the topmost row, which gives the probabilities that we either stay in level 1, or that we end up in level 2.
So yes, I think we should get the topmost row, and therefore we have to multiply on the left. (Thinking)

Ah ok!

To get the probabilities of the individual training levels after 3 attempts do we have to multiply the initial distribution on the left with the transition matrix raised to $3$, i.e. $(1, 0, 0)\cdot P^3$ ? (Wondering)
 
  • #20
mathmari said:
If it is intended that in each of the first 5 attempts the octopus stays in level 1 then the probability is equal to $P_{1,1}^5=\frac{1}{32}$.

Yep.

mathmari said:
If it is intended that generally he stays in level 1, so not only in the first 5 attempts but in all attempts, then we cannot say something about the probability, can we?

Staying in level 1 after any number of attempts has probability $(\frac 12)^n \to 0$ if $n\to \infty$.
Beginning in level 1, and being again in level 1 after 5 attempts has the top left most probability in $P^5$, which is what you already found. (Thinking)
mathmari said:
To get the probabilities of the individual training levels after 3 attempts do we have to multiply the initial distribution on the left with the transition matrix raised to $3$, i.e. $(1, 0, 0)\cdot P^3$ ? (Wondering)

Yep. (Nod)
 
  • #21
I like Serena said:
Yep.
Staying in level 1 after any number of attempts has probability $(\frac 12)^n \to 0$ if $n\to \infty$.
Beginning in level 1, and being again in level 1 after 5 attempts has the top left most probability in $P^5$, which is what you already found. (Thinking)

Yep. (Nod)
Thank you so much! (Mmm)
 

FAQ: Solving Transition Matrix: Octopus Training

What is a transition matrix?

A transition matrix is a mathematical tool used to represent the probabilities of transitioning from one state to another in a system over a period of time. In the context of octopus training, it can be used to track the progress and success of different training techniques.

How is a transition matrix calculated?

A transition matrix is calculated by dividing the number of transitions from one state to another by the total number of transitions in the system. This is typically represented as a square matrix, with each row and column representing a different state in the system.

What is the purpose of solving a transition matrix in octopus training?

The purpose of solving a transition matrix in octopus training is to analyze the effectiveness of different training techniques and strategies. By tracking the transitions between different states, such as from untrained to partially trained, or from partially trained to fully trained, trainers can determine which methods are most successful in achieving the desired behavior in the octopus.

How can a transition matrix be used to improve octopus training?

By analyzing the transition matrix, trainers can identify areas where the training may be lacking and make adjustments to improve the overall success rate. For example, if there are high numbers of transitions from partially trained to untrained, a different training approach may be needed to solidify the octopus's skills and prevent regression.

Are there any limitations to using a transition matrix in octopus training?

While a transition matrix can provide valuable insights into the effectiveness of training techniques, it is important to note that it is based on probabilities and cannot guarantee success. Other factors, such as the individual octopus's temperament and outside influences, may also play a role in their progress and behavior.

Back
Top