Probability to realize string using subsequences

In summary, the conversation discusses the probability of realizing a given string using a set of random subsequences of that string. It also raises questions about the optimal number of subsequences and the structure of those subsequences for optimal probability. The exact definition of "realizing" the string and the required lengths of the subsequences are unclear.
  • #1
MonD1
2
0
Given a input string and 'n' random subsequences of that string , what is the probability of realizing original string using these subsequences correctly?

Example :

Given string = "MISSISSIPPI"
Subsequences:
1) "MISS"
2) "III"
3) "MIP"
4) "SIS"
5) "IS"
6 ) "SP"
7) "MP"How many subsequences will be required for optimal probability?
Is there any specific structure for subsequences for optimal probability?
 
Physics news on Phys.org
  • #2
Re: probability to realize string using subsequences

Can you post what you have tried or what your thoughts are on how to begin? This will give our helpers a better idea how best to help you. :D
 
  • #3
Re: probability to realize string using subsequences

MarkFL said:
Can you post what you have tried or what your thoughts are on how to begin? This will give our helpers a better idea how best to help you. :D

Input
string of length 'm'
'k' subsequences of the string each of length 'n' (n <m)

What is the probability that by using these subsequences we can come up with the original string?
 
  • #4
i still don't understand the question. post #1 doesn't appear to have substrings of the same length as you required in post #3.

Its also not clear what you mean by probablility of "getting the original string" from your substrings.
 
  • #5

The probability of realizing the original string using the given subsequences depends on the length and composition of the input string, as well as the number and composition of the subsequences. It is difficult to determine an exact probability without knowing these factors.

However, in general, the more subsequences that are available and the longer they are, the higher the probability of realizing the original string correctly. This is because more subsequences provide more opportunities to match the characters in the original string.

The optimal number of subsequences for the highest probability would depend on the specific input string and the composition of the subsequences. In some cases, a larger number of shorter subsequences may provide a higher probability, while in other cases, a smaller number of longer subsequences may be more effective.

There is no specific structure for subsequences that guarantees optimal probability. However, it is important for the subsequences to cover a wide range of characters and positions in the original string in order to increase the chances of a correct match. Additionally, the subsequences should not overlap too much, as this can decrease the overall probability.
 

FAQ: Probability to realize string using subsequences

What is the concept of "probability to realize string using subsequences"?

The probability to realize a string using subsequences refers to the likelihood of obtaining a given string by combining shorter, overlapping sequences of characters from a larger set of characters. This concept is often used in fields such as bioinformatics and text analysis.

How is the probability to realize a string using subsequences calculated?

The calculation of the probability to realize a string using subsequences involves considering the total number of possible subsequences that could be formed from the larger set of characters and then determining the likelihood of obtaining the desired string from those subsequences. This can be done using mathematical formulas and algorithms.

What factors can affect the probability to realize a string using subsequences?

The probability to realize a string using subsequences can be affected by various factors, such as the length of the desired string, the size of the larger set of characters, and the specific combination of characters needed to form the string. Additionally, the presence of repeated characters in the larger set can also impact the probability.

Can the probability to realize a string using subsequences be increased?

Yes, the probability to realize a string using subsequences can be increased by increasing the size of the larger set of characters or by decreasing the length of the desired string. Additionally, selecting a larger number of overlapping subsequences from the larger set can also increase the likelihood of obtaining the desired string.

How is the concept of "probability to realize string using subsequences" applied in real-world scenarios?

The concept of probability to realize a string using subsequences has various applications in fields such as genetics, data analysis, and natural language processing. For example, in genetics, it can be used to determine the likelihood of a specific genetic sequence occurring in a DNA sample. In data analysis, it can be used to identify patterns and similarities between different data sets. In natural language processing, it can be used to generate text or to predict the likelihood of certain words appearing in a sentence based on their frequency in a given language.

Back
Top