Corresponding character matching probability

vivek1 · Jan 21, 2018

I have a dataset of protein, consisting of 10000 sequence each, having length Si
, where 1<=i<=10000. Now, I extracted k-mer "a" from the 1st sequence. The probability of occurrence of amino acid (character of protein sequence) is given by its frequency in the dataset. If I choose k-mer "b" from other sequence, what will be the probability that k-mer "b" matches k-mer "a" at least in r position out of k position?

Greg · Jan 21, 2018

I believe that would be the probability that k-mer a appears in the remaining 9999 sequences. Without numerical data we can't give an exact value.

Corresponding character matching probability

FAQ: Corresponding character matching probability

What is "Corresponding character matching probability"?

How is "Corresponding character matching probability" calculated?

What are the applications of "Corresponding character matching probability"?

Are there any limitations to using "Corresponding character matching probability"?

How can "Corresponding character matching probability" be improved?

Similar threads

Hot Threads

Recent Insights