Expected value of minimum Hamming distance

In summary, the Hamming distance (HD) is the number of positions at which two strings of equal length differ. The minimum Hamming distance (MHD) of a set of k strings is the smallest HD over all pairs of strings in the set. For k binary strings of length n selected at random out of the N=2^n possible strings, the expected value of the MHD is the average of the MHD over all possible selections of size k. The tables in the thread give, for N=8 and N=16, the number of selections that attain each MHD value for each k.
  • #1
billobillo
The Hamming distance (HD) between two strings of equal length is the number of positions at which the two strings differ; for example, the HD between "gold" and "wolf" is 2. The minimum Hamming distance (MHD) of a set of k strings is the smallest HD over all pairs of strings in the set.
My question is: if I randomly select k distinct binary strings of length n out of the N=2^n possible strings, what is the expected value of the minimum Hamming distance (MHD) between the k selected strings?
I need a general formula for the expected value of the MHD between k random strings of length n selected out of N, where N=2^n.
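
The two definitions above can be sketched in a few lines of Python. This is only an illustration: the function names `hamming_distance` and `min_hamming_distance` are made up for this sketch and do not come from the thread.

```python
from itertools import combinations


def hamming_distance(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def min_hamming_distance(strings) -> int:
    """Smallest pairwise Hamming distance in a collection of at least two strings."""
    return min(hamming_distance(a, b) for a, b in combinations(strings, 2))


print(hamming_distance("gold", "wolf"))              # 2, the example from the post
print(min_hamming_distance(["000", "011", "101"]))   # 2: every pair differs in two positions
```

The MHD is taken over pairs, so a selection of k strings needs k(k-1)/2 distance evaluations.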


Below is a table that shows, for N=8 and every k, each MHD value and its occurrence frequency (OF), i.e. the number of selections of size k that have that MHD:

MHD   OF
      k=2   k=3   k=4   k=5   k=6   k=7   k=8
1      12    48    68    56    28     8     1
2      12     8     2     -     -     -     -
3       4     -     -     -     -     -     -
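
Counts of this kind can be reproduced by brute force for small n. The sketch below is one possible way to do it, not necessarily how the original poster produced the table: it encodes each string as an integer, so the Hamming distance becomes the number of 1 bits in the XOR of two integers. The name `mhd_counts` is an assumption of this sketch.

```python
from collections import Counter
from itertools import combinations
from math import comb


def mhd_counts(n: int, k: int) -> Counter:
    """Count the k-subsets of all 2**n binary strings by minimum Hamming distance."""
    counts = Counter()
    for subset in combinations(range(2 ** n), k):
        # Strings are encoded as integers; XOR marks the differing bit positions.
        mhd = min(bin(a ^ b).count("1") for a, b in combinations(subset, 2))
        counts[mhd] += 1
    return counts


n = 3  # N = 2**3 = 8 strings
for k in range(2, 2 ** n + 1):
    counts = mhd_counts(n, k)
    # Sanity check: every k-subset is counted exactly once.
    assert sum(counts.values()) == comb(2 ** n, k)
    print(k, dict(sorted(counts.items())))
```

The number of subsets grows as C(2^n, k), so exhaustive enumeration is only practical for small n.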

Below is another table, for N=16:

MHD   OF
        k=2    k=3    k=4    k=5    k=6    k=7    k=8    k=9  ...
1        32    352   1592   4240   7952  11424  12868  11440
2        48    208    228    128     56     16      2      -
3        32      -      -      -      -      -      -      -
4         8      -      -      -      -      -      -      -
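
For a given k, the expected MHD follows from one column of the table: weight each MHD value by its occurrence frequency and divide by the column total. The sketch below does this for the N=16 counts listed in the post; the names `OF_N16` and `expected_mhd` are assumptions of this sketch, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# Occurrence frequencies copied from the N=16 table: {k: {MHD: count}}.
OF_N16 = {
    2: {1: 32, 2: 48, 3: 32, 4: 8},
    3: {1: 352, 2: 208},
    4: {1: 1592, 2: 228},
    5: {1: 4240, 2: 128},
    6: {1: 7952, 2: 56},
    7: {1: 11424, 2: 16},
    8: {1: 12868, 2: 2},
    9: {1: 11440},
}


def expected_mhd(column: dict) -> Fraction:
    """Mean of the MHD values, weighted by their occurrence counts."""
    total = sum(column.values())
    return Fraction(sum(d * c for d, c in column.items()), total)


for k, column in OF_N16.items():
    e = expected_mhd(column)
    print(f"k={k}: E[MHD] = {e} ~ {float(e):.4f}")
```

For k=2 this gives 32/15, and the value falls to exactly 1 once every selection contains a pair at distance 1 (k=9 in the table).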

There are more results if you need them. I appreciate any help.
Regards.
 
  • #2
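By definition, the expected value of the MHD between k strings of length n selected uniformly at random out of N (N=2^n) is the average of the MHD over all possible selections of size k:

$$E[\mathrm{MHD}] = \frac{1}{\binom{N}{k}} \sum_{\{s_1, \dots, s_k\}} \min_{1 \le i < j \le k} \mathrm{HD}(s_i, s_j)$$

where the sum runs over all k-element subsets {s_1, s_2, ..., s_k} of the N strings.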
 

FAQ: Expected value of minimum Hamming distance

What is the expected value of minimum Hamming distance?

The expected value of the minimum Hamming distance is the average, over all random selections of k strings of equal length, of the smallest number of differing positions between any two of the selected strings. It measures how close the closest pair in a random selection tends to be.

How is the expected value of minimum Hamming distance calculated?

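The expected value of the minimum Hamming distance can be calculated by summing the minimum Hamming distance over all possible selections of k strings and dividing by the total number of selections. Equivalently, each possible MHD value is weighted by the number of selections in which it occurs.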
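
As a worked example, take the occurrence frequencies reported in the thread for N=8 and k=2: 12 pairs at distance 1, 12 at distance 2, and 4 at distance 3, out of 28 pairs in total. The weighted average is then:

$$E[\mathrm{MHD}] = \frac{1 \cdot 12 + 2 \cdot 12 + 3 \cdot 4}{28} = \frac{48}{28} = \frac{12}{7} \approx 1.71$$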

What is the significance of the expected value of minimum Hamming distance?

The minimum Hamming distance is important in fields such as coding theory, cryptography, and DNA sequencing. In coding theory, for example, a code whose codewords have minimum distance d can detect up to d-1 errors in a transmitted word. The expected value indicates how well separated a randomly chosen set of strings is likely to be.

Can the expected value of minimum Hamming distance be used to compare different data sets?

Yes, the expected value of the minimum Hamming distance can be used to compare different data sets. A lower value indicates that selections from the data set tend to contain a pair of very similar strings, while a higher value indicates that the strings tend to be better separated.

How does expected value of minimum Hamming distance differ from other distance measures?

The expected value of the minimum Hamming distance differs from measures such as the average pairwise Hamming distance in that it depends only on the closest pair of strings in each selection, rather than on all pairs. A single pair of nearly identical strings is therefore enough to lower it.
