Random.randrange() is really random?

  • Thread starter ChrisVer
  • Start date
  • Tags
    Random
In summary, the random module of Python's random package (?or class?) contains a warning about its pseudo-random generators not being cryptographically secure, and the documentation for the randrange() function says that the generator is more sophisticated about producing equally distributed values. The code creates a list of frequencies of 0, 1, 2, ...,10000 and calculates the sum of the squared differences between the frequencies and the theoretical value it should have (100). It passes a Chi-squared goodness of fit test with one test.
  • #1
ChrisVer
Gold Member
3,378
465
I was wondering how could I check how random the random.randrange() method of python's random package (?or class?) is...
How could I do that by building a macro?
Maybe check the two hypothesis: is good random generator vs is not?
 
Technology news on Phys.org
  • #2
ChrisVer said:
I was wondering how could I check how random the random.randrange() method of python's random package (?or class?) is...
How could I do that by building a macro?
Maybe check the two hypothesis: is good random generator vs is not?
There is the random module (Lib/random.py), which contains the Random class.
The documention (v3.4.2) contains this warning in its documentation of the random module:
Warning
The pseudo-random generators of this module should not be used for security purposes. Use https://www.physicsforums.com/os.html#os.urandom or SystemRandom if you require a cryptographically secure pseudo-random number generator.

Regarding randrange(), the docs say this:
Changed in version 3.2: randrange() is more sophisticated about producing equally distributed values. Formerly it used a style like int(random()*n) which could produce slightly uneven distributions.
I don't know what you mean by "building a macro".
 
Last edited by a moderator:
  • #4
Mark44 said:
I don't know what you mean by "building a macro".

I am thinking something like the following:
Python:
import random
event = 0
number = 0
lista = []
occuranc = []

while event<10000:
    x = random.randrange(0,100)
    lista.append(x)
    event += 1

while number<100:
    y = lista.count(number)
    occuranc.append(y)
    number += 1

Here I used the randrange to generate 10000 numbers between 0 and 100 (saved in the list "lista"). In the second while I am counting the multiplicity of appearence of 0, 1, 2, ...,100 in those 10000 numbers (the multiplicity is saved into my "occuranc" list).
Normally if the generator produces the numbers from a uniform distribution the probability of occurance for each number should be (1/100) and so each number should appear with multiplicity 10000/100=100.

So I am not sure how to continue the above code into making the chi-squared test of this hypothesis. In other words I would like to test by myself at what confidence level I can say that the randrange is a uniform-distribution generator. Like the test should check whether if you plot the occurance vs number whether it will be described by a [itex]y=const[/itex] relation or anything else (uniform pdf or not).
 
Last edited:
  • #5
You have your list of frequencies. Calculate the sum of the squared differences between your frequencies and the theoretical value it should have (100). The process is laid out here - https://en.wikipedia.org/wiki/Pearson's_chi-squared_test - in the definition section. (Apologies if you already know this...)

If you need help implementing this process in Python, give a holler.
 
  • #6
I would be surprised if any of the standard random number generators would not pass a Chi-squared goodness of fit test.
That being said, none of the "pseudo-random" random number generators are truly random. There is always a test that is sophisticated enough to determine that they are not random. Unless you are going to apply statistical methods like Box-Jenkins time series analysis, the standard random number generators will probably work fine for you. But if you are going to use those methods, you should test your random number generator with them first. Verify that the generator will look random when those methods are applied.
 
Last edited:
  • #7
FactChecker said:
I would be surprised if any of the standard random number generators would not pass a Chi-squared goodness of fit test.
I think I made it... and passed the test by p-value ~0.26 with one test [the rest are almost the same]. So yes, it passed the test. I was a bit stuck with how to implement the code part for the chi2 determination in python because I didn't have my coffee on the desk (a good reasoning to avoid saying I was thinking dumb at that moment).

FactChecker said:
There is always a test that is sophisticated enough to determine that they are not random.
ha, quite intriguing... probably I will try to make that pseudorandom numb generator fail next.

DrClaude said:
Have a look at this excerpt from the book Beautiful Testing to see how to test RNGs.
Yup, that book contains the same test I am trying to apply, but what it actually tests there is whether given a uniform distribution it works fine to go to a non-uniform one (or so I understood) . It was interesting reading however, and almost the same method I did (the "bucket method" with 100 bins is the same I tried).
 
  • #8
ChrisVer said:
I will try to make that pseudorandom numb generator fail next.
If your work involves a number generator and the work requires you to use more sensitive statistical tests, you should make sure that the number generator is good enough. I think problems would be rare. Most real-world applications of techniques like the Box-Jenkins time series analysis are on real data, not generated data. Of course, examples for a class might be artificially generated. That is where I encountered a problem once, long ago. I generated artificial data for classwork and the number generator contained bad autocorrelations.
 

FAQ: Random.randrange() is really random?

1. Is the random.randrange() function truly random?

Yes, the random.randrange() function uses a pseudo-random number generator to generate a sequence of numbers that appear to be random. However, it is not truly random as it follows a specific algorithm and can be reproduced with the same seed.

2. How does the random.randrange() function work?

The random.randrange() function works by taking a range of numbers and selecting a random integer within that range. It uses a pseudo-random number generator to generate a random sequence of numbers, and then selects one of those numbers as the output.

3. Can the random.randrange() function produce duplicate numbers?

Yes, the random.randrange() function can produce duplicate numbers if the range of numbers is larger than the number of possible outcomes. This is because it uses a pseudo-random number generator and is not truly random.

4. Is the output of the random.randrange() function predictable?

The output of the random.randrange() function is not entirely predictable, but it can be reproduced with the same seed. This means that if the same seed is used, the function will produce the same sequence of numbers.

5. How can I make the random.randrange() function more random?

To make the random.randrange() function more random, you can use a larger range of numbers or change the seed value. Alternatively, you can use a different random function that uses a different algorithm for generating random numbers.

Similar threads

Replies
4
Views
1K
Replies
1
Views
2K
Replies
10
Views
3K
Replies
6
Views
2K
Replies
3
Views
2K
Replies
4
Views
1K
Replies
30
Views
3K
Back
Top