Pyth0nicPenguin Pyth0nicPenguin - 6 months ago 29
Python Question

Generate a lot of random strings

I have a method that will generate 50,000 random strings, save them all to a file, and then run through the file, and delete all duplicates of the strings that occur. Out of those 50,000 random strings, after using

to generate unique ones, on average 63 of them are left.

Function to generate the strings:

def random_strings(size=8, chars=string.ascii_uppercase + string.digits + string.ascii_lowercase):
return ''.join(random.choice(chars) for _ in xrange(size))

Delete duplicates:

with open("dicts/temp_dict.txt", "a+") as data:
created = 0
while created != 50000:
string = random_strings()
data.write(string + "\n")
created += 1
sys.stdout.write("\rCreating password: {} out of 50000".format(created))

print "\nRemoving duplicates.."
with open("dicts\\rainbow-dict.txt", "a+") as rewrite:

Example of before and after:

How can I generate completely random unique strings without duplicates?


You can use set from the start

created = set()
while len(created) < 50000:

And save once outside the loop