HighwayJohn - 9 months ago 42

Python Question

I have the following problem. It starts with a list which I get. Let's say for example I got the list:

`A=['1-00', '10--']`

`B=['01-1', '1-00', '0-01']`

In A I have one string of rank two

`'10--'`

`'1-00'`

Here I have only one rank two string

`'10--'`

`'1000', '1001', '1010', '1011'`

`'1-00'`

`'1100'`

`'1000'`

I think its a tough problem but guess there are some commands which I don't know that could help. Hope you can give me some hints :)

Answer

If I understand your question, you want to sort by the number of fixed digits ("rank") first, then compute the total number of unique possible bit strings at or below each rank. So for `A`

, you'd want it sorted to `['10--', '1-00']`

(putting lower rank first), and you'd want to get the number of unique patterns for all rank 2 and lower patterns (in this case, there is only one), then for all rank 3 and lower patterns (again, in this case, only one), etc.

If that understanding is correct, the first step is sorting:

```
A.sort(key=lambda x: x.count('0') + x.count('1'))
# Or if all patterns are of length 4, the slightly simpler:
A.sort(key=lambda x: 4 - x.count('-'))
```

After that, you process by ranks using `itertools.groupby`

, generating unique outputs with `itertools.product`

and storing them in a `set`

so you remove duplicates:

```
from __future__ import print_function # For consistent print on Py2/Py3
from future_builtins import map # Only on Py2, to get Py3 generator based map
from itertools import groupby, product
uniquebits = set()
for rank, pats in groupby(A, key=lambda x: x.count('0') + x.count('1')):
for pat in pats:
# product can be abused to sub 0, then 1 for each wildcarded space
patpieces = [let if let != '-' else '01' for let in pat]
# Generate all patterns (using ''.join reduces storage by converting
# tuples back to str) updating the set to get unique values seen so far
uniquebits.update(map(''.join, product(*patpieces)))
print("For rank", rank, "and below,", len(uniquebits), "unique bit strings")
```

Which for your `A`

would output:

```
For rank 2 and below, 4 unique bit strings
For rank 3 and below, 5 unique bit strings
```

If you want to avoid gaps, and output for all ranks out to the maximum length of the patterns, it's slightly more complicated (since `groupby`

only produces groups when there is at least one input with that "key", in this case, at least one input of a given rank):

```
lastrank = None # Change to 1 or 2 to force ranks prior to lowest rank to output
for rank, pats in groupby(A, key=lambda x: x.count('0') + x.count('1')):
if lastrank is not None and lastrank + 1 != rank:
for fillrank in range(lastrank + 1, rank):
print("For rank", fillrank, "and below,", len(uniquebits), "unique bit strings")
lastrank = rank
for pat in pats:
# product can be abused to sub 0, then 1 for each wildcarded space
patpieces = [let if let != '-' else '01' for let in pat]
# Generate all patterns (using ''.join reduces storage by converting
# tuples back to str) updating the set to get unique values seen so far
uniquebits.update(map(''.join, product(*patpieces)))
print("For rank", rank, "and below,", len(uniquebits), "unique bit strings")
# Or len(max(A, key=len))+1 if uneven lengths
for rank in range(lastrank + 1, len(A[0]) + 1):
print("For rank", rank, "and below,", len(uniquebits), "unique bit strings")
```

Source (Stackoverflow)