user3338505 - 4 months ago 22

Python Question

I have a array

`x`

`x=np.array(["83838374747412E61E4C202C004D004D004D020202C3CF",`

"8383835F6260127314A0127C078E07090705023846C59F",

"83838384817E14231D700FAC09BC096808881E1C1BC68F",

"8484835C535212600F860A1612B90FCF0FCF012A2AC6BF",

"848484787A7A1A961BAC1E731086005D005D025408C6CF",

"8484845050620C300D500A9313E613E613012A2A5CC4BF",

"838383757C7CF18F02192653070D03180318080101BE6F",

"8584845557570F090E830F4309E5080108012A2A2AC6DF",

"85858453536B07D608B3124C102A102A1026010101C61F",

"83838384848411A926791C162048204820484D4444C3BF"], dtype=object)

These are concatenated hex values that I need to slice in order to convert to integers and then apply conversion factors. I want an array such as:

`[83,83,83,84,84,84,83,85,85,83]`

Which would be the equivalent of

`x[:,0:2]`

`(10,)`

Answer

If you're just after the first two characters from each hex value, one option is to recast your array to a `dtype`

of `'|S2'`

:

```
>>> x.astype('|S2')
array(['83', '83', '83', '84', '84', '84', '83', '85', '85', '83'],
dtype='|S2')
```

This idea can be generalised to return the first `n`

characters from each string.

Arbitrary slicing of string arrays is much more difficult to do in NumPy. Answers on this Stack Overflow page explain why it isn't the best tool for strings but show what can be possible.

Alternatively, the Pandas library facilitates fast vectorized operations (being built on top of NumPy). It has a number of very useful string operations which makes slicing a whole lot simpler than plain NumPy:

```
>>> import pandas as pd
>>> s = pd.Series(x)
>>> s.str.slice(2, 9)
0 8383747
1 83835F6
2 8383848
3 84835C5
4 8484787
5 8484505
6 8383757
7 8484555
8 8584535
9 8383848
dtype: object
```