sachin saxena - 1 year ago 114

Python Question

I am trying to fit Blaze data object to scikit kmeans function.

`from blaze import *`

from sklearn.cluster import KMeans

data_numeric = Data('data.csv')

data_cluster = KMeans(n_clusters=5)

data_cluster.fit(data_numeric)

Data Sample:

`A B C`

1 32 34

5 57 92

89 67 21

Its throwing error :

I have been able to do it with Pandas Dataframe. Any way to feed blaze object to this function ?

Answer Source

I think you need to convert your pandas dataframe into an numpy array before you fit.

```
from blaze import *
import numpy
from sklearn.cluster import KMeans
data_numeric = numpy.array(data('data.csv'))
data_cluster = KMeans(n_clusters=5)
data_cluster.fit(data_numeric)
```