AlbertoD AlbertoD - 1 year ago 210
Java Question

Weka - load UTF-8 encoded csv

Is there a way in Weka 3.7.13 to load UTF-8 encoded files without converting them to ANSII?

I am trying to load a csv file containing a string attribute, whose value can contain emoticons, and I need not to lose them.

Answer Source

It is very possible to do this. See this link, it describes how to do this from command line or GUI.

Add this parameter if using the command line -Dfile.encoding=utf-8.

If using the GUI then edit the RunWEKA.ini file. Change the fileEncoding placeholder to utf-8.

Recommended from our users: Dynamic Network Monitoring from WhatsUp Gold from IPSwitch. Free Download