> Is there any easy (and sophisticated) way to divide the arff dataset into two parts using Weka explorer? One part should be 80% and another 20%.

>

> I want to divide the data into (a) part 1 and (b) part 2.. With part1, I will use the k fold CV to evaluate the performance estimates (i.e. RMSE values). Then I will evaluate the performance on the unseen data (part 2) and calculate their differences. The difference of performance estimates between part1 and part2 data is called the bias of the model.

Use the RemovePercentage filter:

https://weka.sourceforge.io/doc.dev/weka/filters/unsupervised/instance/RemovePercentage.htmlCheers, Peter

--

Peter Reutemann

Dept. of Computer Science

University of Waikato, NZ

+64 (7) 577-5304

http://www.cms.waikato.ac.nz/~fracpete/http://www.data-mining.co.nz/_______________________________________________

Wekalist mailing list --

[hidden email]
Send posts to

[hidden email]
To unsubscribe send an email to

[hidden email]
To subscribe, unsubscribe, etc., visit

https://list.waikato.ac.nz/postorius/lists/wekalist.list.waikato.ac.nzList etiquette:

http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html