Some bugs

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Some bugs

Steffen Herbold
Dear Devs,

we reported a couple of bugs in the Jira, but we are not sure anymore if
that is actively used. So we just post them here as well:

https://jira.pentaho.com/browse/DATAMINING-779 - Tree-based classifiers
crash with data close to MaxDouble
https://jira.pentaho.com/browse/DATAMINING-780 - BayesNet crashes with
many starved categories
https://jira.pentaho.com/browse/DATAMINING-781 - Bug in SMO or
Standardization with values close to MaxDouble
https://jira.pentaho.com/browse/DATAMINING-782 - Number of KMeans
clusters in EM is not checked
https://jira.pentaho.com/browse/DATAMINING-783 - HierarchicalClustering
crashing with FilteredDistance for MaxDouble

The issues contain all details for reproduction.

Two questions for the future:
  - Should we report similar bugs or are these uninteresting corner
cases? I got another one, with a possibly preventable underflow in
SimpleKMeans.
  - Where should we report them?

Best,
Steffen

--
PD Dr. Steffen Herbold
Institute of Computer Science, University of Goettingen
Goldschmidtstraße 7, 37077 Göttingen, Germany
mailto. [hidden email]
tel. +49 551 39-172037
_______________________________________________
Wekalist mailing list -- [hidden email]
Send posts to [hidden email]
To unsubscribe send an email to [hidden email]
To subscribe, unsubscribe, etc., visit https://list.waikato.ac.nz/postorius/lists/wekalist.list.waikato.ac.nz
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html
Reply | Threaded
Open this post in threaded view
|

Re: Some bugs

Peter Reutemann
> we reported a couple of bugs in the Jira, but we are not sure anymore if
> that is actively used.

With Pentaho's sale to Hitachi and the subsequent departure of Mark
Hall from Pentaho, I doubt that this is still actively being used.

> So we just post them here as well:
>
> https://jira.pentaho.com/browse/DATAMINING-779 - Tree-based classifiers
> crash with data close to MaxDouble
> https://jira.pentaho.com/browse/DATAMINING-780 - BayesNet crashes with
> many starved categories
> https://jira.pentaho.com/browse/DATAMINING-781 - Bug in SMO or
> Standardization with values close to MaxDouble
> https://jira.pentaho.com/browse/DATAMINING-782 - Number of KMeans
> clusters in EM is not checked
> https://jira.pentaho.com/browse/DATAMINING-783 - HierarchicalClustering
> crashing with FilteredDistance for MaxDouble
>
> The issues contain all details for reproduction.
>
> Two questions for the future:
>   - Should we report similar bugs or are these uninteresting corner
> cases? I got another one, with a possibly preventable underflow in
> SimpleKMeans.

IMHO These edge cases are unlikely to happen for most users. However,
you could provide patches against the trunk branch of Weka's
subversion repo to fix the issues (assuming they are not detrimental
to the performance). Always happy to receive bug fixes. ;-)

>   - Where should we report them?

On this mailing list.

Cheers, Peter
--
Peter Reutemann
Dept. of Computer Science
University of Waikato, NZ
+64 (7) 577-5304
http://www.cms.waikato.ac.nz/~fracpete/
http://www.data-mining.co.nz/
_______________________________________________
Wekalist mailing list -- [hidden email]
Send posts to [hidden email]
To unsubscribe send an email to [hidden email]
To subscribe, unsubscribe, etc., visit https://list.waikato.ac.nz/postorius/lists/wekalist.list.waikato.ac.nz
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html