encog-java-core
encog-java-core copied to clipboard
Provide a random data set
Encog needs a random data set. Such a data set would enclose a regular data set. It would then return randomly selected elements from the enclosed data set. You would be able to set the number of elements that the random data set would have. For example, you might set the dataset to have 1000 elements. If this dataset enclosed a data set with 10,000 elements each iteration would select 1000 random elements from the 10,000 sized data set. This was inspired by this forum post:
http://www.heatonresearch.com/comment/reply/3128#comment-form
Jeff if I understand correctly I think some of that is already in the EnsembleDataSetFactory class family (org.encog.ensemble.data.factories) - maybe it's an easy adaptation from there. I already have another one ready for random selection without resampling, which I will contribute back along with dropout. I hope that helps in any way.
On 8 June 2013 01:18, Jeff Heaton [email protected] wrote:
Encog needs a random data set. Such a data set would enclose a regular data set. It would then return randomly selected elements from the enclosed data set. You would be able to set the number of elements that the random data set would have. For example, you might set the dataset to have 1000 elements. If this dataset enclosed a data set with 10,000 elements each iteration would select 1000 random elements from the 10,000 sized data set. This was inspired by this forum post:
http://www.heatonresearch.com/comment/reply/3128#comment-form
— Reply to this email directly or view it on GitHubhttps://github.com/encog/encog-java-core/issues/140 .
Alan
Stratified sampling could also be implemented - in order to help with classification of skewed classes.