encog-java-core icon indicating copy to clipboard operation
encog-java-core copied to clipboard

Provide a random data set

Open jeffheaton opened this issue 11 years ago • 2 comments

Encog needs a random data set. Such a data set would enclose a regular data set. It would then return randomly selected elements from the enclosed data set. You would be able to set the number of elements that the random data set would have. For example, you might set the dataset to have 1000 elements. If this dataset enclosed a data set with 10,000 elements each iteration would select 1000 random elements from the 10,000 sized data set. This was inspired by this forum post:

http://www.heatonresearch.com/comment/reply/3128#comment-form

jeffheaton avatar Jun 08 '13 00:06 jeffheaton

Jeff if I understand correctly I think some of that is already in the EnsembleDataSetFactory class family (org.encog.ensemble.data.factories) - maybe it's an easy adaptation from there. I already have another one ready for random selection without resampling, which I will contribute back along with dropout. I hope that helps in any way.

On 8 June 2013 01:18, Jeff Heaton [email protected] wrote:

Encog needs a random data set. Such a data set would enclose a regular data set. It would then return randomly selected elements from the enclosed data set. You would be able to set the number of elements that the random data set would have. For example, you might set the dataset to have 1000 elements. If this dataset enclosed a data set with 10,000 elements each iteration would select 1000 random elements from the 10,000 sized data set. This was inspired by this forum post:

http://www.heatonresearch.com/comment/reply/3128#comment-form

— Reply to this email directly or view it on GitHubhttps://github.com/encog/encog-java-core/issues/140 .

Alan

nitbix avatar Jun 09 '13 22:06 nitbix

Stratified sampling could also be implemented - in order to help with classification of skewed classes.

PetrToman avatar Jul 17 '13 12:07 PetrToman