
Use iterative machine mixin in more algorithms

Open · shubham808 opened this issue 6 years ago • 14 comments

This is a very good issue for those who want to get involved in the "inside the black box II" project.

In order to make algorithms stoppable, we need to execute each iteration and then save the state in member variables. If a user decides to stop the process at any iteration, control is returned to them. This means the user is free to continue training, serialize, test, etc. on the pre-trained model. See Perceptron and NewtonSVM as examples. We want all iterative algorithms to use this mixin.
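To make that contract concrete, here is a minimal standalone sketch of a train() driver that calls init_model() once and then iteration() repeatedly, so a user can stop at any point and resume later. The class and member names below are made up for illustration; this is not shogun's actual IterativeMachine API.

```cpp
#include <atomic>

// Toy stand-in for the iterative machine mixin (illustrative only).
class ToyIterativeMachine
{
public:
    virtual ~ToyIterativeMachine() = default;

    // Runs until max iterations or until someone sets `cancel`.
    // Because all state lives in member variables, calling train()
    // again simply continues from the last completed iteration.
    void train()
    {
        if (!m_initialized)
        {
            init_model();
            m_initialized = true;
        }
        while (m_current_iteration < m_max_iterations && !cancel.load())
        {
            iteration();
            ++m_current_iteration;
        }
    }

    std::atomic<bool> cancel{false};

protected:
    virtual void init_model() = 0; // one-time setup (what used to sit above the loop)
    virtual void iteration() = 0;  // exactly one pass of the old training loop

    bool m_initialized = false;
    int m_current_iteration = 0;
    int m_max_iterations = 1000;
};
```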

First Steps

  • Identify the initialization and iteration phases (what happens before and inside the training loop) of the algorithm
  • The initialization part (mostly the code above the training loop) goes into the init_model() method
  • The iteration function implements a single iteration of the algorithm, so move the contents of the training loop into the iteration function
  • Create new member variables if required to share data between init_model and iteration
  • Delete local copies of variables that represent state (like weights and bias)
  • Finally, write tests to check that everything is consistent :) see the NewtonSVM and Perceptron unit tests

Initially we want this done for a single Linear Machine algorithm (CAveragedPerceptron is a good place to start). See the iterative machine guide for more information.
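Applied to a perceptron-style linear machine, the refactoring could look roughly like the sketch below, building on the ToyIterativeMachine base from the earlier sketch. The update rule and all names are simplified stand-ins, not CAveragedPerceptron's actual code.

```cpp
#include <vector>

// Assumes the ToyIterativeMachine base from the previous sketch is in scope.
class ToyPerceptron : public ToyIterativeMachine
{
protected:
    void init_model() override
    {
        // What used to sit above the training loop: allocate and zero the state
        // (assumes m_features and m_labels were set before training).
        m_weights.assign(m_features[0].size(), 0.0);
        m_bias = 0.0;
    }

    void iteration() override
    {
        // The body of the old training loop: one full pass over the data.
        for (size_t i = 0; i < m_features.size(); ++i)
        {
            double prediction = m_bias;
            for (size_t j = 0; j < m_weights.size(); ++j)
                prediction += m_weights[j] * m_features[i][j];

            if (prediction * m_labels[i] <= 0) // mistake-driven update
            {
                for (size_t j = 0; j < m_weights.size(); ++j)
                    m_weights[j] += m_labels[i] * m_features[i][j];
                m_bias += m_labels[i];
            }
        }
    }

    // Former locals promoted to members so state survives between iterations
    // and can be serialized or used for prediction after an early stop.
    std::vector<std::vector<double>> m_features;
    std::vector<double> m_labels;
    std::vector<double> m_weights;
    double m_bias = 0.0;
};
```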

shubham808 avatar Jan 25 '19 17:01 shubham808

Hi @shubham808, I am new here. I read the wiki for the inside the black box II project and I am really interested in working on it. This issue would be a great introduction to the project, so can I work on it?

souvik3333 avatar Feb 03 '19 18:02 souvik3333

Yes go ahead :)

shubham808 avatar Feb 04 '19 08:02 shubham808

I am new to open source and I would like to work on this issue.

han0305 avatar Mar 09 '19 08:03 han0305

Is this issue closed as per the merge on Mar 2?

samdbrice avatar Jul 09 '19 16:07 samdbrice

Nope, still lots of iterative algorithms/machines to port

karlnapf avatar Jul 09 '19 18:07 karlnapf

Ah ok, so this is akin to a Milestone/Sticky issue, not a one-off item. This is indeed a good first issue; I'll see if I can implement one.

samdbrice avatar Jul 09 '19 21:07 samdbrice

Just checked a few machines that are iterative and realised that most of the low-hanging fruit has been taken. There are quite a few algorithms left to port, but the changes will be slightly more nontrivial. Everything that is based on (stochastic) gradient descent is worth looking into here.
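For the gradient-descent based machines the split is the same in spirit: init_model() sets up the parameters, and iteration() performs one (stochastic) gradient step, so the model is usable wherever the user stops. A hypothetical free-function sketch of just that split; compute_gradient() is a placeholder for whatever loss the concrete machine minimizes.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical state for an SGD-style machine (illustrative names only).
struct ToySGDState
{
    std::vector<double> weights;
    double learning_rate = 0.01;
};

// Placeholder: a real machine would differentiate its own loss here.
std::vector<double> compute_gradient(const std::vector<double>& weights)
{
    return std::vector<double>(weights.size(), 0.0);
}

void init_model(ToySGDState& state, std::size_t num_features)
{
    state.weights.assign(num_features, 0.0);
}

void iteration(ToySGDState& state)
{
    // One gradient step per iteration; stopping here leaves a valid model.
    const std::vector<double> grad = compute_gradient(state.weights);
    for (std::size_t j = 0; j < state.weights.size(); ++j)
        state.weights[j] -= state.learning_rate * grad[j];
}
```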

karlnapf avatar Jul 09 '19 22:07 karlnapf

I would like to contribute. Is there any work left to be done in this issue?

progmatic-99 avatar Oct 26 '19 15:10 progmatic-99

Hi. If this is still open, can I take this one up?

ashutosh-b-b avatar Jan 17 '20 04:01 ashutosh-b-b

Sure, go for it. Thanks!

sbrice avatar Jan 17 '20 16:01 sbrice

@samdbrice I did start reading the blog and I understand it. Which algorithm should I go with, given that I am a beginner here?

ashutosh-b-b avatar Jan 22 '20 20:01 ashutosh-b-b

I was thinking of starting with Brute KNN. Can anyone guide me?

ashutosh-b-b avatar Jan 23 '20 06:01 ashutosh-b-b

@ashutosh-b-b you could start with that or with https://github.com/shogun-toolbox/shogun/blob/develop/src/shogun/metric/LMNN.cpp. Basically the idea is that you follow the CRTP pattern with IterativeMachine and move the per-iteration logic into the iteration function.
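Roughly, the shape being described is a mixin templated on the parent machine type, where the template implements the training loop and a port only fills in the two hooks. Here is a standalone toy illustration of that shape; the names are made up and the real IterativeMachine template and its signatures in the shogun source tree differ.

```cpp
#include <cstdio>

// Stands in for the machine type being extended (e.g. a linear machine).
class ToyBaseMachine
{
public:
    virtual ~ToyBaseMachine() = default;
};

// The mixin: templated on the base machine so the training-loop logic can be
// layered on top of any machine type.
template <class T>
class ToyIterativeMixin : public T
{
public:
    void train(int max_iterations)
    {
        init_model();
        for (int i = 0; i < max_iterations; ++i)
            iteration(); // a real mixin would also check a user cancel flag here
    }

protected:
    virtual void init_model() = 0;
    virtual void iteration() = 0;
};

// A concrete port only fills in the two hooks; everything that used to be a
// local in the old training loop becomes member state.
class ToyPortedMachine : public ToyIterativeMixin<ToyBaseMachine>
{
protected:
    void init_model() override { std::puts("set up model state"); }
    void iteration() override { std::puts("run one training iteration"); }
};
```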

vigsterkr avatar Jan 23 '20 09:01 vigsterkr

Thanks!

ashutosh-b-b avatar Jan 24 '20 03:01 ashutosh-b-b