kaldi icon indicating copy to clipboard operation
kaldi copied to clipboard

WIP : Changes to allow multiple ivectors per chunk

Open freewym opened this issue 9 years ago • 56 comments

freewym avatar Jan 23 '16 20:01 freewym

I would recommend assuming ivector_period is 10 as default and modify all scripts and binaries according to that.

vijayaditya avatar Jan 23 '16 21:01 vijayaditya

I will do another round of review after these changes have been made.

vijayaditya avatar Jan 23 '16 21:01 vijayaditya

Thanks! Done.

Yiming

freewym avatar Jan 24 '16 00:01 freewym

I am a bit busy this week and I don't want to delay this PR review, as it can be used to further improve chain results. @vimalmanohar or @pegahgh could you please help with the review ?

@freewym Would you please add to nnet3-chain-get-egs the changes you made to nnet3-get-egs ? If yes, these commits should be submitted to chain branch. Assume the current PR is going to be accepted to master and then merged to chain branch. So just make additional changes necessary.

vijayaditya avatar Jan 24 '16 23:01 vijayaditya

Sure.

freewym avatar Jan 24 '16 23:01 freewym

I am done with the review. It looks good for commit.

vimalmanohar avatar Jan 28 '16 19:01 vimalmanohar

@vijayaditya, as you can see I want a lot of changes, and some of them (e.g. involving descriptors) are not very trivial, so you might need to help Yiming a bit. There needs to be a lot more clarity in the decodable object. It seems Yiming was trying to make this change without making any structural changes to the code, and the result was very unclear.

danpovey avatar Jan 29 '16 00:01 danpovey

Also please make it so the function AmNnetSimple::Info() prints out the ivector-interval.

danpovey avatar Jan 29 '16 00:01 danpovey

OK, I will go through the required changes.

--Vijay

On Thu, Jan 28, 2016 at 4:50 PM, Daniel Povey [email protected] wrote:

Also please make it so the function AmNnetSimple::Info() prints out the ivector-interval.

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-176500779.

vijayaditya avatar Jan 29 '16 00:01 vijayaditya

Guys, is this pull request moving forward?

danpovey avatar Feb 06 '16 05:02 danpovey

I have to do the review. I will take care of it over the weekend.

--Vijay

On Sat, Feb 6, 2016 at 12:02 AM, Daniel Povey [email protected] wrote:

Guys, is this pull request moving forward?

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-180682037.

vijayaditya avatar Feb 06 '16 05:02 vijayaditya

I have modified decodable code, and will further make changes according to Vijay's later review.

Yiming

On Saturday, February 6, 2016, Vijayaditya Peddinti < [email protected]> wrote:

I have to do the review. I will take care of it over the weekend.

--Vijay

On Sat, Feb 6, 2016 at 12:02 AM, Daniel Povey <[email protected] javascript:_e(%7B%7D,'cvml','[email protected]');> wrote:

Guys, is this pull request moving forward?

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-180682037.

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-180683271.

Sent from my iPhone

freewym avatar Feb 06 '16 05:02 freewym

@vijayaditya, what's the status of this? I'd like to get this finished, as it could improve results.

danpovey avatar Feb 24 '16 01:02 danpovey

Will be done this week.

On Tue, Feb 23, 2016 at 8:35 PM, Daniel Povey [email protected] wrote:

@vijayaditya https://github.com/vijayaditya, what's the status of this? I'd like to get this finished, as it could improve results.

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-188002289.

vijayaditya avatar Feb 24 '16 01:02 vijayaditya

Just an update: I have started this review, should be done by end of day.

vijayaditya avatar Feb 29 '16 05:02 vijayaditya

vijay, did you finish this review? I'd like to be able to merge this before we run our final experiments for the chain models- it should help.

danpovey avatar Mar 05 '16 05:03 danpovey

I was able to start the review yesterday. I am going through Yiming's suggested changes (offline) before I give my suggestions.

On Sat, Mar 5, 2016 at 12:23 AM, Daniel Povey [email protected] wrote:

vijay, did you finish this review? I'd like to be able to merge this before we run our final experiments for the chain models- it should help.

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-192580463.

vijayaditya avatar Mar 05 '16 05:03 vijayaditya

@vijayaditya Changes have been made.

Yiming

freewym avatar Mar 10 '16 03:03 freewym

@freewym could you resolve the conflicts ? An easy way to do this would be to rebase your branch with the master.

vijayaditya avatar Mar 10 '16 19:03 vijayaditya

Will do it later today

freewym avatar Mar 10 '16 19:03 freewym

@freewym what is the status of this PR? Is it ready for review ?

vijayaditya avatar Mar 14 '16 10:03 vijayaditya

yes

On Monday, March 14, 2016, Vijayaditya Peddinti [email protected] wrote:

@freewym https://github.com/freewym what is the status of this PR? Is it ready for review ?

— Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-196245575.

Sent from my iPhone

freewym avatar Mar 14 '16 12:03 freewym

I am stopping the review. Will resume after you make required changes or justify why these changes are not necessary.

vijayaditya avatar Mar 14 '16 20:03 vijayaditya

@danpovey could you please comment on the implementation of the function GetInputInterval() in nnet-utils.cc?

Yiming

freewym avatar Mar 15 '16 23:03 freewym

Moved the comments in the braces. @vijayaditya I am done with the changes.

Thanks, Yiming

freewym avatar Mar 16 '16 00:03 freewym

OK will review in few hours.

On Tue, Mar 15, 2016 at 8:19 PM, Yiming Wang [email protected] wrote:

Moved the comments in the braces. @vijayaditya https://github.com/vijayaditya I am done with the changes.

Thanks, Yiming

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-197079424

vijayaditya avatar Mar 16 '16 00:03 vijayaditya

after code review we need to test it a bit before merge.

On Tue, Mar 15, 2016 at 8:20 PM, Vijayaditya Peddinti < [email protected]> wrote:

OK will review in few hours.

On Tue, Mar 15, 2016 at 8:19 PM, Yiming Wang [email protected] wrote:

Moved the comments in the braces. @vijayaditya https://github.com/vijayaditya I am done with the changes.

Thanks, Yiming

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-197079424

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-197079837

danpovey avatar Mar 16 '16 02:03 danpovey

@freewym I might not be able to review this till Saturday. In the mean time could you please add the unit tests for the decodable object and egs creation code with various descriptors combinations. (This would be the same thing you discussed offline today evening). Please try to keep these unit tests as random as possible, its up to you to decide the level of randomness.

vijayaditya avatar Mar 18 '16 03:03 vijayaditya

I was able to perform a cursory check. I think you should start an experiment and compare the performance with baseline (one ivector/chunk). You can create a new script based on swbd/s5c/local/chain/run_tdnn_6z.sh. You can make that script a part of this PR.

@danpovey I am assuming that we would not have time to repeat all the fshr+swbd experiments for the paper, with multiple ivectors. So I think getting _6z results on swbd, with multiple ivectors, should be sufficient for the immediate needs of the paper.

vijayaditya avatar Mar 21 '16 19:03 vijayaditya

On Mon, Mar 21, 2016 at 3:00 PM, Vijayaditya Peddinti < [email protected]> wrote:

I was able to perform a cursory check. I think you should start an experiment and compare the performance with baseline (one ivector/chunk). You can create a new script based on swbd/s5c/local/chain/run_tdnn_6z.sh. You can make that script a part of this PR.

@danpovey https://github.com/danpovey I am assuming that we would not have time to repeat all the fshr+swbd experiments for the paper, with multiple ivectors. So I think getting _6z results on swbd, with multiple ivectors, should be sufficient for the immediate needs of the paper.

  • agreed.

— You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub https://github.com/kaldi-asr/kaldi/pull/468#issuecomment-199425536

danpovey avatar Mar 21 '16 19:03 danpovey