hastic-server icon indicating copy to clipboard operation
hastic-server copied to clipboard

Error: AssertionError('labeled list empty, skip fitting for ikzM3VqvqlSHwuoB',)

Open tuapuikia opened this issue 4 years ago • 8 comments

Received this error when try to save pattern drop label. I have positive and negative label when using count agg in Elasticsearch data source.

I'm able to save it when using average instead.

tuapuikia avatar Mar 08 '20 08:03 tuapuikia

It's hard to say what is going on here. Need to debug analytics. https://github.com/hastic/hastic-server/blob/fa9673e347c2d49e08691bb6eb76bf2e3b4a01f2/analytics/analytics/models/model.py#L159

jonyrock-back avatar Mar 08 '20 18:03 jonyrock-back

@VargBurz do you have ideas? Please explain that this asserts checks here

jonyrock-back avatar Mar 08 '20 18:03 jonyrock-back

Hi, @tuapuikia. This asserts checks if there are positive segments for learning. Analytics skips a segment if it's more than 10% empty. Please check analytics' debug logs. Is there any messages like that? segment {segment.from_index}-{segment.to_index} skip because of invalid data

VargBurz avatar Mar 10 '20 10:03 VargBurz

What does "empty" mean?

jonyrock-back avatar Mar 10 '20 11:03 jonyrock-back

It means that part of segment doesn't have values. For example, like on this screen from 16:11:30 to 16:12:45. image

VargBurz avatar Mar 10 '20 12:03 VargBurz

Hi @VargBurz ,

I have gap in my logs and it is normal for my data. Is it possible to accept or skip null / empty segment?

tuapuikia avatar Mar 12 '20 04:03 tuapuikia

@tuapuikia we have internal conversation about what UX should we provide. Skipping is what we doing now and this is the root of issue.

The problem is that I just don't know what to do with null values in data. Options are

  1. iterpretate as zeros/constant (or interpolate)
  2. adjust our detection algorithms to work with null

@tuapuikia what do you think we should do?

jonyrock avatar Mar 23 '20 08:03 jonyrock

@jonyrock

I would say option 2 is better. Because zero could be integer for some data. For example getting concurrent user count. Zero mean there are no user using the services.

tuapuikia avatar Mar 26 '20 07:03 tuapuikia