If tx_semaphore_get() returns TX_NO_INSTANCE, the immediate call to tx_thread_sleep() also returns TX_NO_INSTANCE after the sleep duration even though the sleep is called from thread context.
Describe the bug
When we call the tx_semaphore_get() API with wait_option value other than TX_WAIT_FOREVER and TX_NO_WAIT, if no instances of the specified counting semaphore is available, the API will fail with TX_NO_INSTANCE once the wait time has expired.
At this point, if we call the tx_thread_sleep() API it also would fail with the error code - TX_NO_INSTANCE.
Working scenarios:
- If we pass
TX_WAIT_FOREVERas thewait_optionargument to thetx_semaphore_get()API, once an instance of the semaphore is available, the API will return withTX_SUCCESS. Now if we call thetx_thread_sleep()API, the API also will returnTX_SUCCESS. - Similarly, if we pass
TX_NO_WAITas thewait_optionargument to thetx_semaphore_get()API, it will return immediately withTX_NO_INSTANCEif semaphore instance is not availale. Now if we call thetx_thread_sleep()API, the API will returnTX_SUCCESS.
Psuedo code to reproduce the issue
void test_thread()
{
while (1) {
/* do something */
rc_loc = tx_semaphore_get(sem_ptr, TX_WAIT_FOREVER);
printf("ret = %x", rc_loc); /* Prints - "ret = 0" => TX_SUCCESS - once semaphore is available */
rc_loc = tx_thread_sleep(100);
printf("ret = %x", rc_loc); /* Prints - "ret = 0" => TX_SUCCESS - after sleeping for 100 ticks*/
rc_loc = tx_semaphore_get(sem_ptr, TX_NO_WAIT);
printf("ret = %x", rc_loc); /* Prints - "ret = D" => TX_NO_INSTANCE - immediatedly when there is no semaphore */
rc_loc = tx_thread_sleep(100);
printf("ret = %x", rc_loc); /* Prints - "ret = 0" => TX_SUCCESS - after sleeping for 100 ticks*/
rc_loc = tx_semaphore_get(sem_ptr, 10);
printf("ret = %x", rc_loc); /* Prints - "ret = D" => TX_NO_INSTANCE - after waiting for 10 ticks */
rc_loc = tx_thread_sleep(100);
printf("ret = %x", rc_loc); /* Prints - "ret = D" => TX_NO_INSTANCE - after sleeping for 100 ticks*/
/* do something */
}
}
Environment and tool:
- Target - MIPS Interaptiv SMP CPU
- Eclipse ThreadX version - threadx-6.4.1_rel
- Compiler - GHS Multi compiler
Debugging observation:
- Whenever the sleep failed, we observed that the value of
thread_ptr->tx_thread_suspend_statuswasTX_NO_INSTANCEbefore the_tx_thread_sleep()is called. - In our build, TX_NOT_INTERRUPTABLE is defined. In that case we could see from the function definition of
_tx_thread_sleep()thatthread_ptr->tx_thread_suspend_statusis not being initialized toTX_SUCCESSbefore calling_tx_thread_system_ni_suspend()but in the case whereTX_NOT_INTERRUPTABLEis not defined,thread_ptr->tx_thread_suspend_statusis initialized toTX_SUCCESSbefore calling_tx_thread_system_suspend(). Is there a possibilty that the issue that we are facing is due this? - Please note that we have this issue only when the thread is put to suspended state because of the unavailability of the semaphore (when we specify a valid wait time). If the
wait_optionisTX_NO_WAIT, even if semaphore is not available, the thread is not put to sleep and thethread_ptr->tx_thread_suspend_statusis not updated by the call totx_semaphore_get().
The relevant porition of the _tx_thread_sleep() function definition is pasted below.
/* Set the state to suspended. */
thread_ptr -> tx_thread_state = TX_SLEEP;
#ifdef TX_NOT_INTERRUPTABLE
/* Call actual non-interruptable thread suspension routine. */
_tx_thread_system_ni_suspend(thread_ptr, timer_ticks);
/* Restore interrupts. */
TX_RESTORE
#else
/* Set the suspending flag. */
thread_ptr -> tx_thread_suspending = TX_TRUE;
/* Initialize the status to successful. */
thread_ptr -> tx_thread_suspend_status = TX_SUCCESS;
/* Setup the timeout period. */
thread_ptr -> tx_thread_timer.tx_timer_internal_remaining_ticks = timer_ticks;
/* Temporarily disable preemption. */
_tx_thread_preempt_disable++;
/* Restore interrupts. */
TX_RESTORE
/* Call actual thread suspension routine. */
_tx_thread_system_suspend(thread_ptr);
#endif
/* Return status to the caller. */
status = thread_ptr -> tx_thread_suspend_status;
}
}
/* Return completion status. */
return(status);
}
Temporary workaround
We have found a temporary workaround for the time being. If we initialize the thread_ptr->tx_thread_suspend_status to TX_SUCCESS and then call _tx_thread_sleep(), the API doesn't fail.
void test_thread()
{
while (1) {
/* do something */
rc_loc = tx_semaphore_get(sem_ptr, 10);
printf("ret = %x", rc_loc); /* Prints - "ret = D" => TX_NO_INSTANCE - after waiting for 10 ticks */
thread_ptr->tx_thread_suspend_status = TX_SUCCESS;
printf("ret = %x", rc_loc); /* Prints - "ret = 0" => TX_SUCCESS - */
rc_loc = tx_thread_sleep(100);
printf("ret = %x", rc_loc); /* Prints - "ret = 0" => TX_SUCCESS - after sleeping for 100 ticks*/
/* do something */
}
}
Expected behavior We expect the sleep to not fail if the prevoius call to tx_semaphore_get() is timed out.
Impact The issue is a show stopper. We have used the sleep API in many places. Although the issue seems to have stopped with the temporary workaround we can't mark the bug as fixed until we find the root cause and get the actual bugfix.