awx-operator icon indicating copy to clipboard operation
awx-operator copied to clipboard

Jobs failed with "The running ansible process received a shutdown signal."

Open matiasperkins opened this issue 9 months ago • 3 comments

Please confirm the following

  • [X] I agree to follow this project's code of conduct.
  • [X] I have checked the current issues for duplicates.
  • [X] I understand that the AWX Operator is open source software provided for free and that I might not receive a timely response.

Bug Summary

Hi! Hope you are doing good.

We have a problem on a template that is doing a backup for a DB. When it is doing the backup it suddenly failed with the error "The running ansible process received a shutdown signal.".

The backup was running without any problem some days ago when it was smaller.

We are ruuning AWX on a K8s cluster with version v1.23.8+vmware.3

AWX Operator version

1.0.0

AWX version

21.8.0

Kubernetes platform

kubernetes

Kubernetes/Platform version

v1.23.8+vmware.3

Modifications

no

Steps to reproduce

the task that is crashing is:

  • name: Dump database to a tmp file community.postgresql.postgresql_db: name: "{{ item }}" state: dump target: "/tmp/{{ item }}-{{ tsGlobal }}.sql" login_user: "{{ db_user }}" login_password: "{{ db_password }}" login_host: "{{ db_host }}" port: "{{ postgres_port }}" loop: '{{db_name}}' ignore_errors: yes register: dbDump

  • name: archive file backup archive: path: "/tmp/{{ item }}-{{ tsGlobal }}.sql" dest: "/tmp/{{ item }}-{{ tsGlobal }}.sql.gz" format: gz loop: '{{db_name}}'

Expected results

we expected that the backup finish without problems

Actual results

it crash then it is executing with error: "The running ansible process received a shutdown signal."

Additional information

No response

Operator Logs

No response

matiasperkins avatar May 02 '24 14:05 matiasperkins

Hey, what is failing, the job/task or the pod?

If its the job then the issue would be more appropriate for the AWX repo

YaronL16 avatar May 05 '24 14:05 YaronL16

Hi! On the output of the job I only see that the job end, but on the error of AWX I see that error. I think is more a problema of the pod, something with the resource o something like that

matiasperkins avatar May 05 '24 19:05 matiasperkins

Same behavior on version 2.17.0, longer tasks fails.

Peter1295 avatar Jun 03 '24 06:06 Peter1295