summary bugs report
The voice-to-text function is working properly, but there is a failure when the large model tries to summarize the content. Ollama is deployed on another host within the local area network, and both devices can access the network normally.
logs:
` {'load_data': '0.000', 'extract_feat': '0.014', 'forward': '2.224', 'batch_size': '1', 'rtf': '0.057'}, : 100%|██████████| 1/1 [00:02<00:00, 2.22s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:02<00:00, 2.22s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:02<00:00, 2.22s/it]
0%| | 0/51 [00:00<?, ?it/s]
100%|██████████| 51/51 [00:00<00:00, 67.14it/s]
{'load_data': '0.000', 'extract_feat': '0.040', 'forward': '0.760', 'batch_size': '1', 'rtf': '0.010'}, : 100%|██████████| 51/51 [00:00<00:00, 67.14it/s]
rtf_avg: 0.010: 100%|██████████| 51/51 [00:00<00:00, 67.14it/s]
rtf_avg: 0.010: 100%|██████████| 51/51 [00:00<00:00, 67.09it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 2.17s/it]
{'load_data': '0.000', 'extract_feat': '0.015', 'forward': '2.167', 'batch_size': '1', 'rtf': '0.054'}, : 100%|██████████| 1/1 [00:02<00:00, 2.17s/it]
rtf_avg: 0.054: 100%|██████████| 1/1 [00:02<00:00, 2.17s/it]
rtf_avg: 0.054: 100%|██████████| 1/1 [00:02<00:00, 2.17s/it]
0%| | 0/53 [00:00<?, ?it/s]
100%|██████████| 53/53 [00:00<00:00, 62.22it/s]
{'load_data': '0.000', 'extract_feat': '0.041', 'forward': '0.852', 'batch_size': '1', 'rtf': '0.011'}, : 100%|██████████| 53/53 [00:00<00:00, 62.22it/s]
rtf_avg: 0.011: 100%|██████████| 53/53 [00:00<00:00, 62.22it/s]
rtf_avg: 0.011: 100%|██████████| 53/53 [00:00<00:00, 62.18it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 2.34s/it]
{'load_data': '0.000', 'extract_feat': '0.016', 'forward': '2.337', 'batch_size': '1', 'rtf': '0.057'}, : 100%|██████████| 1/1 [00:02<00:00, 2.34s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:02<00:00, 2.34s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:02<00:00, 2.34s/it]
0%| | 0/54 [00:00<?, ?it/s]
100%|██████████| 54/54 [00:00<00:00, 59.92it/s]
{'load_data': '0.000', 'extract_feat': '0.046', 'forward': '0.901', 'batch_size': '1', 'rtf': '0.011'}, : 100%|██████████| 54/54 [00:00<00:00, 59.92it/s]
rtf_avg: 0.011: 100%|██████████| 54/54 [00:00<00:00, 59.92it/s]
rtf_avg: 0.011: 100%|██████████| 54/54 [00:00<00:00, 59.88it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 2.40s/it]
{'load_data': '0.000', 'extract_feat': '0.015', 'forward': '2.399', 'batch_size': '1', 'rtf': '0.055'}, : 100%|██████████| 1/1 [00:02<00:00, 2.40s/it]
rtf_avg: 0.055: 100%|██████████| 1/1 [00:02<00:00, 2.40s/it]
rtf_avg: 0.055: 100%|██████████| 1/1 [00:02<00:00, 2.40s/it]
0%| | 0/58 [00:00<?, ?it/s]
100%|██████████| 58/58 [00:01<00:00, 54.84it/s]
{'load_data': '0.000', 'extract_feat': '0.046', 'forward': '1.058', 'batch_size': '1', 'rtf': '0.012'}, : 100%|██████████| 58/58 [00:01<00:00, 54.84it/s]
rtf_avg: 0.012: 100%|██████████| 58/58 [00:01<00:00, 54.84it/s]
rtf_avg: 0.012: 100%|██████████| 58/58 [00:01<00:00, 54.80it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 2.48s/it]
{'load_data': '0.000', 'extract_feat': '0.014', 'forward': '2.485', 'batch_size': '1', 'rtf': '0.056'}, : 100%|██████████| 1/1 [00:02<00:00, 2.48s/it]
rtf_avg: 0.056: 100%|██████████| 1/1 [00:02<00:00, 2.48s/it]
rtf_avg: 0.056: 100%|██████████| 1/1 [00:02<00:00, 2.49s/it]
0%| | 0/58 [00:00<?, ?it/s]
100%|██████████| 58/58 [00:00<00:00, 66.81it/s]
{'load_data': '0.000', 'extract_feat': '0.050', 'forward': '0.868', 'batch_size': '1', 'rtf': '0.010'}, : 100%|██████████| 58/58 [00:00<00:00, 66.81it/s]
rtf_avg: 0.010: 100%|██████████| 58/58 [00:00<00:00, 66.81it/s]
rtf_avg: 0.010: 100%|██████████| 58/58 [00:00<00:00, 66.76it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:03<00:00, 3.44s/it]
{'load_data': '0.000', 'extract_feat': '0.021', 'forward': '3.442', 'batch_size': '1', 'rtf': '0.057'}, : 100%|██████████| 1/1 [00:03<00:00, 3.44s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:03<00:00, 3.44s/it]
rtf_avg: 0.057: 100%|██████████| 1/1 [00:03<00:00, 3.44s/it]
0%| | 0/80 [00:00<?, ?it/s]
100%|██████████| 80/80 [00:01<00:00, 57.43it/s]
{'load_data': '0.000', 'extract_feat': '0.063', 'forward': '1.393', 'batch_size': '1', 'rtf': '0.012'}, : 100%|██████████| 80/80 [00:01<00:00, 57.43it/s]
rtf_avg: 0.012: 100%|██████████| 80/80 [00:01<00:00, 57.43it/s]
rtf_avg: 0.012: 100%|██████████| 80/80 [00:01<00:00, 57.40it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:03<00:00, 3.23s/it]
{'load_data': '0.000', 'extract_feat': '0.025', 'forward': '3.234', 'batch_size': '1', 'rtf': '0.054'}, : 100%|██████████| 1/1 [00:03<00:00, 3.23s/it]
rtf_avg: 0.054: 100%|██████████| 1/1 [00:03<00:00, 3.23s/it]
rtf_avg: 0.054: 100%|██████████| 1/1 [00:03<00:00, 3.23s/it]
0%| | 0/80 [00:00<?, ?it/s]
100%|██████████| 80/80 [00:01<00:00, 78.22it/s]
{'load_data': '0.000', 'extract_feat': '0.068', 'forward': '1.126', 'batch_size': '1', 'rtf': '0.009'}, : 100%|██████████| 80/80 [00:01<00:00, 78.22it/s]
rtf_avg: 0.009: 100%|██████████| 80/80 [00:01<00:00, 78.22it/s]
rtf_avg: 0.009: 100%|██████████| 80/80 [00:01<00:00, 78.18it/s]
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 3.00s/it]
{'load_data': '0.000', 'extract_feat': '0.021', 'forward': '2.998', 'batch_size': '1', 'rtf': '0.050'}, : 100%|██████████| 1/1 [00:02<00:00, 3.00s/it]
rtf_avg: 0.050: 100%|██████████| 1/1 [00:02<00:00, 3.00s/it]
rtf_avg: 0.050: 100%|██████████| 1/1 [00:02<00:00, 3.00s/it]
0%| | 0/80 [00:00<?, ?it/s]
100%|██████████| 80/80 [00:01<00:00, 64.98it/s]
{'load_data': '0.000', 'extract_feat': '0.067', 'forward': '1.231', 'batch_size': '1', 'rtf': '0.010'}, : 100%|██████████| 80/80 [00:01<00:00, 64.98it/s]
rtf_avg: 0.010: 100%|██████████| 80/80 [00:01<00:00, 64.98it/s]
rtf_avg: 0.010: 100%|██████████| 80/80 [00:01<00:00, 64.93it/s]
2025-04-10 14:08:36 - Translation file for zh-CN not found. Using default translation en-US.
0%| | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:02<00:00, 2.81s/it]
{'load_data': 0.0, 'extract_feat': 0.0, 'forward': '2.814', 'batch_size': '1', 'rtf': '-2.814'}, : 100%|██████████| 1/1 [00:02<00:00, 2.81s/it]
rtf_avg: -2.814: 100%|██████████| 1/1 [00:02<00:00, 2.81s/it]
rtf_avg: -2.814: 100%|██████████| 1/1 [00:02<00:00, 2.81s/it]
100%|██████████| 1/1 [01:33<00:00, 93.02s/it]
rtf_avg: 0.091, time_speech: 1015.360, time_escape: 92.350: 100%|██████████| 1/1 [01:33<00:00, 93.02s/it]
rtf_avg: 0.091, time_speech: 1015.360, time_escape: 92.350: 100%|██████████| 1/1 [01:33<00:00, 93.02s/it]
2025-04-10 14:10:19 - HTTP Request: POST http://192.168.1.22:11434/v1/chat/completions "HTTP/1.1 200 OK"
2025-04-10 14:10:19 - 1 change detected
2025-04-10 14:10:20 - Translation file for zh-CN not found. Using default translation en-US.
2025-04-10 14:10:47 - Translation file for zh-CN not found. Using default translation en-US.
2025-04-10 14:14:50 - Translation file for zh-CN not found. Using default translation en-US.
2025-04-10 14:16:00 - Translation file for zh-CN not found. Using default translation en-US.
2025-04-10 14:17:06 - An error occurred: (sqlalchemy.dialects.postgresql.asyncpg.ProgrammingError) <class 'asyncpg.exceptions.UndefinedColumnError'>: column "threadId" of relation "feedbacks" does not exist
[SQL:
INSERT INTO feedbacks ("forId", "value", "threadId", "id", "comment")
VALUES ($1, $2, $3, $4, $5)
ON CONFLICT (id) DO UPDATE
SET "forId" = $1, "value" = $2, "threadId" = $3, "comment" = $5;
]
[parameters: ('7ab4f0a0-206c-419e-8f29-871106743595', 0, 'b3c68554-5669-4f9b-89e8-97a558c9f25b', '0ae69c33-2eb3-468b-bf53-359683ac0dfd', 'task unfinished. task failed.')]
(Background on this error at: https://sqlalche.me/e/20/f405)
2025-04-10 14:18:24 - Translation file for zh-CN not found. Using default translation en-US.
`