bug: Document analysis failed after update to 0.4.7

Open chenshaoju opened this issue 11 months ago • 19 comments

Describe the bug Sorry for my English. After upgrading to version 0.4.7, the document parsing function may not work.

Steps to reproduce Steps to reproduce the behavior:

  1. Enable Experimental mode in Advanced Settings.
  2. Start a new Thread.
  3. Upload a PDF document and add the message "Please summarize the document's content."
  4. The assistant replies with something like "I don't know the context or the document to provide a summary. Please provide more information."

Expected behavior Get a summary of the document's content.

Screenshots Snipaste_2024-02-27_20-09-43

Environment details

  • Operating System: Microsoft Windows 10 22H2 19045.4046
  • Jan Version: 0.4.7
  • Processor: AMD Ryzen 5 5600X
  • RAM: 64GB
  • Any additional relevant hardware specifics: AMD Radeon RX 5500 XT / NVIDIA CMP 50HX

Logs app.log

Additional context If you need any more information, please feel free to ask. 👋

chenshaoju avatar Feb 27 '24 12:02 chenshaoju

You can retry with another model. However, in my test case, I used OpenChat with a larger file (2.43MB) and it still works.

OpenChat on Tinyllama paper: image

Openhermes on Tinyllama paper: image

hahuyhoang411 avatar Feb 28 '24 04:02 hahuyhoang411

Thanks for the reply.

I tried it again: image

I also changed the language of the question (to Simplified Chinese) and the model: image

After rolling back to 0.4.6, everything works fine: image

Simplified Chinese with the same model as before: image

If you need any more information, please feel free to ask.👋

chenshaoju avatar Feb 28 '24 05:02 chenshaoju

hi @chenshaoju, can you give us the settings.json in jan\settings? thank you

Van-QA avatar Feb 28 '24 07:02 Van-QA

@Van-QA Here you are, thanks for the reply.

Version 0.4.6: settings_0.4.6.json

Version 0.4.7: settings_0.4.7.json

Here are screenshots of all my settings in the UI: image

chenshaoju avatar Feb 28 '24 07:02 chenshaoju

@chenshaoju thank you, can you give us the settings in the Tools section of your thread? image

Van-QA avatar Feb 28 '24 07:02 Van-QA

@Van-QA here you are: image

chenshaoju avatar Feb 28 '24 08:02 chenshaoju

@chenshaoju, nothing seems wrong so far. Can you give us the entire conversation (from jan\threads\)? Thank you for your time 🙏

Van-QA avatar Feb 28 '24 09:02 Van-QA

@Van-QA I'm very happy to help solve this issue.🤗

here is the conversation: jan_1709104282.zip

chenshaoju avatar Feb 28 '24 10:02 chenshaoju

I had a similar problem. I wanted to chat with books, so I sent a 13MB Chinese book. My graphics card is an RTX 3060 12G. GPU utilization is high, but the progress bar stays stuck without ever producing an answer, and clicking Stop at that point does not stop it. I am using the full default settings with Openchat-3.5 7B Q4 on Jan v0.4.7, Windows 11.

image image

msjsc001 avatar Feb 28 '24 15:02 msjsc001

@chenshaoju, can you also try our latest nightly build and see if the issue is resolved? https://github.com/janhq/jan?tab=readme-ov-file#download

Van-QA avatar Feb 29 '24 04:02 Van-QA

@Van-QA Thanks for the reply, but unfortunately, it's still not working.

Here are screenshots, threads, and logs: image

jan_1709185945.zip

app.zip

chenshaoju avatar Feb 29 '24 06:02 chenshaoju

The refresh button may lead to the same issue.

AI can recognize documents by:

- Sending the document along with the question directly.
or
- Clicking the edit button, and then clicking submit.

Image: 1


AI cannot recognize documents by:

- Clicking the refresh button to make the AI regenerate.

Image: 2

VaporHail avatar Mar 01 '24 06:03 VaporHail

@chenshaoju Could you please share the compressed version of the broken thread so we can reproduce, ensuring there is no sensitive personal information?

louis-jan avatar Mar 08 '24 03:03 louis-jan

@louis-jan Sure thing, here is the thread with the issue, from Jan version 0.4.7-304: jan_1709880132.zip

Here are the logs: app.zip

The PDF files in the thread are from https://www.linkedin.com/posts/activity-7167774533339299841-M5xc (a public paper with no sensitive information). 👏

chenshaoju avatar Mar 08 '24 07:03 chenshaoju

@chenshaoju I found the root cause of the issue: you deleted a model marked as default from the previous thread. As a result, new threads have a model of "*" in threads.json, thus, no files were ingested afterward. In the meantime, you can work around this by deleting older threads. Create a new thread and select a model from the drop-down list, then upload files. We will fix this issue as soon as possible in upcoming releases. cc @Van-QA
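
To check whether a local install is affected by the root cause described above, a minimal sketch like the following could scan the thread files for a model of "*". It is not taken from the Jan codebase: the function names are hypothetical, and it assumes only that thread metadata is stored as JSON under the jan\threads folder and that the model is recorded under a key named "model" (either as a string or as an object with an "id" field). The actual file layout may differ between versions.

```python
import json
from pathlib import Path
from typing import Any


def has_wildcard_model(node: Any) -> bool:
    """Return True if any "model" entry in the JSON tree is "*" or has id "*".

    The key name "model" and the optional "id" field are assumptions about
    the thread file layout, not a documented schema.
    """
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "model":
                if value == "*":
                    return True
                if isinstance(value, dict) and value.get("id") == "*":
                    return True
            # Keep searching nested objects and arrays.
            if has_wildcard_model(value):
                return True
        return False
    if isinstance(node, list):
        return any(has_wildcard_model(item) for item in node)
    return False


def find_affected_files(threads_dir: Path) -> list[Path]:
    """List JSON files under threads_dir that contain a "*" model entry."""
    affected = []
    for path in sorted(threads_dir.rglob("*.json")):
        try:
            data = json.loads(path.read_text(encoding="utf-8"))
        except (OSError, ValueError):
            # Skip unreadable or malformed files instead of aborting the scan.
            continue
        if has_wildcard_model(data):
            affected.append(path)
    return affected
```

Calling `find_affected_files(Path("jan/threads"))` (adjust the path to your Jan data folder) returns the files that match. The search is deliberately schema-agnostic, so it reports any "model" of "*" it finds rather than relying on one exact structure.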

louis-jan avatar Mar 22 '24 05:03 louis-jan

@chenshaoju we resolved the issue as of Jan v0.4.9-343, would you be able to retry and see if the issue persists? 🙏

Van-QA avatar Mar 26 '24 08:03 Van-QA

@Van-QA Thanks for the reply. Version 0.4.9-343 doesn't seem to work properly: it's stuck on "Generation Response..." and shows no GPU usage.

image

here are the logs folder and threads folder: logs_threads.zip

chenshaoju avatar Mar 26 '24 14:03 chenshaoju

hi @chenshaoju, another issue came up, I will notify you again once it's resolved, thank you for your time 🙏

Van-QA avatar Mar 26 '24 14:03 Van-QA

hi @chenshaoju, the build Jan v0.4.9-345 was able to function properly with document analysis, can you try?

image

Van-QA avatar Mar 27 '24 08:03 Van-QA

Thanks. The version is 0.4.9-348 now, and it's working perfectly.

image

Thank you! 🎉

chenshaoju avatar Mar 28 '24 05:03 chenshaoju