Description
How are you running AnythingLLM?
Docker (local)
What happened?
Diagnosis: document parser failure in AnythingLLM. Only files with a .txt extension are embedded; files with a .docx, .doc, .xls, or .xlsx extension are not. AnythingLLM claims to support all of these formats (search result 1.1), but extracting text from a complex binary format such as DOCX or XLSX is handled by an internal parser within AnythingLLM. This parser often fails due to:
Missing Dependencies: The AnythingLLM Windows desktop app might be missing a necessary internal library to correctly read the Office format.
File Complexity: Complex formatting, images, or protected data within your specific Office files are causing the parser to crash or return empty text.
The text extraction step (before embedding) is failing locally, meaning there's no text to send to OpenAI.
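One way to check the "file complexity" theory independently of AnythingLLM is to test whether a given .docx yields any plain text at all. The sketch below is a minimal standalone check using only the Python standard library. It is not AnythingLLM's parser, and the function names `extract_docx_text` and `has_extractable_text` are hypothetical. It relies on the fact that a .docx file is a ZIP archive whose main body lives in `word/document.xml`, so it does not cover legacy .doc or .xls files.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by elements in word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_docx_text(path):
    """Return the plain text of a .docx file, one paragraph per line.

    `path` may be a filesystem path or a binary file-like object.
    (Hypothetical helper for illustration; not part of AnythingLLM.)
    """
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # Concatenate every text run inside this paragraph
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text:
            paragraphs.append(text)
    return "\n".join(paragraphs)


def has_extractable_text(path):
    """Return True if the file parses as .docx and contains non-blank text."""
    try:
        return bool(extract_docx_text(path).strip())
    except (zipfile.BadZipFile, KeyError, ET.ParseError):
        # Not a ZIP, missing word/document.xml, or malformed XML
        return False
```

If `has_extractable_text("report.docx")` returns False for a file that opens normally in Word, the file itself (for example, protected or image-only content) is a likely culprit. If it returns True, the problem is more likely on the AnythingLLM side.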
Are there known steps to reproduce?
No response