llama parse summarize the information of a paragraph in the pdf file #559

NegTech · 2024-12-19T09:38:47Z

Describe the bug
I'm not sure whether this is a bug, but I'm running into a problem parsing a PDF. I have some paragraphs that are 4 or 5 lines each, but when LlamaParse parses the file, the text of each paragraph is reduced to about 2 lines. The parsed text doesn't include the paragraphs' full information and concepts.
How can I change this? Is it possible to change?

Job ID
bbd6df67-b18d-45ec-a9b1-e71ac157f5f7

Client:

API
Notebook

Additional context
I'm using accurate mode and want to get Markdown output.

BinaryBrain · 2024-12-19T13:18:35Z

Hi @NegTech,
It seems we have an issue with one of the fonts. In the meantime, have you also tried running the job without any parsing instruction? It sometimes leads to better results.

NegTech · 2024-12-19T13:41:26Z

Hi @BinaryBrain,
I have; unfortunately, it didn't give me any better results -_-
What do you mean by an issue with one of the fonts?

BinaryBrain · 2024-12-20T14:31:55Z

One of the fonts in the file is weirdly encoded (this can happen with PDFs). We need to fix it.

NegTech added the bug ("Something isn't working") label Dec 19, 2024

BinaryBrain self-assigned this Dec 19, 2024
