Analyzing Images #794

ausangshukla · 2024-09-27T06:48:14Z

Is your feature request related to a problem? Please describe.
I have a bunch of images such as passports, licenses, tax docs etc. I need to extract and validate the data that they have by asking the LLM questions such as is the Passport expired? Is the tax doc of the year 2024. These questions will be adhoc and input by the users, so cant use off the shelf OCR for it.

Describe the solution you'd like

Upload the image (ex Tax documents)
Ask the question is it valid for 2024?
What it the total tax paid?

Describe alternatives you've considered
I know this can be done from the UI of chat gpt-4, but I dont have any other options at the moment

Additional context
The questions are adhoc, but generally centered around validating and extracting facts from the image. And the documents are all images. It may already be doable with the assistants api, but an working example is required, as Im not able to make it work.

andreibondarev · 2024-09-30T19:53:18Z

@ausangshukla Yep, you'll be able to do that after this PR is merged.

andreibondarev · 2024-10-03T16:13:50Z

@ausangshukla Right now the Langchain::Assistant, when using OpenAI or MistralAI, supports sending image_url. Take a look at this example: https://gist.github.com/andreibondarev/b6f444194d0ee7ab7302a4d83184e53e. I'm imagining if you're uploading the same types of documents, you could define your own tool, like a PassportDataExtractor that would extract certain values, like { full_name:, expiration_date:, issue_date: }. What do you think?

andreibondarev · 2024-10-16T22:15:06Z

Closing this issue as it's duplicate with #416.

ausangshukla added the enhancement New feature or request label Sep 27, 2024

andreibondarev linked a pull request Oct 1, 2024 that will close this issue

Langchain::Assistant when using MistralAI accepts a message with image_url #803

Merged

andreibondarev closed this as completed Oct 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analyzing Images #794

Analyzing Images #794

ausangshukla commented Sep 27, 2024 •

edited

Loading

andreibondarev commented Sep 30, 2024

andreibondarev commented Oct 3, 2024

andreibondarev commented Oct 16, 2024

Analyzing Images #794

Analyzing Images #794

Comments

ausangshukla commented Sep 27, 2024 • edited Loading

andreibondarev commented Sep 30, 2024

andreibondarev commented Oct 3, 2024

andreibondarev commented Oct 16, 2024

ausangshukla commented Sep 27, 2024 •

edited

Loading