How to use Oscar / VinVL for image-text retreival inference? #192

yonatanbitton · 2022-04-12T09:21:02Z

Hello.

I need to execute simple inference for Image-text retreival, I want a score for image and a caption, as presented here for ViLT.

I've installed the package, and running run_retreival.py

I'm trying to following the instructions in the model zoo. By the way, the checkpoint for the VinVL doesn't work, but I can use Oscar model checkpoint as well.

What should be the --eval_model_dir? You write "# could be base/large models.", I pointed it the the model downloaded from the model zoo.

However, when I ran it, I receive:
FileNotFoundError: [Errno 2] No such file or directory: 'datasets/coco_ir/test_captions.pt'

I found the coco_ir download here in the Download page, but it's 20GB and my network doesn't succeed downloading such file. Is it mandatory to run the simple inference?

Is there a simple way to receive a matching score given an image and a possible caption?

Thank you

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use Oscar / VinVL for image-text retreival inference? #192

How to use Oscar / VinVL for image-text retreival inference? #192

yonatanbitton commented Apr 12, 2022

How to use Oscar / VinVL for image-text retreival inference? #192

How to use Oscar / VinVL for image-text retreival inference? #192

Comments

yonatanbitton commented Apr 12, 2022