Image to image agent #1628

anuragts · 2024-12-24T04:58:43Z

Description

Image to Image agent that uses Fal tools.

Type of change

Please check the options that are relevant:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Model update
Infrastructure change

Checklist

My code follows Phidata's style guidelines and best practices
I have performed a self-review of my code
I have added docstrings and comments for complex logic
My changes generate no new warnings or errors
I have added cookbook examples for my new addition (if needed)
I have updated requirements.txt/pyproject.toml (if needed)
I have verified my changes in a clean environment

Additional Notes

Include any deployment notes, performance implications, or other relevant information:

manthanguptaa · 2024-12-30T10:46:45Z

phi/tools/fal_tools.py

    ):
        super().__init__(name="fal")

        self.api_key = api_key or getenv("FAL_KEY")
        if not self.api_key:
            logger.error("FAL_KEY not set. Please set the FAL_KEY environment variable.")
        self.model = model
+        self.image_url = image_url


Why image_url isn't a param in image_to_image function?

Yeah I don't think it makes sense in the constructor. Rather give it to the main agent in the prompt.

manthanguptaa · 2024-12-30T10:51:14Z

phi/tools/fal_tools.py

+            media_id = str(uuid4())
+            agent.add_image(
+                Image(
+                    id=media_id,
+                    url=url,
+                )
+            )


why do this?

This is how we set the images on the agent

dirkbrnd · 2025-01-02T10:58:12Z

phi/tools/fal_tools.py

    ):
        super().__init__(name="fal")

        self.api_key = api_key or getenv("FAL_KEY")
        if not self.api_key:
            logger.error("FAL_KEY not set. Please set the FAL_KEY environment variable.")
        self.model = model
+        self.image_url = image_url


Yeah I don't think it makes sense in the constructor. Rather give it to the main agent in the prompt.

dirkbrnd · 2025-01-02T10:59:05Z

cookbook/agents/47_image_to_image.py

+    ],
+)
+
+agent.print_response("a cat dressed as a wizard with a background of a mystic forest", stream=True)


I think you should provide the URL in this prompt. Like a cat dressed as a wizard with a background of a mystic forest. Make it look like "https://fal.media/files/koala/Chls9L2ZnvuipUTEwlnJC.png"

dirkbrnd · 2025-01-02T11:02:15Z

phi/tools/fal_tools.py

+            media_id = str(uuid4())
+            agent.add_image(
+                Image(
+                    id=media_id,
+                    url=url,
+                )
+            )


This is how we set the images on the agent

dirkbrnd · 2025-01-02T11:02:59Z

phi/tools/fal_tools.py

+
+    def image_to_image(self, agent: Agent, prompt: str, image_url: Optional[str] = None) -> str:
+        """
+        Use this function to generate an image from a given image using the Fal AI API.


What do you mean "From a given image". Maybe give a more detailed explanation and a link to their docs please.

anuragts added 2 commits December 24, 2024 10:25

feat: image to image agent

b00921b

fix: remove print

928ecf1

anuragts mentioned this pull request Dec 24, 2024

Image to image generation phidatahq/phidata-docs#156

Open

Merge branch 'main' into feat/image_to_image_agent

733ebb8

manthanguptaa reviewed Dec 30, 2024

View reviewed changes

fix: let model also pass image

c443ffc

dirkbrnd requested changes Jan 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image to image agent #1628

Image to image agent #1628

anuragts commented Dec 24, 2024

manthanguptaa Dec 30, 2024

dirkbrnd Jan 2, 2025

manthanguptaa Dec 30, 2024

dirkbrnd Jan 2, 2025

dirkbrnd Jan 2, 2025

dirkbrnd Jan 2, 2025

dirkbrnd Jan 2, 2025

dirkbrnd Jan 2, 2025

Image to image agent #1628

Are you sure you want to change the base?

Image to image agent #1628

Conversation

anuragts commented Dec 24, 2024

Description

Type of change

Checklist

Additional Notes

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment