Skip to content
#

gemini-pro-vision

Here are 176 public repositories matching this topic...

Unlock the potential of Google's Gemini AI models with this versatile toolkit. Offering seamless chat, text generation, and multimodal interactions, supporting various file types, including PDF's, images, videos, audio, text and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.

  • Updated Dec 14, 2024
  • Python
GptMap

Gptmap will guide you through creating a comprehensive Android application using a modern toolkit, highlighting the integration of AI technologies and illustrating the real-world applications of these advanced technologies, providing valuable insights and best practices.

  • Updated May 5, 2024
  • Kotlin

An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text and image-based interactions

  • Updated May 20, 2024
  • Python

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

  • Updated Dec 22, 2024
  • Python

Improve this page

Add a description, image, and links to the gemini-pro-vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gemini-pro-vision topic, visit your repo's landing page and select "manage topics."

Learn more