Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL.
Updated Nov 11, 2024 - Python
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Azure OpenAI (demos, documentation, accelerators).
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets that include synthetic data and filtered publicly available websites, with a focus on very high-quality, reasoning-dense data for both text and vision.
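Phi-3.5-vision is typically run through Hugging Face transformers with trust_remote_code enabled. Below is a minimal sketch of a single-image chat turn, assuming the microsoft/Phi-3.5-vision-instruct checkpoint; the image URL, question, and generation settings are illustrative placeholders, not taken from any repository listed here.

```python
# Minimal sketch: one image-question turn with Phi-3.5-vision via transformers.
# Assumes the microsoft/Phi-3.5-vision-instruct checkpoint; the image URL and
# prompt below are placeholders.
import requests
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-3.5-vision-instruct"
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", torch_dtype="auto", trust_remote_code=True
)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

# The Phi-3 vision chat format references images as <|image_1|>, <|image_2|>, ...
image = Image.open(requests.get("https://example.com/cat.jpg", stream=True).raw)
messages = [{"role": "user", "content": "<|image_1|>\nDescribe this image."}]
prompt = processor.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

inputs = processor(prompt, [image], return_tensors="pt").to(model.device)
output_ids = model.generate(
    **inputs,
    max_new_tokens=256,
    eos_token_id=processor.tokenizer.eos_token_id,
)
# Strip the prompt tokens so only the model's reply is decoded.
reply = processor.batch_decode(
    output_ids[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True
)[0]
print(reply)
```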
Phi-3-Vision model test - running locally
Microsoft Phi-3 Vision, the first multimodal model by Microsoft - demo with Hugging Face