Skip to content

divakarkumarp/Phi-3-Vision-MS-Multimodal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Phi-3-Vision-Microsoft-Multimodal

Microsoft Phi-3 Vision-the first Multimodal model By Microsoft, a multimodal model that brings together language and vision capabilities. the multimodal version comes with 128K context length (in tokens) it can support. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.

Demo with Huggingface🤗

image

About

Phi-3-Vision-128K-Instruct Demo

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published