multimodal
A list of posts tagged multimodal
- Ultravox - An open, fast, and extensible multimodal LLM
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
- V-JEPA: The next step toward Yann LeCun’s vision of advanced machine intelligence (AMI)
- OpenAI Sora - Creating video from text
- Ferret: Refer and Ground Anything Anywhere at Any Granularity
- PointLLM: Empowering Large Language Models to Understand Point Clouds