GitHub - lyuchenyang/Macaw-LLM: Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

lyuchenyang • github.com

[1hr Talk] Intro to Large Language Models

youtube.com

uni-medical • GitHub - uni-medical/SAM-Med3D: SAM-Med3D: An Efficient 3D Model for Promptable Volumetric Medical Image Segmentation

alibaba • GitHub - alibaba/data-juicer: A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷

The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.

browse.arxiv.org