minus-squareKingsmanVince@kbin.socialOPtoMachine Learning@kbin.social•PaLI-3 Vision Language Models: Smaller, Faster, Strongerlinkfedilinkarrow-up1·edit-211 months ago SigLIP PaLI PaLI-X linkfedilink
KingsmanVince@kbin.social to Machine Learning@kbin.social · 11 months agoPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgexternal-linkmessage-square1fedilinkarrow-up10arrow-down10
arrow-up10arrow-down1external-linkPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgKingsmanVince@kbin.social to Machine Learning@kbin.social · 11 months agomessage-square1fedilink
KingsmanVince@kbin.social to Machine Learning@kbin.social · 1 year agoVision-Language Models for Vision Tasks: A Surveyplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkVision-Language Models for Vision Tasks: A Surveyplus-squarearxiv.orgKingsmanVince@kbin.social to Machine Learning@kbin.social · 1 year agomessage-square0fedilink
KingsmanVince@kbin.social to Machine Learning@kbin.social · 1 year agoVisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasksplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkVisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasksplus-squarearxiv.orgKingsmanVince@kbin.social to Machine Learning@kbin.social · 1 year agomessage-square0fedilink
minus-squareKingsmanVince@kbin.socialtoMachine Learning@kbin.social•Machine Learning Beginner Info/Resourceslinkfedilinkarrow-up2·edit-21 year agoI also want to share some resources. For Pytorch, https://pytorch.org/tutorials/ their basic tutorials are fundamental but some more advanced tutorials might be outdated. https://www.learnpytorch.io/ the author guides mostly in computer vision but he gives the overview from research to production. For TPU, https://github.com/ayaka14732/tpu-starter full guideline using TPUs with Jax linkfedilink