Vision Language models: towards multi-modal deep learning

A review of state-of-the-art vision-language models such as CLIP, DALLE, ALIGN and SimVL

Artificial Technology, Jun 8, 2024

Muhammad Hadi
