MaMMUT: A simple vision-encoder text-decoder architecture for multimodal tasks
3 points by mfiguiere | 1 comment on Hacker News.
Thursday, May 4, 2023