Hacker News new | past | comments | ask | show | jobs | submit login
Janus-Pro: Autoregressive framework unifying multimodal understanding&generation (huggingface.co)
49 points by victormustar 3 months ago | hide | past | favorite | 4 comments



As for what it is: it is a multimodal LLM that can accept both text and images, and generate both text and images as output.


Note: this model was just released from DeepSeek. https://github.com/deepseek-ai/Janus



Anybody used this yet and can share example outputs?




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: