DeepSeek AI Releases Janus: A 1.3B Multimodal Model with Image Generation Capabilities
Researchers from DeepSeek-AI, the University of Hong Kong, and Peking University propose Janus, an autoregressive framework that unifies multimodal understanding and generation by using two separate visual encoding pathways. Prior unified models typically rely on a single visual encoder for both tasks; Janus instead gives each task its own encoding pathway and feeds both into one shared transformer. Decoupling the encoders eases the conflict that arises when a single encoder has to serve both understanding and generation, and it adds flexibility, since each pathway can use the encoding method that best suits its task. The name "Janus" reflects this duality: Janus is the two-faced Roman god of transitions and coexistence.
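To make the idea of two encoding pathways feeding one shared model more concrete, here is a minimal toy sketch in plain Python. It is not Janus's actual code: the names `understanding_pathway`, `generation_pathway`, `shared_backbone`, and `forward` are hypothetical, the "encoders" are trivial stand-ins, and the "transformer" is replaced by a causal running mean. The only point it illustrates is the routing: each task picks its own encoder, and both end up in the same backbone.

```python
from typing import Callable, Dict, List

Vector = List[float]
Image = List[List[float]]  # rows of pixel intensities in [0, 1]
DIM = 2  # shared embedding width expected by the backbone (assumed for this toy)


def understanding_pathway(image: Image) -> List[Vector]:
    # Hypothetical stand-in for a semantic encoder:
    # one continuous feature vector (mean, max) per image row.
    return [[sum(row) / len(row), max(row)] for row in image]


def generation_pathway(image: Image, levels: int = 4) -> List[Vector]:
    # Hypothetical stand-in for a discrete tokenizer: quantize each pixel
    # to a code id, then map the id to a tiny fixed embedding.
    tokens = []
    for row in image:
        for pixel in row:
            code = min(int(pixel * levels), levels - 1)
            tokens.append([code / (levels - 1), 1.0])
    return tokens


def shared_backbone(tokens: List[Vector]) -> List[Vector]:
    # Stand-in for the unified transformer: a causal running mean, so each
    # output position depends only on the current and earlier tokens.
    outputs, totals = [], [0.0] * DIM
    for i, tok in enumerate(tokens, start=1):
        totals = [t + x for t, x in zip(totals, tok)]
        outputs.append([t / i for t in totals])
    return outputs


# Each task selects its own encoder; the backbone is the same for both.
PATHWAYS: Dict[str, Callable[[Image], List[Vector]]] = {
    "understanding": understanding_pathway,
    "generation": generation_pathway,
}


def forward(image: Image, task: str) -> List[Vector]:
    if task not in PATHWAYS:
        raise ValueError(f"unknown task: {task}")
    return shared_backbone(PATHWAYS[task](image))


image = [[0.0, 0.5], [1.0, 0.25]]
print(len(forward(image, "understanding")))  # one output per image row
print(len(forward(image, "generation")))     # one output per pixel
```

In this sketch the two pathways produce sequences of different lengths and different kinds of features, but both emit vectors of the same width, which is what lets a single backbone consume either one.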
Audio created by NotebookLM and reviewed by a real human.
#opensource #artificialintelligence #neuralnetworks #datascience #ai