Visual Multimodal Text

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models ...

Nature

Multimodal Argumentation and Visual Rhetoric

Multimodal argumentation and visual rhetoric encompass an emergent field that explores how diverse communicative modes—including images, diagrams and other visual representations—contribute to the ...

InfoQ

Mistral AI Releases Pixtral Large: a Multimodal Model for Advanced Image and Text Analysis


GIGAZINE

DeepSeek releases 'DeepSeek-OCR,' a multimodal AI model that uses visual information to compress text input

DeepSeek has released a new multimodal AI model called 'DeepSeek-OCR.' 'OCR' stands for Optical Character Recognition, which is used for document scanning and other purposes. The model is said to be ...

abc27

HiDream.ai Awards Best Demo at ACM MM 2025: Redefining Conversational Visual Creation

BEIJING, Nov. 6, 2025 /PRNewswire/ -- HiDream.ai was recently honored with the Best Demo award at the 33rd ACM International Conference on Multimedia (ACM MM 2025), thus becoming the first Chinese startup ...

SiliconANGLE

Microsoft releases new Phi models optimized for multimodal processing, efficiency

Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...

