Multimodal Model - Search News

19h

Talking to the Moon: World’s First Multimodal Foundation Model for Lunar Exploration and Resources

The same AI methods that power ChatGPT can now allow you to talk to the Moon. It's good to be skeptical when applying ...

12d

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

The Cardiology Advisor

Multimodal Sleep Foundation Model Can Predict Risk for 130 Conditions

A multimodal sleep foundation model based on polysomnography data can predict the risk for multiple conditions.

Deep learning model can predict cardiopulmonary disease in retinal images of premature infants

A deep learning model using retinal images obtained during retinopathy of prematurity (ROP) screening may be used to predict diagnosis of bronchopulmonary dysplasia (BPD) and pulmonary hypertension ...

Business Wire

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image Understanding

SEATTLE--(BUSINESS WIRE)--Ai2 (The Allen Institute for AI) today announced Molmo 2, a state-of-the-art open multimodal model suite capable of precise spatial and temporal understanding of video, image ...

Geeky Gadgets

AnyGPT any-to-any open source multimodal large language model (LLM)

AnyGPT is an innovative multimodal large language model (LLM) capable of understanding and generating content across various data types, including speech, text, images, and music. This model is ...

Nasdaq

Remark AI Launches its Large Multimodal Model (LMM) AI-Powered Aviation Safety Platform (ASP)

Current safety inspection failures and supply chain disruptions experienced by Boeing and its airline customers create a natural demand for AI to improve safety performance by maintenance crews. Top ...

SiliconANGLE

Microsoft releases new Phi models optimized for multimodal processing, efficiency

Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...

12d

Zhipu AI open-sources advanced multimodal model trained on Huawei Ascend chips, marking solid step toward independent tech development

Chinese AI startup Zhipu AI announced on Wednesday that it has partnered with Huawei to open-source GLM-Image, a ...

China Automotive Multimodal Interaction Development Research Report 2025 Featuring Multimodal Interaction Cockpit Solutions of 14 OEMs, and Multimodal Cockpit Solutions of 8 ...

The automotive multimodal interaction market offers opportunities in evolving intelligent cockpits from L2 to L4, enhancing AI agents for personalized, proactive driver assistance. Integration of ...
