Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.
OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use cases. OpenAI is rolling out two algorithms called GPT-5.4 mini and GPT 5.4 ...
Artificial intelligence (AI) is rapidly transforming healthcare. AI systems can now detect diabetic eye disease from retinal photos and analyze CT images for signs of early-stage lung cancers and ...
A Stanford engineer has demonstrated that frontier language models can run directly on everyday edge devices using convex ...
Memory-based Small Language Models deployed across virtualized, highly distributed telecommunications networks achieve sub-500ms response times and up to 40x lower operating costs compared to LLMs on ...
Google Research has proposed a training method that teaches large language models to approximate Bayesian reasoning by learning from the predictions of an optimal Bayesian system. The approach focuses ...
GPT-4o achieved ICC/CCC of 0.815/0.866 versus in-person SALT scoring and 0.833/0.817 versus image-based scoring, while expert ...
As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...
At NVIDIA GTC 2026, DeepRoute.ai presented a comprehensive introduction to its 40-billion-parameter Vision-Language-Action (VLA) Foundation Model architecture, representing a fundamental breakthrough ...
The world's first Tibetan large language model and its application, DeepZang, has been officially unveiled in Lhasa, ...
HONG KONG and SHANGHAI, March 15, 2026 /PRNewswire/ -- Ping An Insurance (Group) Company of China, Ltd. ("Ping An" or "the Group"; HKEX: 2318/82318; SSE: 601318) announced that PingAnGPT-Qwen3-32B, ...
AI systems that understand and generate text, known as language models, are the hot new thing in the enterprise. A recent survey found that 60% of tech leaders said that their budgets for AI language ...