YouZum

News

Unique Hard Attention: A Tale of Two Sides

arXiv:2503.14615v2 Announce Type: replace-cross Abstract: Understanding the expressive power of transformers has recently attracted attention...

Understanding OpenAI Codex CLI Commands

We have seen a new era of agentic IDEs like Windsurf and Cursor AI...

Understanding In-context Learning of Addition via Activation Subspaces

arXiv:2505.05145v2 Announce Type: replace-cross Abstract: To perform in-context learning, language models must extract signals from...

TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers

arXiv:2505.08402v1 Announce Type: new Abstract: Recently, large language models (LLMs) have played an increasingly important role...

Time Magazine appears to accidentally publish embargoed story confirming new Anthropic model

Someone also appears to have published a full scrape of the Time article online on...

Three takeaways about AI’s energy use and climate impacts

This week, we published Power Hungry, a package all about AI and energy. At the...

This data set helps researchers spot harmful stereotypes in LLMs

AI models are riddled with culturally specific biases. A new data set, called SHADES, is...

This AI Paper Investigates Test-Time Scaling of English-Centric RLMs for Enhanced Multilingual Reasoning and Domain Generalization

Reasoning language models, or RLMs, are increasingly used to simulate step-by-step problem-solving by generating long...

This AI Paper Introduces WINGS: A Dual-Learner Architecture to Prevent Text-Only Forgetting in Multimodal Large Language Models

Multimodal LLMs: Expanding Capabilities Across Text and Vision. Expanding large language models (LLMs) to handle...

This AI Paper Introduces WEB-SHEPHERD: A Process Reward Model for Web Agents with 40K Dataset and 10× Cost Efficiency

Web navigation focuses on teaching machines how to interact with websites to perform tasks such...

This AI Paper Introduces MathCoder-VL and FigCodifier: Advancing Multimodal Mathematical Reasoning with Vision-to-Code Alignment

Multimodal mathematical reasoning enables machines to solve problems involving textual information and visual components like...