YouZum

News

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

arXiv:2505.04588v1. Abstract: Effective information searching is essential for enhancing the reasoning and...

ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search

Large language models are now central to various applications, from coding to academic tutoring and...

Your AI models are failing in production—Here’s how to fix model selection

The Allen Institute for AI updated its reward model evaluation RewardBench to better reflect real-life...

You can now fine-tune your enterprise’s own version of OpenAI’s o4-mini reasoning model with reinforcement learning

For organizations with clearly defined problems and verifiable answers, RFT offers a compelling way to...

Yandex Releases Yambda: The World’s Largest Event Dataset to Accelerate Recommender Systems

Yandex has recently made a significant contribution to the recommender systems community by releasing Yambda...

Yandex Releases Alchemist: A Compact Supervised Fine-Tuning Dataset for Enhancing Text-to-Image (T2I) Model Quality

Despite the substantial progress in text-to-image (T2I) generation brought about by models such as DALL-E...

Why Generalization in Flow Matching Models Comes from Approximation, Not Stochasticity

Introduction: Understanding Generalization in Deep Generative Models. Deep generative models, including diffusion and flow matching...

Why enterprise RAG systems fail: Google study introduces ‘sufficient context’ solution

Google’s “sufficient context” helps refine RAG systems, reduce LLM hallucinations, and boost AI reliability for...

Why Educational Videos Are the Quiet MVP

(aka how teaching is the new flex)

Why doctors should look for ways to prescribe hope

This week, I’ve been thinking about the powerful connection between mind and body. Some new...