INFUSER: Influence-Guided Self-Evolution Improves Reasoning
Self-evolution offers a scalable path to stronger reasoning: a pretrained language model improves itself with
- 用途
- 技術検証・論文読解補助
- 難易度
- Hard
- コスト
- High
「qa」の検索結果
15 件Self-evolution offers a scalable path to stronger reasoning: a pretrained language model improves itself with
Chain-of-thought (CoT) reasoning has proven effective for enhancing problem-solving in large language models.
Multimodal Large Language Models (MLLMs) have demonstrated remarkable success in visual understanding, yet the
Despite advances in 3D scene understanding, existing 3D Large Multimodal Models operate in offline settings, r
Agent systems increasingly use textual skills to encode reusable task procedures, but injecting these skills i
Retrieval-augmented QA pipelines often route retrieved passages through an LLM rewriter before a smaller reade
We study the personal camera roll visual question answering setting. In this setting, a conversational AI assi
System prompt optimization improves agent behavior without modifying the underlying model, yielding human-read
Processing video in vision-language models is expensive: each frame occupies hundreds of tokens, and inference
Selection is a core operation in interactive image editing. To be practical, a user should be able to specify
Memory is an indispensable capability for long-horizon LLM agents, enabling them to preserve and utilize infor
Agentic search systems iteratively interact with retrieval models to answer complex queries. Despite substanti
AI glasses present a compelling platform for AI agents to serve as personalized memory assistants. To be genui
Multimodal Large Language Models (MLLMs) have demonstrated significant achievements in general visual question
Off-policy reinforcement learning of pretrained flow policies remains challenging due to the instability of op