끄적끄적

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Categories Paper

Category

Paper 12

Do Transformers Need Three Projections? — QKV 투영을 공유해 KV 캐시를 절반으로 Jun 11, 2026
Hierarchical Reasoning Model — 뇌에서 영감받은 계층적 잠재 추론 아키텍처 May 30, 2026
Hallucinations Undermine Trust; Metacognition is a Way Forward — Faithful Uncertainty로 환각을 재정의하다 May 30, 2026
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets May 12, 2026
TurboQuant: 정보 이론적 최적에 근접하는 온라인 벡터 양자화 Apr 16, 2026
Prompt Repetition Improves Non-Reasoning LLMs Mar 8, 2026
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free Mar 8, 2026
REFRAG: Rethinking RAG based Decoding Mar 8, 2026
Do As We Do, Not As You Think: The Conformity of Large Language Models Mar 8, 2026
Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate Mar 8, 2026
How we built our multi-agent research system Mar 8, 2026
Improving Factuality and Reasoning in Language Models through Multiagent Debate Mar 8, 2026

Recently Updated

Do Transformers Need Three Projections? — QKV 투영을 공유해 KV 캐시를 절반으로
How we built our multi-agent research system
Hierarchical Reasoning Model — 뇌에서 영감받은 계층적 잠재 추론 아키텍처
Hallucinations Undermine Trust; Metacognition is a Way Forward — Faithful Uncertainty로 환각을 재정의하다
MCP(Model Context Protocol) 개념과 구조

Trending Tags

llm stanford-cme295 transformer attention mcp multiagent kv-cache rag reasoning anthropic

© 2026 Jeong Mo, Hong. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

llm stanford-cme295 transformer attention mcp multiagent kv-cache rag reasoning anthropic

A new version of content is available.