AI 29

Do Transformers Need Three Projections? — QKV 투영을 공유해 KV 캐시를 절반으로 Jun 11, 2026
Hierarchical Reasoning Model — 뇌에서 영감받은 계층적 잠재 추론 아키텍처 May 30, 2026
Hallucinations Undermine Trust; Metacognition is a Way Forward — Faithful Uncertainty로 환각을 재정의하다 May 30, 2026
MCP(Model Context Protocol) 개념과 구조 May 19, 2026
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets May 12, 2026
TurboQuant: 정보 이론적 최적에 근접하는 온라인 벡터 양자화 Apr 16, 2026
270개 API를 가진 구조해석 SW를 LLM에 연결하기 - GEN NX MCP 서버 만들기 Apr 11, 2026
Stanford CME295: Lecture 9 - Recap & Current Trends Mar 9, 2026
Prompt Repetition Improves Non-Reasoning LLMs Mar 8, 2026
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free Mar 8, 2026
REFRAG: Rethinking RAG based Decoding Mar 8, 2026
Do As We Do, Not As You Think: The Conformity of Large Language Models Mar 8, 2026
Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate Mar 8, 2026
How we built our multi-agent research system Mar 8, 2026
Improving Factuality and Reasoning in Language Models through Multiagent Debate Mar 8, 2026
LLM 양자화 (Quantization) 가이드 Mar 8, 2026
Stanford CME295: Lecture 8 - LLM Evaluation Mar 8, 2026
Stanford CME295: Lecture 7 - Agentic LLMs (RAG, Tool Calling, Agents) Mar 8, 2026
Visual Studio 2022 Copilot vs Copilot CLI 아키텍처 비교 Mar 8, 2026
Stanford CME295: Lecture 6 - LLM Reasoning Mar 8, 2026
Stanford CME295: Lecture 5 - LLM Tuning (Preference Tuning) Mar 8, 2026
Stanford CME295: Lecture 4 - LLM Training Mar 8, 2026
Stanford CME295: Lecture 3 - LLMs & 추론 최적화 Mar 8, 2026
Stanford CME295: Lecture 2 - Transformer-Based Models & Tricks Mar 8, 2026
Stanford CME295: Lecture 1 - Transformer 기초 Mar 8, 2026
Stanford CME295: Lecture 0 - Transformer 개요 Mar 8, 2026
AI Coding Tool 활용 팁 - 캐시 암호화 문제 해결 Mar 8, 2026
AI Coding Tool 동작 원리 이해하기 Oct 22, 2025
LLM 동작 원리 알아보기 Oct 21, 2025