๐Ÿ“š Weekly AI Paper Digest

๊ธฐ๊ฐ„: 2026-02-16 ~ 2026-02-21 ์„ ์ •: ์ด๋ฒˆ ์ฃผ ๊ฐ€์žฅ ์ฃผ๋ชฉ๋ฐ›์€ ๋…ผ๋ฌธ Top 5


๐Ÿ† ์ด๋ฒˆ ์ฃผ Top 5

์ˆœ์œ„๋…ผ๋ฌธโฌ†๏ธDeep Dive
๐Ÿฅ‡Less is Enough: Synthesizing Diverse Datโ€ฆ219DD-026
๐ŸฅˆSQuTR: A Robustness Benchmark for Spokenโ€ฆ140DD-027
๐Ÿฅ‰GLM-5: from Vibe Coding to Agentic Enginโ€ฆ71DD-028
4.Experiential Reinforcement Learning61DD-029
5.MedXIAOHE: A Comprehensive Recipe for Buโ€ฆ58DD-030

๐Ÿ” ์ด๋ฒˆ ์ฃผ ํŠธ๋ Œ๋“œ

ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ

  • ํ”ผ์ฒ˜ ๊ณต๊ฐ„ ๋ฐ์ดํ„ฐ ํ•ฉ์„ฑ (Feature Space Synthesis): ํ…์ŠคํŠธ์˜ ์–ธ์–ด์  ๋‹ค์–‘์„ฑ์ด ์•„๋‹Œ ๋ชจ๋ธ์˜ ํ”ผ์ฒ˜ ๊ณต๊ฐ„์—์„œ ์œ ์˜๋ฏธํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜์—ฌ ์‚ฌํ›„ ํ•™์Šต ํšจ์œจ์„ ๊ทน๋Œ€ํ™”ํ•˜๋Š” ์ ‘๊ทผ๋ฒ•
  • ์—์ด์ „ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง (Agentic Engineering): ๋‹จ์ˆœํ•œ ์ฝ”๋“œ ์ƒ์„ฑ์ด๋‚˜ ์ถ”๋ก ์„ ๋„˜์–ด, ๋ชจ๋ธ์ด ์ž์œจ์ ์œผ๋กœ ์—”์ง€๋‹ˆ์–ด๋ง ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๋Š” ํŒจ๋Ÿฌ๋‹ค์ž„์œผ๋กœ์˜ ์ „ํ™˜
  • ์‹ค์„ธ๊ณ„ ๊ฐ•์ธ์„ฑ (Real-world Robustness): ์Œ์„ฑ ๋…ธ์ด์ฆˆ๋‚˜ ์˜๋ฃŒ ํ˜„์žฅ ๋“ฑ ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋ณต์žกํ•œ ๋ณ€๋™๊ณผ ํ”ผ๋“œ๋ฐฑ ์†์—์„œ๋„ ๊ฒฌ๊ณ ํ•˜๊ฒŒ ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•˜๋Š” ๊ฒƒ
  • ๊ฒฝํ—˜์  ๊ฐ•ํ™”ํ•™์Šต (Experiential RL): ํฌ์†Œํ•˜๊ณ  ์ง€์—ฐ๋œ ํ”ผ๋“œ๋ฐฑ ํ™˜๊ฒฝ์—์„œ ์–ธ์–ด ๋ชจ๋ธ์ด ๊ด€์ฐฐ๋œ ์‹คํŒจ๋ฅผ ๋ฏธ๋ž˜์˜ ํ–‰๋™ ๋ณ€ํ™”๋กœ ํšจ๊ณผ์ ์œผ๋กœ ์ „ํ™˜์‹œํ‚ค๋Š” ํ•™์Šต ๋ฐฉ๋ฒ•

๊ณตํ†ต ์ฃผ์ œ

์ด๋ฒˆ ์ฃผ ๋…ผ๋ฌธ๋“ค์€ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์˜ ์–‘์  ํ™•๋ณด๋ณด๋‹ค๋Š” ๋ฐ์ดํ„ฐ์˜ ์งˆ์  ํšจ์œจ์„ฑ๊ณผ ๋ชจ๋ธ์˜ ์‹ค์šฉ์  ๊ฐ•์ธ์„ฑ์„ ๊ณ ๋„ํ™”ํ•˜๋Š” ๋ฐ ์ดˆ์ ์„ ๋งž์ถ”๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ์—ฐ๊ตฌ์ž๋“ค์€ ๋ชจ๋ธ์˜ ๋‚ด๋ถ€ ํ”ผ์ฒ˜ ๊ณต๊ฐ„์„ ์ดํ•ดํ•˜์—ฌ ์ ์€ ๋ฐ์ดํ„ฐ๋กœ๋„ ํ•™์Šต ํšจ๊ณผ๋ฅผ ๋†’์ด๊ฑฐ๋‚˜, ๋…ธ์ด์ฆˆ์™€ ์ „๋ฌธ ๋„๋ฉ”์ธ ๋“ฑ ์‹ค์ œ ํ™˜๊ฒฝ์˜ ์–ด๋ ค์›€์„ ๊ทน๋ณตํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์—ฐ๊ตฌ๋ฅผ ํ™•์žฅํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

์ฃผ๋ชฉํ•  ์ 

GLM-5 ์—ฐ๊ตฌ์—์„œ ์ œ์‹œ๋œ โ€˜๋ฐ”์ด๋ธŒ ์ฝ”๋”ฉ(Vibe Coding)โ€˜์—์„œ โ€˜์—์ด์ „ํŠธ ์—”์ง€๋‹ˆ์–ด๋งโ€™์œผ๋กœ์˜ ํŒจ๋Ÿฌ๋‹ค์ž„ ์ „ํ™˜์€ AI๊ฐ€ ๋‹จ์ˆœํ•œ ๋„๊ตฌ๋ฅผ ๋„˜์–ด ์‚ฌ์šฉ์ž์˜ ์˜๋„๋ฅผ ์ž์œจ์ ์œผ๋กœ ํ•ด์„ํ•˜๊ณ  ์‹คํ–‰ํ•˜๋Š” ๋‹จ๊ณ„๋กœ ์ง„์ž…ํ•˜๊ณ  ์žˆ์Œ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ, ๊ธฐ์กด ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜์˜ ๋‹ค์–‘์„ฑ ์ง€ํ‘œ์˜ ํ•œ๊ณ„๋ฅผ ์ง€์ ํ•˜๊ณ  ํ”ผ์ฒ˜ ๊ณต๊ฐ„์ƒ์˜ ๋ฐ์ดํ„ฐ ํ•ฉ์„ฑ์„ ํ†ตํ•ด โ€˜Less is Enoughโ€™๋ฅผ ์‹คํ˜„ํ•˜๋ ค๋Š” ์‹œ๋„์™€, ๊ฐ•ํ™”ํ•™์Šต์—์„œ์˜ ์‹คํŒจ๋ฅผ ๋” ์ž˜ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒฝํ—˜์  ์ ‘๊ทผ๋ฒ•์€ ๋ฐ์ดํ„ฐ ํšจ์œจ์„ฑ๊ณผ ํ•™์Šต ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ํ˜์‹ ์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.

์‹ค๋ฌด ์‹œ์‚ฌ์ 

LLM์„ ์‚ฌํ›„ ํ•™์Šต(Post-training)ํ•  ๋•Œ๋Š” ๋‹จ์ˆœํžˆ ๋ฐ์ดํ„ฐ ์–‘์„ ๋Š˜๋ฆฌ๊ธฐ๋ณด๋‹ค, ํ”ผ์ฒ˜ ๊ณต๊ฐ„ ๋ถ„์„์„ ํ†ตํ•ด ํƒœ์Šคํฌ ๊ด€๋ จ ํŠน์ง•์„ ์ž˜ ํฌ์ฐฉํ•˜๋Š” ๊ณ ํ’ˆ์งˆ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์„ ๋ณ„ํ•˜๊ฑฐ๋‚˜ ํ•ฉ์„ฑํ•˜๋Š” ์ „๋žต์ด ๋น„์šฉ ์ ˆ๊ฐ๊ณผ ์„ฑ๋Šฅ ํ–ฅ์ƒ์— crucialํ•  ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋˜ํ•œ ์Œ์„ฑ ๊ฒ€์ƒ‰์ด๋‚˜ ์˜๋ฃŒ ๋ถ„์•ผ ๋“ฑ ํŠน์ • ๋„๋ฉ”์ธ ์„œ๋น„์Šค๋ฅผ ๊ฐœ๋ฐœํ•  ๋•Œ๋Š” ๊นจ๋—ํ•œ ํ•™์Šต ๋ฐ์ดํ„ฐ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ SQuTR์ด๋‚˜ MedXIAOHE ์‚ฌ๋ก€์ฒ˜๋Ÿผ ๋…ธ์ด์ฆˆ์™€ ๋ณต์žกํ•œ ์ƒํ™ฉ์„ ๋ฐ˜์˜ํ•œ ๊ฐ•์ธ์„ฑ ํ…Œ์ŠคํŠธ๋ฅผ ๋ชจ๋ธ ๊ฒ€์ฆ ํ”„๋กœ์„ธ์Šค์— ๋ฐ˜๋“œ์‹œ ํฌํ•จํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.


๐Ÿ“‘ ๋…ผ๋ฌธ๋ณ„ ์š”์•ฝ

๐Ÿฅ‡ 1. Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

arXiv: 2602.10388 | โฌ†๏ธ 219 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: llm data-synthesis sparse-autoencoder feature-learning post-training interpretability alignment data-diversity

๊ธฐ์กด ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜ ๋‹ค์–‘์„ฑ ์ง€ํ‘œ์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด, ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์˜ ๋‚ด๋ถ€ ํŠน์„ฑ ๊ณต๊ฐ„(Feature Space)์—์„œ ๋ˆ„๋ฝ๋œ ์ค‘์š”ํ•œ ํŠน์ง•์„ ์‹๋ณ„ํ•˜๊ณ  ์ด๋ฅผ ์ฑ„์šฐ๋Š” ๋ฐฉ์‹์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ํ•ฉ์„ฑํ•˜์—ฌ ์ ์€ ์–‘์œผ๋กœ๋„ ํ›„์† ํ•™์Šต ํšจ์œจ์„ ๊ทน๋Œ€ํ™”ํ•˜๋Š” ์ƒˆ๋กœ์šด ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ–ˆ๊ธฐ ๋•Œ๋ฌธ์— ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿฅˆ 2. SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

arXiv: 2602.12783 | โฌ†๏ธ 140 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: spoken-query-retrieval robustness-benchmark acoustic-noise asr-error-propagation information-retrieval noise-robustness squtr multimodal-retrieval

์‹ค์ œ ํ™˜๊ฒฝ์˜ ์†Œ์Œ(Noise)์„ ๊ณ ๋ คํ•˜์—ฌ ์Œ์„ฑ ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ์˜ ๋‚ด๊ตฌ์„ฑ์„ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” ๋Œ€๊ทœ๋ชจ ๋ฒค์น˜๋งˆํฌ(SQuTR)๋ฅผ ์ œ์•ˆํ•จ์œผ๋กœ์จ, ์Œ์„ฑ ์ธ์‹ ์˜ค๋ฅ˜๊ฐ€ ์ •๋ณด ๊ฒ€์ƒ‰ ์„ฑ๋Šฅ์— ๋ฏธ์น˜๋Š” ์•…์˜ํ–ฅ์„ ์ •๋Ÿ‰์ ์œผ๋กœ ๋ถ„์„ํ•˜๊ณ  ๊ธฐ์กด ์—ฐ๊ตฌ๋“ค์˜ ํ‰๊ฐ€ ๊ฒฉ์ฐจ๋ฅผ ํ•ด์†Œํ–ˆ๋‹ค๋Š” ์ ์—์„œ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿฅ‰ 3. GLM-5: from Vibe Coding to Agentic Engineering

arXiv: 2602.15763 | โฌ†๏ธ 71 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: glm-5 agentic-ai asynchronous-rl software-engineering llm-post-training model-alignment long-context vibe-coding

์ด ๋…ผ๋ฌธ์€ ์ธ๊ฐ„์ด ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ž…๋ ฅํ•˜๋Š” โ€˜๋ฐ”์ด๋ธŒ ์ฝ”๋”ฉ(Vibe Coding)โ€™ ๋‹จ๊ณ„๋ฅผ ๋„˜์–ด, AI ์Šค์Šค๋กœ ๊ณ„ํšํ•˜๊ณ  ์ฝ”๋“œ๋ฅผ ์ž‘์„ฑํ•˜๋Š” โ€˜์—์ด์ „ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง(Agentic Engineering)โ€™ ์‹œ๋Œ€๋ฅผ ์—ด์—ˆ์œผ๋ฉฐ, ๋น„๋™๊ธฐ ๊ฐ•ํ™” ํ•™์Šต(Asynchronous RL)์„ ํ†ตํ•ด ํšจ์œจ์„ฑ์„ ๊ทน๋Œ€ํ™”ํ•˜์—ฌ ์˜คํ”ˆ ์†Œ์Šค ๋ชจ๋ธ์ด ์ตœ์ƒ์œ„ ์ƒ์šฉ ๋ชจ๋ธ์„ ๋›ฐ์–ด๋„˜์„ ์ˆ˜ ์žˆ์Œ์„ ์ฆ๋ช…ํ–ˆ๊ธฐ์— ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


4. 4. Experiential Reinforcement Learning

arXiv: 2602.13949 | โฌ†๏ธ 61 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: reinforcement-learning experiential-learning agentic-ai sparse-rewards reflection-loop reasoning llm-training

์ด ๋…ผ๋ฌธ์€ ์–ธ์–ด ๋ชจ๋ธ์ด ํฌ์†Œํ•˜๊ณ  ์ง€์—ฐ๋œ ํ”ผ๋“œ๋ฐฑ ํ™˜๊ฒฝ์—์„œ ๊ฒช๋Š” ํ•™์Šต์˜ ๋น„ํšจ์œจ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์ธ๊ฐ„์˜ ๊ฒฝํ—˜ ํ•™์Šต ๊ณผ์ •์ฒ˜๋Ÿผ ์„ฑ์ฐฐ(Reflection)๊ณผ ๊ต์ •(Correction) ๋‹จ๊ณ„๋ฅผ ๊ฐ•ํ™” ํ•™์Šต ๋ฃจํ”„์— ๋ช…์‹œ์ ์œผ๋กœ ํ†ตํ•ฉํ•˜์—ฌ ํ•™์Šต ํšจ์œจ๊ณผ ์„ฑ๋Šฅ์„ ํš๊ธฐ์ ์œผ๋กœ ๋†’์˜€๊ธฐ์— ๋งค์šฐ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


5. 5. MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

arXiv: 2602.12705 | โฌ†๏ธ 58 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: medical-ai multimodal-learning continual-pretraining reasoning clinical-decision-support llm reinforcement-learning data-curation

์˜๋ฃŒ ๋ถ„์•ผ์˜ ์ด๊ธฐ์ข… ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ฉํ•˜์—ฌ ์ง€์‹์˜ ๊ณต๋ฐฑ์„ ๋ฉ”์šฐ๊ณ  ๊ฐ•ํ™” ํ•™์Šต ๋ฐ ๋„๊ตฌ ํ™œ์šฉ ํ›ˆ๋ จ์„ ํ†ตํ•ด ์ „๋ฌธ๊ฐ€ ์ˆ˜์ค€์˜ ์ž„์ƒ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ๊ฐ–์ถ˜ ์ตœ๊ณ  ์„ฑ๋Šฅ์˜ ์˜๋ฃŒ์šฉ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•œ ์ ์—์„œ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿ“… ์ƒ์„ฑ์ผ: 2026-02-22 | ๐Ÿค– GLM-4.7 Weekly Digest