Wooseok Seo

Hi! I am a 1st year Graduate student at Yonsei University, and a visiting researcher at PILAB, Seoul National University. I am fortunate to be advised by Prof. Youngjae Yu.

My main research interest is in pushing the boundaries of foundational models through better evaluation or effective post-training . Recently, I am interested in:

  • Improving language models to utilize tools or external knowledge sources to solve challenging problems, with a focus on agentic frameworks such as deep research agents.
  • Defining good synthetic data to train models. I prefer scalable methods that can identify and generate useful data for model improvement.

More broadly, I'm interested in leveraging models to evaluate or improve other models, or utilizing them to augment human capabilities.

I am always open to research collaborations or grabbing a cup of coffee! Please reach me via email to have a chat 🤗

profile photo

News

2026.03 I will be joining as a Research Intern, working on deep research agents!
2025.10 I am attending COLM 2025! I will be at Montreal from 10/5 to 10/11, so please reach out to have a chat ☕
2025.09 I will be joining as a Research Intern, working on foundational language models!
2025.07 One paper on studying fact verifiers is accepted at COLM 2025!
2025.06 One paper on video diffusion distillation via preference learning is accepted at ICCV 2025!

Research Experience

Microsoft, Copilot Team
Research Intern
Mentor: Khanh Nguyen
Redmond, WA (Remote)
Mar 2026 ~ Jun 2026 (Exp.)
LG AI Research, EXAONE Lab
Research Intern
Mentor: Seokhee Hong
Seoul, South Korea
Sep 2025 ~ Feb 2026

Research

project image

K-EXAONE Technical Report

LG AI Research
Technical Report, 2026
We present K-EXAONE-236B-A23B, the best model in Korea. I contribute as a member of the post-training team, specifically working on synthetic data for reasoning.
arxiv / code

project image

Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers

Wooseok Seo*, Seungju Han*, Jaehun Jung, Benjamin Newman, Seungwon Lim, Seungbeen Lee, Ximing Lu, Yejin Choi, Youngjae Yu
COLM, 2025
We systematically detect ambiguous & mislabeled examples in fact-verification benchmarks and introduce Clearfacts and Grayfacts, along with a SOTA 8B fact verifer and insights on building better fact verifiers.
arxiv / code / bibtex

project image

V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models

Jisoo Kim, Wooseok Seo, Junwan Kim, Seungho Park, Sooyeon Park, Youngjae Yu
ICCV, 2025
We integrate DPO and SFT loss for distillation to build an efficient video diffusion model, with an automatic pair curation pipeline and outperform the teacher only with the synthetic data generated from the teacher itself.
arxiv / bibtex

project image

Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

Kangyeol Kim*, Wooseok Seo*, Sehyun Nam, Bodam Kim, Suhyeon Jeong, Wonwoo Cho, Jaegul Choo, Youngjae Yu
Under Review, 2024
We use a two-stage approach for personalized T2I generation, to first draw the context with step-blended denoising and enhance the context with multi-source attention swapping.
arxiv / bibtex

Academic Services

Reviewer

  • COLM, 2025
  • ACL ARR, 2025