YennJ12 Engineering Blog

Engineering insights, architecture deep dives, and technical solutions

LLM

Articles in LLM

Mar 13, 2026 35 min

The Complete Guide to Open-Source LLM Post-Training: From SFT to RLHF, a Step-by-Step Walkthrough of Training Qwen

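A comprehensive introduction to post-training methods for open-source LLMs, covering SFT, RLHF, DPO, ORPO, continued pretraining, and related techniques. Using Qwen as the running example, it analyzes each method's strengths and weaknesses, resource requirements, and suitable use cases to help you choose the most appropriate training strategy.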

LLM post-training fine-tuning

© 2026 YennJ12 Engineering Team. All rights reserved.
