Notes on the phi-4-reasoning Technical Paper
Phi-4 Reasoning Microsoft recently released the phi-4 reasoning model as well as its technical report. They explore using supervised finetuning as well as synthetic dataset curation to train phi-4 a 14B parameter model to compete with and often outperform significanly larger models such as DeepSeek-R1-Distill-Llama-70B. We present Phi-4-reasoning, a 14-billion