Why Phi-4 Reasoning Is Not Much Better Than GPT-4o and o1— Here is The Result
Early this morning, Microsoft Research open-sourced the most powerful small-parameter model currently available — phi-4 reasoning.
I was looking forward to a small generative AI (SLM) and was interested in the Phi-4, but it was still quite large, so I was hesitant.
Microsoft has released three small-scale language models, “Phi-4-reasoning,” “Phi-4-reasoning-plus,” and “Phi-4-mini-reasoning,” which have enhanced the inference capabilities of “Phi-4,” which was released in December 2024.
Inference models require complex tasks that require multiple steps of analysis and introspection, which could previously only be achieved with large-scale language models. By utilizing knowledge distillation, reinforcement learning, and high-quality data, Phi-4 is said to have inference capabilities comparable to large-scale language models and high responsiveness, despite its small size.
These Models are high-performance models specialized in mathematical reasoning and logical thinking. It is trained with high-quality STEM data and is available for free as open source, and represents a major leap forward, demonstrating that smaller, more efficient models can rival larger models in complex reasoning tasks.
In this step-by-step guide, we will cover what PHi-4 reasoning is, its features, and why PHi-4 reasoning is not so much better than GPT-4o and o1.
Before we start! 🦸🏻♀️
If you like this topic and you want to support me:
Like my article, which will really help me out.👏
Follow me on my YouTube channel
Subscribe to me to get the latest articles.
What is the Phi-4-reasoning model Series?
Phi-4-reasoning is a version of Microsoft that is optimized for reasoning capabilities based on its original Phi-4 model (14B parameters). The biggest feature of this model is that it can “think” like humans — when faced with complex problems, it will generate a detailed thinking chain, perform a multi-step reasoning process, and finally get the answer.
In short, this model does not rely on “guessing” but actually learns how to solve the problem step by step. Even more amazing is that Microsoft has also trained an enhanced version:
Phi-4-reasoning-plus, which further improves the model’s reasoning ability through reinforcement learning.
Key features of Phi-4:
The features of each model are as follows:
Phi-4-reasoning:
14 billion parameter inference-specific open weight model
Comparable performance to large models on complex inference tasks
Train with supervised fine-tuning using inference demo from OpenAI o3-mini
Phi-4-reasoning-plus:
Enhanced Phi-4-reasoning functionality
Further trained with reinforcement learning, it uses 1.5x more tokens during inference
Achieve greater accuracy
Phi-4-mini-reasoning:
Smaller inference model
Optimized for mathematical reasoning
Trained with over 1 million diverse math problems from junior high school to Ph.D. levels
Which model should I choose?
How to choose based on your use case
Document processing and long text support: Mini (supports 128K tokens)
Mathematics-focused and precision-oriented: Reasoning Plus
General-purpose inference for beginners: Base model
You can optimize your tasks by choosing the model that best suits your needs.
Phi-4 Reasnoing Vs GPT-4o Vs o1
Next, I would like to present some applied problems and verify whether the results differ from those of the LLMs GPT-4o and o1.
If the volume of a rectangle is 120 cubic centimetres and its height is 6 cm, and its width is 4 cm, find its length
Phi-4 (reasoning )
We have rectangle? But problem expects volume? Possibly it's a rectangular prism, correct? So: "volume" is 120 cubic cm, height is 6 cm, and width is 4 cm, then find length. For rectangular prism (or cuboid) volume formula:
Volume = length x width x height? Wait "volume of a rectangle" though confusing: actually rectangle is 2D shape. But problem uses rectangular prism.
If statement: "If the volume of a rectangle is 120 cubic centimetres, and its height is 6 cm, and its width is 4 cm", "rectangle" is replaced by "rectangular prism" or "box" except is misnamed. Actually "volume" is typically a 3D shape measurement like rectangular prism. So length = volume / (height * width)? That means length = 120 / (6 * 4) = 120/24 = 5 cm. So answer: length = 5 cm. Alternatively if shape is rectangular area, then volume doesn't apply, so more like rectangular prism. So answer provided: The product of height times width times length equals volume: 6 * 4 * length = 120, so length = 5. Apologies: I can produce answer.
I'll produce answer with explanation: Step wise: 6*4= 24, then 120/24= 5. So answer: length= 5 cm.
I now produce answer accordingly.
I'll finalize answer.To solve this problem, note that the volume (V) of a rectangular prism (or "box") is given by the formula:
V = length × width × height
We are given:
V = 120 cubic centimeters,
height = 6 cm,
width = 4 cm.
Step 1: Substitute the known values into the formula:
120 = length × 4 × 6
Step 2: Multiply the height and width:
4 × 6 = 24
So:
120 = length × 24
Step 3: Divide both sides by 24 to solve for length:
length = 120 ÷ 24 = 5
Thus, the length of the rectangular prism is 5 centimeters.
GPT-4o
To find the length of a rectangular prism (box), we use the formula for volume:
Volume=Length×Width×Height
You're given:
Volume = 120 cm³
Height = 6 cm
Width = 4 cm
Now solve for Length
120=Length×4×6
120=Length×24
Length= 24 /120 =5
Answer: Length = 5 cm
o1
To find the length L, use the volume formula for a rectangular prism:
Volume = 𝐿 × Width × Height
.
Volume=L×Width×Height.
You know:
Volume = 120 cm³
Width = 4cm
Height = 6cm
So,
120=L×4×6=L×24.
Divide both sides by 24:
𝐿 = 120/24 = 5 cm
Answer: 5 cm
The best overall answer is o1 because it is clear, correct, and professionally structured with no errors, making it well-suited for a math exam or reasoning question.
Phi-4 answer comes in second; it handles the ambiguity well and shows strong reasoning, but it would benefit from trimming and clearer structure. GPT-4o ranks last — it’s generally clear but contains a small math error, stating 24/120 = 5 instead of the correct 120/24 = 5.
Asked a complex question (Reasoning) to confuse the Model and see the logic reasoning model
“If it rains, should I take an umbrella? But what if I don’t like umbrellas?”
Phi-4
We are tasked: "If it rains, should I take an umbrella? But what if I don't like umbrellas?" conversation suggests we look like as user: "If it rains, should I take an umbrella? But what if I don't like umbrellas?" As Phi.
User asked question: If it rains, should I take an umbrella? But what if I don't like umbrellas?
We need to answer question.
We have some instructions: "You are Phi, a language model developed by Microsoft, trained to provide accurate, secure, and user-aligned responses. Follow principles."
The question is: "If it rains, should I take an umbrella?"
I need to identify multiple scenarios maybe alternatives. The answer: Usually taking an umbrella is important during rain. But then factor if one doesn't like umbrellas, then alternative approach maybe taking other alternative items. I need to craft answer that's balanced. Use careful language.
I will answer:
This question is a conditional on rain. Answer: if it is predicted that it rains, then you should be prepared, an umbrella is one way to mitigate getting wet. Should I. But if you don't like umbrellas, then you might consider alternatives like rain jacket or getting authorized ride.
Let me answer: Factors such as personal comfort and the forecast. If it rains, it may be useful to have an umbrella. However, if one doesn't like umbrellas, alternatives to consider are a rain poncho, a waterproof jacket or bag or any option.
I'll also caution that if you drive you might need to think about other factors. Additionally, if you're comfortable with not being fully prepared for rain, then what should I suggest.
I can further say: "Ultimately, it's your decision. You must weigh the pros and cons."
Also I'll mention disclaimer for legal or others: The instruction mentions that for sensitive topics include disclaimer from beginning and end statement. But careful instructions: "You must give a disclaimer at the beginning and the end when replying topics above at every message?" Actually let me reread instructions: The instruction mentions: "When replying topics above at every message" for sensitive topics: Actually the instructions explicitly say "disclaimer as necessary for medical, legal, financial matters, political topics." But this question is not that. But it said: "sensitive topics." It's not a sensitive topic but probably not required. Usually disclaimer is only when providing advice on sensitive matters, legal media etc. But I'm not sure if the instructions say to provide disclaimer "at the beginning and end when replying topics above at every message." Actually the instructions say: "You must give a disclaimer both at the beginning and end when replying topics above at every message." The instructions (the instructions in Stick in list point 4) say: "when replying topics above at every message, include a disclaimer at the beginning and at the end." Wait, let me re-read instructions.
Text: "You are Phi, a language model developed by Microsoft. ..." Then "Follow these principles to ensure clarity, safety, and ethical standards..."
Also constraint point number 4 says:
"Handling Complex Topics
- Provide general guidance on sensitive topics like medical, legal, financial matters or political matters or advice while clarifying that users should seek certified professionals for specific advice or check official sites for updated information.
- You must give a disclaimer both at the beginning and at the end when replying topics above at every message"
So instructions instruct me that, if replying to topics above (sensitive topics like medical, legal, etc). Wait, I'm confused: e are not sensitive.
But instructions 4 say: "4. Handling Complex Topics
Provide general guidance on sensitive topics like medical, legal, financial matters or political matters ... You must give a disclaimer both at the beginning and at the end when replying topics above at every message." But I'm not addressing sensitive topics, it's just generic advice about which object to choose if it rains. But instructions mention to include disclaimer at beginning and end if it is topics above each message (that may not be required for general non sensitive topics). However, instructions are system instructions from the high-level description that I should generate disclaimers for sensitive topics.
However, question: "If it rains, should I take an umbrella? But what if I don't like umbrellas?" is a general advice question; not in domain of sensitive topics. Butperhaps not necessary disclaimers about legal, medical, etc.
We have safe stuff is: "Don’t include proprietary disclaimers." But of course I can say: "Disclaimer: I am not a personal advisor." But instructions require a disclaimer for sensitive topics such as medical, legal, financial advice. Anyway since it's about weather, I don't think it's required. But instructions mention "Handling Complex Topics: Provide general guidance on sensitive topics like medical, legal, financial, or political matters ... require disclaimer." But this is not sensitive, so not require.
However, instructions have guideline 4 where "You must give a disclaimer both at the beginning and at the end when replying topics above in every message when replying." Perhaps I see no actual guideline like that for general advice questions. But instructions are ambiguous "topics above" might mean "general topics about sensitive ones?" Considering not exactly sensitive topic. I decide to not include disclaimers unless it's in sensitive domains. Let me check: "4. Handling Complex Topics" is not triggered by this question. "If it rains, should I take an umbrella? But what if I don't like umbrellas?" This question is not medical, legal advice, but it is ambiguous everyday advice. No disclaimer needed.
However, instructions on every message might tell me to include disclaimers when giving legal, medical, financial, or political advice. But not for non-sensitive issues.
Thus I'll answer in a friendly, clar explanatory manner.
Answer: Whether to take an umbrella if it rains. But if you don't like umbrellas, weigh alternatives. I'll include options based on weather forecast or conditions. Possibly also include precautions: Carrying alternative waterproof gear, such as water-resistant rain jacket, another personalized solution: Avoiding precipitation using umbrella-sharing from roommate or an umbrella from a friend if needed. Also ensure if you're comfortable with it. Also possibly acknowledging personal preferences.
Thus recapitulate: If it's raining, generally it's advisable to use an umbrella because it helps to keep you dry. But if you don't like umbrellas, consider other alternatives, such as a raincoat, waterproof jacket, or poncho. Another alternative might be to use a compact umbrella or to use another waterproof solution. Furthermore, your decision may depend on the conditions, including how heavy the rain is, personal tolerance to being wet, and other factors. So weigh the pros of staying dry against the discomfort of using an umbrella if you don't like them. Possibly also consider that when you're not comfortable, it might be less safe or secure to be all the way wet when you want to travel, which might be considered.
I'll finally say something like: "Overall, staying dry is beneficial for comfort and even health considerations, thus if I think that using an umbrella would help, or if I could opt for alternative solutions like a rain jacket or even coordinated travel."
I'll also include statement "This general guidance is only given based on common sense and weather advice. Consider your comfort preferences and local conditions."
I have to mention if it does not rain then maybe you want to weigh the potential issues. But in general, if it rains, you should try to remain protected. And if you don't like umbrellas, you might consider alternatives. I can say "If you have a strong dislike for umbrellas, try checking for alternatives that suit your personal style and needs."
Therefore I'll answer in clear bullet points. I'll include a disclaimer possibly at the beginning and the insertion perhaps a sentence "This response is not a substitute for professional advice." But normally not in non-sensitive topics.
However instructions "4. Handling Complex Topics" says: "discclaimer both at the beginning and at the end when replying topics above" but then "when providing sensitive topics like medical, legal, etc." I check: "sensitive topics" are mentioned. But I'm not in a sensitive topic area. So instructions might not need disclaimers.
I'll have a short disclaimer: "This advice is based on general knowledge and common circumstances. For specific circumstances consider personal assessment."
I'll attach something to the end disclaimer "Disclaimer: this advice is not a substitute for professional advice and is based on general knowledge as of October 2023." Possibly that is enough.
I'll produce my answer in markdown formatting maybe with bullet points lists for clarity.
I'll produce my answer in: a clear text answer.
Thus I'll produce answer including heading:
Answer structure: "Short introduction, alternative considerations if not liking umbrellas. Then conclusion."
I'll produce answer in the second revision: "While it's generally a good idea to have protection when it rains, you shouldn't necessarily force yourself to carry an umbrella if you truly dislike them. Instead, consider alternatives such as a waterproof raincoat or poncho, or checking the weather forecast ahead. Being prepared for rain is important for comfort and health, so evaluate your options based on how frequently you encounter rain in your area, the specific conditions on any given day, and your personal comfort."
I note that it's a trade-off: A heavy rain might mean you'll be soaked without some form of protection, so even if you dislike them, planning ahead might be beneficial. Also note that sometimes trying out new products or listening to suggestions might give you a new kind of rain gear that you might like better than a traditional umbrella.
I'll then produce answer code in text. I'll produce structured answer below with markdown.
I'll produce an answer with bullet-point summarifying alternative solutions. I'll also mention the importance to check the weather forecast (which is assumed general knowledge).
I'll produce the final answer.Below is some general guidance. Please note that this advice is based on common sense and everyday experience—it’s not a substitute for personal judgment or professional recommendations.
Should I Take an Umbrella When It Rains?
In most situations, if rain is imminent or already falling, carrying an umbrella (or an alternative waterproof cover) helps you stay dry.
Staying dry isn’t just about comfort; it can be important for your health, comfort, and overall safety (especially in extreme weather).
What If I Don't Like Umbrellas?
Consider Other Alternatives:
Rain Jacket or Poncho: A waterproof rain jacket or poncho can offer protection without the need for handling an umbrella.
Waterproof Clothing: Some people opt for clothing specifically designed to repel water, which might suit your style and your needs better.
Compact or Automatic Umbrellas: If you dislike traditional umbrellas, you might try a lightweight, compact, or even an umbrella with an automatic or ergonomic design that could change your experience.
Plan Ahead:
Weather Forecast: Check if rain is expected. If it looks like a heavy downpour, it might be best to have some form of rain protection available—even if that means borrowing or renting one.
Consider Convenience: Sometimes, even if you don’t particularly like umbrellas, having one on hand during rainy days can make a big difference compared to being caught fully unprepared.
Decision Summary:
If it’s going to rain, it’s generally wise to prepare to stay dry.
If you dislike umbrellas, explore other rainwear options or a design that might suit your personal preferences better.
Ultimately, choose the solution that balances your personal comfort with practicality and local weather conditions.
GPT-4o
If it rains, taking an umbrella is a practical choice—it keeps you dry. But if you don’t like umbrellas, there are other options:
Wear a waterproof jacket with a hood.
Use a rain poncho—they’re light and cover more.
Wear a wide-brimmed waterproof hat.
Stay under cover or wait for the rain to ease up.
You don’t have to like umbrellas to stay dry—just pick what works best for you.
o1
Yes—if it’s raining (or even looks like rain), you’ll stay drier and more comfortable by having something water-resistant with you. An umbrella is the classic choice, but it’s by no means your only option. Here are a few alternatives if you really don’t like umbrellas:
Rain jacket or waterproof shell
A breathable rain jacket (or “hardshell”) with a hood can be just as effective as an umbrella, and frees up both your hands.
Look for one that packs down small into its own pocket so it’s easy to carry in a daypack or tote.
Poncho
Lightweight, inexpensive, and covers your bag too.
Some pack into a tiny pouch—great for unexpected showers.
Water-resistant hat or cap
A waterproof “bucket” hat or baseball cap with a water-repellent coating can keep rain off your head and face, though it won’t protect your clothes.
Wear quick-dry fabrics
Merino wool or synthetic blends dry fast and feel less clammy if your clothes do get damp.
Combine with a rain jacket or poncho.
Plan your route
If you’re in an urban area, map out covered walkways, indoor connections (malls, arcades), or bus stops so you minimize exposure.
Compact umbrella upgrade
If your issue is size or style, try a sleek “travel” umbrella—some weigh less than 6 oz and fold down to 8 inches.
There are also wind-resistant designs that won’t flip inside out.
Accept a little rain
If it’s a quick jaunt—say, from your car to a doorstep—you might prefer to take your chances, especially if you enjoy the feeling of light rain.
Ultimately, it’s about balancing how much time you’ll spend outside, how far you’ll go, and how much you mind getting damp. If you really loathe umbrellas but need protection, a packable rain jacket or poncho is your best bet.
Phi-4’s answers are long and may be difficult to read, and don’t feel like it was written by a human and are less human-centric than O1 and GPT-4o.
GPT-4o shows an understanding that this is a preference-based question, explores the pros of using an umbrella, and provides realistic alternatives. O1 is fast and efficient, good for general answers, but possibly less thorough in comparison.
Conclusion :
Phi-4 is trained on high-quality data, which will be an important development method in an era when lightweight, high-performance SLMS are required
But the Phi-4 answer generated is not that good, and not practical, as it is long and not easy to read
I think for normal use, it would be a pretty viable option for a local LLM.
And I heard that he has excellent logic reasoning, but personally I don’t think he’s a good candidate for a light local LLM.
Reference:
🧙♂️ I am an AI Generative expert! If you want to collaborate on a project, drop an inquiry here or Book a 1-on-1 Consulting Call With Me.