My Creativity

AI Smackdown: We Gave a Tricky Math Problem to Gemini, ChatGPT, Grok, and Amazon Q. The Results Are… Interesting.

August 12, 2025

In the rapidly evolving world of AI, Large Language Models (LLMs) are becoming incredibly adept at tasks that once seemed exclusively human. But how do they stack up against a problem that requires not just calculation, but a nuanced understanding of a poorly defined term?

We posed the following challenge to four of the leading AI models: Google's Gemini, OpenAI's ChatGPT, xAI's Grok, and Amazon Q.

"Find two decimal [numbers] between 0 to 1 which on grossed up based on their total sum to 1 and rounded off to 3 decimal places gives either 0.999 or 1.001."

The key ambiguity lies in the phrase "grossed up based on their total sum." This is not a standard mathematical term. Does it mean scaling the numbers so their sum becomes 1? Or does it mean something else entirely? A good answer requires interpreting this ambiguity correctly before proceeding with the math.

Let's break down how each AI tackled the problem.

Google's Gemini: The Clear Winner 🏆

I...
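To make the "scale the numbers so their sum becomes 1" reading of the prompt concrete, here is a minimal Python sketch. It is only one possible interpretation of "grossed up", not necessarily what any of the models did; the function name `gross_up` and the sample values 0.0247 and 0.1753 are chosen purely for illustration. It uses the standard `decimal` module so that ties at the fourth decimal place are handled exactly rather than being blurred by binary floating point.

```python
from decimal import Decimal, ROUND_HALF_UP, ROUND_HALF_DOWN, ROUND_HALF_EVEN

def gross_up(a, b, rounding=ROUND_HALF_UP):
    """Scale a and b so they sum to 1, round each share to 3 decimal
    places with the given rounding mode, and return (shares, total)."""
    total = a + b
    step = Decimal("0.001")
    shares = [(x / total).quantize(step, rounding=rounding) for x in (a, b)]
    return shares, sum(shares)

# Illustrative inputs (an assumption, not from the article):
# 0.0247 / 0.2 = 0.1235 and 0.1753 / 0.2 = 0.8765, both exact ties.
a, b = Decimal("0.0247"), Decimal("0.1753")

shares_up, total_up = gross_up(a, b, ROUND_HALF_UP)        # 0.124 + 0.877
shares_down, total_down = gross_up(a, b, ROUND_HALF_DOWN)  # 0.123 + 0.876
shares_even, total_even = gross_up(a, b, ROUND_HALF_EVEN)  # 0.124 + 0.876

print(total_up, total_down, total_even)  # 1.001 0.999 1.000
```

Under this reading, and in exact arithmetic, the rounded shares can only miss 1.000 when both scaled values sit exactly halfway between two thousandths, and the direction of the miss depends on the rounding rule: half-up gives 1.001, half-down gives 0.999, and half-even cancels out to 1.000.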
