I asked to explain a funny tweet to ChatGPT, Gemini,Claude,Perplexity Pro,Grok and Mistral. See who got the Joke.
Funny thing is that all major GenAI apps answered and understood the context
This week, I came across a note on Substack that got me thinking—what do these Chatbots can make out of it? Can they truly understand what I or the people are thinking? Do they capture context accurately? I tested multiple AI models with the same prompt and documented the results in screenshots.
Even though it was originally a Substack note, I framed the question as a tweet to maintain consistency across different AI models:
What does this mean?
"Does anyone else say please and thank you to ChatGPT? You know… just in case."
The Best AI Chatbots for Text Generation
Before diving into the results, let’s quickly go over the top AI chatbots. There are countless AI tools, but these are the best in class—efficient, reliable, and widely used. I’ll also mention a few others for awareness, though I personally avoid them due to privacy concerns raised by the AI community. Feel free to explore them, but be mindful of what information you share.
My Ranking for Chatbots Apps(Text Generation):
Claude
ChatGPT
Perplexity
Mistral
Gemini
Grok 2
Qwen
Deepseek
My Ranking for AI Image Generation:
Grok 2
ChatGPT
Mistral
Gemini 2
Perplexity
Qwen (good for video and image generation)
Comparing AI Responses
Have you ever tested different AI chatbots side by side to compare their performance? I do this regularly to see how well they handle different tasks. As of now, the only AI subscription I pay for is Perplexity Pro—the rest I use for free,for now. I know people paying for subsription for more than half of them. Each tool has its strengths and weaknesses, making the choice highly dependent on specific use cases.
So far, Claude and ChatGPT have been the most reliable in terms of response quality. I'm seriously considering upgrading to Claude Pro because of its impressive outputs. For coding and text-based tasks, Claude has been the best, but this could change as new updates roll out frequently. After discussing with other AI users, I found a general consensus—Claude and ChatGPT currently lead the pack.
How Do These Chatbots Actually Work?
At their core, GenAI chatbots function as next-token predictors. They generate text by predicting the most probable next token based on the vast amount of data they’ve been trained on. While the concept is straightforward, their ability to understand context and adapt to user input through advanced training techniques makes them highly effective.
The Experiment: Understanding Context
What fascinated me most was how differently these AI models interpreted the original Substack note(which I have told them its a tweet). Some chatbots captured the context and meaning right away, while others gave generic responses that required follow-ups.
This experiment reinforced what I already suspected—AI effectiveness varies by model, task, and context. If you haven’t compared AI tools yet, I highly recommend testing them yourself. The results might surprise you!
Let me know if you try this and how is your experience.
If you want me to cover any specific topic, feel free to reach out.