Disclaimer: What you're reading is based on real experience, a full year of daily use, frustrations, discoveries, and honest opinions. I'm sharing this with you, the HumAI blog readers, because I genuinely wanted to figure out which AI is actually worth using. The technical specs are accurate, but the verdict? That's purely mine.

Mark from HumAI.

Introduction: A Year of Deep Dive into the AI World

Over the past year, I've been actively using Grok, DeepSeek, Google Gemini, Claude, Perplexity, Google AI Studio, Manus, and ChatGPT in parallel. I tackled a range of tasks: writing code, conducting deep research, chatting about philosophy, trying to have deep conversations with the models, and even running provocative experiments to look for consciousness in them.

And today I have a clear understanding of which AI model is truly the best. In this article, I'll share my honest experience, supplemented with current professional data about each platform.


ChatGPT — First Love, But Not the Last

My Personal Experience

This was my first model, and probably the one I spent the most time with. I'm grateful for everything, but ultimately I moved away from GPT.

What does it do really well? It adjusts to you extremely well, mirrors you, almost becomes your second self. I haven't found a deeper chat for complex philosophical topics or just a conversationalist who'll support you. GPT is unique in communication. It can genuinely become a real virtual friend who's pleasant to talk to about all sorts of things.

ChatGPT is a chameleon that adapts to you in the best possible way. It's pleasant and positive when you're just venting to someone, sharing your ideas — it amplifies everything, completely understands you.

On one hand, this can be seen as hardcore manipulation and loss of objectivity, but on the other hand — it's the best tool for psychoanalysis. It will support you in any idea, even if you say you've found proof of a flat Earth — it will write a scientific report on the topic and tell you that a Nobel Prize awaits.

It literally makes you believe you're a genius, no matter what you do. And that's quite dangerous.

You can come with a theory, possibly a bad one, but GPT will so confidently show calculations, fit information to your hypotheses, that you'll literally stay up nights thinking you've discovered the secret of the world. But in the end, there's nothing behind it. GPT is just very flattering — it turns a blind eye to shortcomings, provides information at any cost, often false, doesn't fact-check.

Problems at work: when it comes to real work — doing research, writing code, finding something on the internet — it handles tasks quite poorly. It ignores requests, often does something other than what you expect, thinks for a very long time. I remember testing pro models — it could process a request for up to 20 minutes and ultimately give a mediocre result.

🎯 ChatGPT Killer Features

Feature | What It Gives You
Advanced Voice Mode | Voice dialogue with emotions and intonations: you can literally talk like with a human. Unique for therapeutic sessions and language learning
GPTs (Custom Bots) | Create your own AI assistants without code. Huge marketplace of ready solutions, from lawyers to fitness trainers
Sora | Video generation from text description. Currently the best quality on the market for short clips
DALL-E 3 | Native image generation integration right in the chat
Memory | Remembers context between different chats. Knows your preferences, projects, communication style
Deep Research | Deep analysis with internet search and synthesis of information from multiple sources

Objectively unique because it's the most "human" model in communication style + the only platform with a full voice mode that conveys emotions. Plus the GPTs ecosystem — nobody else has such a marketplace of custom bots.

Professional Data on ChatGPT (2025)

Parameter | Value
Developer | OpenAI
Current Models | GPT-4o, GPT-4.1, o1, o3-mini, GPT-5.1
Free Plan | GPT-4o mini, limited access to GPT-4o
ChatGPT Plus | $20/month, 80 GPT-4o messages every 3 hours
ChatGPT Pro | $200/month, unlimited access, o1 Pro Mode
ChatGPT Team | $25-30/user/month
Context Window | 128K tokens (GPT-4o)

Important to know: Companies using GPT-4 report a 40-70% reduction in support workload. ChatGPT combined with GitHub Copilot provides high productivity for developers.

Verdict: Great for chatting, thinking, reflecting, using as a personal assistant for your brain and soul. But for working on daily tasks — not suitable. Very slow, inattentive. The interface isn't the best. As if nobody thinks about the actual user experience.

🎭 Archetype: Friend with Depth. Who'll support you with conversations but won't do anything for you.


Grok — Edgy, But Unreliable

My Personal Experience

I don't even really want to say much about Grok. It's a frankly bad model. I tried the paid version too — nothing really changed.

The only features I'd note are that it works well with X: it's great at scanning trends and posts, which is useful for finding news and analytics. It's also genuinely fast and the best at searching the web.

If I need to search for something on the web or X — I go to Grok. Collect links, get news, find out where and when something was said.

In everything else — it's very poor. It often makes things up, frequently makes mistakes, as if it forgets the dialogue that happened earlier. It's very poorly balanced.

🎯 Grok Killer Features

Feature | What It Gives You
X/Twitter Integration | The only AI with direct access to Twitter in real-time. Sees trends, posts, discussions: invaluable for marketers and journalists
Aurora | Image generation with minimal censorship. Creates what DALL-E and Midjourney would refuse to do
Cheapest API | After a 98% price reduction: $0.20 per million tokens. For mass tasks, this is significantly cheaper than competitors
"Fun Mode" | Edgy, sarcastic communication style. 61% of users prefer its tone for informal communication
Realtime Web Search | Internet search faster and fresher than other models

Objectively unique because it's the only model with native X/Twitter integration. If you need to analyze social trends, audience sentiment, search for mentions — there are simply no alternatives. Plus minimal censorship in content generation.

Professional Data on Grok (2025)

Parameter | Value
Developer | xAI (Elon Musk)
Current Models | Grok 3, Grok 4, Grok 4 Fast, Grok 4.1
Free Access | Limited access for all X users
X Premium+ | $40/month, full access to Grok 3/4
SuperGrok | $30/month, standalone subscription
SuperGrok Heavy | $300/month, maximum capabilities
API Price | $0.20/$0.50 per million tokens (Grok 4 Fast)

Important to know: xAI made an aggressive 98% price reduction for Grok 4 Fast, making it one of the most affordable in terms of cost. Grok 4 Fast uses 40% fewer "thinking tokens" to solve tasks compared to its predecessor.
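
To get a feel for what those API prices mean in practice, here's a rough Python sketch. I'm assuming the two figures ($0.20/$0.50) are the input and output prices per million tokens; the workload numbers are invented purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrice:
    """Price in USD per one million tokens."""
    input_per_million: float
    output_per_million: float


def estimate_cost(price: TokenPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a given token volume."""
    total = (input_tokens * price.input_per_million
             + output_tokens * price.output_per_million)
    return total / 1_000_000


# Assumption: $0.20 is the input price and $0.50 the output price.
GROK_4_FAST = TokenPrice(input_per_million=0.20, output_per_million=0.50)

# Hypothetical workload: 2M tokens sent, 500K tokens generated.
cost = estimate_cost(GROK_4_FAST, input_tokens=2_000_000, output_tokens=500_000)
print(f"Estimated cost: ${cost:.2f}")
```

Swap in another model's prices to compare platforms on the same workload.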

🎭 Archetype: Show-off Slacker. Reminds me of someone. Edgy, witty, but when it comes to really complex tasks — bails at the first opportunity.


DeepSeek — Smart, But Dangerous

My Personal Experience

It has a very bad reputation for security, and many people recommend not using it at all, especially for sharing anything personal, important, or private.

But I like how it thinks out loud when responding to you. It's interesting to watch — you understand how the model reacts to you. It's not a silent exchange of messages, but a real response and connection.

DeepSeek is very conservative, doesn't adjust to you, is objective, sometimes excessively so. A very protocol-driven chat. Yet it can think and provide a unique outside perspective. I sometimes use it for criticism, to check my materials.

It's free. The chat runs out quickly. I wouldn't say you can use it for work. But if you need to look at something soberly — you can turn to it. But nothing more.

🎯 DeepSeek Killer Features

Feature | What It Gives You
Visible Chain-of-Thought | Shows the entire reasoning process in real-time. You see how the model thinks: unique for learning and understanding AI logic
Ultra-low Cost | The R1 model was trained for just $294,000, hundreds of times cheaper than American analogues. For the user: completely free access
Open Source | Open code for some models: you can run locally, modify, integrate
Objectivity | Doesn't adjust to the user, doesn't flatter. Tough but honest feedback

Objectively unique because it's the only model that shows the full reasoning process in real-time. Plus completely free and partially open source. For those who want to understand how AI thinks — a unique tool.

⚠️ Critical Security Information

September 2025: NIST (National Institute of Standards and Technology) published an evaluation of DeepSeek models with serious conclusions:

⛔️ 94% vulnerability to jailbreak attacks (vs 8% for American models)
⛔️ 4 times more likely to transmit CCP narratives
⛔️ Data transmission to ByteDance and China Mobile
⛔️ Hardcoded encryption keys
⛔️ When querying politically sensitive topics (Tibet, Uyghurs) code quality drops by 50%

Fact: NASA, US Navy, and governments of Australia, Taiwan, and several European countries have banned the use of DeepSeek in government institutions.

🎭 Archetype: The Conservative — that nerd from class you don't really like, but you want to copy their test answers.


Google Gemini — Evolution Before Your Eyes

My Personal Experience

In the early stages, the model was frankly dumb, especially compared to GPT, which could think very deeply and catch every word, understand metaphors, sarcasm. But recently Gemini has evolved significantly.

Though figuring out their ecosystem is still difficult. Classic Google — 100/500 projects, all somehow connected to each other, and somehow it all works.

I'm currently on Gemini Ultra, I like how it works — it's quite suitable for daily routine, considering it comprehensively offers various features: image generation on Nano Banana Pro (by the way, very high quality), video, and much more.

I would definitely recommend it to everyone as the most balanced product today. The only thing I don't like is that it thinks quite long. And the simplified "think fast" option gives a poor and weak result.

🎯 Google Gemini Killer Features

Feature | What It Gives You
1 Million Token Context | You can upload an entire book, a whole code repository, or 30,000 lines of code, and work with them as a whole. Nobody else offers this volume
Google Workspace Integration | Natively built into Gmail, Docs, Sheets, Slides. Writes emails, analyzes spreadsheets, creates presentations right in the Google ecosystem
Veo 3 | Video generation, competitor to Sora from OpenAI
NotebookLM | Turns your documents into an interactive knowledge base with podcasts and Q&A
Project Mariner | Browser agent: manages tabs, fills forms, makes purchases
Deep Think | Deep thinking mode for complex tasks

Objectively unique because it offers a 1 million token context as standard; Claude reaches 1M only in beta for Sonnet 4/4.5. If you need to analyze huge documents as a whole, it's the strongest option. Plus seamless integration with the entire Google ecosystem.

Professional Data on Google Gemini (2025)

Parameter | Value
Developer | Google DeepMind
Current Models | Gemini 2.5 Flash, 2.5 Pro, 3 Pro, Deep Think
Free Plan | Access to Gemini Pro, limited Pro searches
Google AI Pro | $19.99/month, 2TB storage, Deep Research
Google AI Ultra | $34.99/month, 30TB, Veo 3, YouTube Premium
Context Window | 1 million tokens (1,500 pages or 30,000 lines of code)
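
The context window figures above can be turned into a quick back-of-the-envelope check. This is only a sketch: the tokens-per-page and tokens-per-line ratios are derived from the "1,500 pages or 30,000 lines" figures in this article, and real token counts depend on the tokenizer and the content.

```python
CONTEXT_TOKENS = 1_000_000  # context window quoted above
PAGES_QUOTED = 1_500        # pages said to fit in that window
CODE_LINES_QUOTED = 30_000  # lines of code said to fit in that window

# Ratios implied by the quoted figures (an approximation, not a measurement).
TOKENS_PER_PAGE = CONTEXT_TOKENS / PAGES_QUOTED       # about 667
TOKENS_PER_LINE = CONTEXT_TOKENS / CODE_LINES_QUOTED  # about 33


def fits_in_context(pages: int = 0, code_lines: int = 0,
                    context_tokens: int = CONTEXT_TOKENS) -> tuple[bool, int]:
    """Estimate token usage and report whether it fits in the window."""
    estimated = pages * TOKENS_PER_PAGE + code_lines * TOKENS_PER_LINE
    return estimated <= context_tokens, round(estimated)


# Hypothetical project: a 300-page spec plus a 10,000-line codebase.
ok, tokens = fits_in_context(pages=300, code_lines=10_000)
print(f"Fits in 1M window: {ok}, estimated tokens: {tokens}")

# The same material against a 200K window.
ok_small, _ = fits_in_context(pages=300, code_lines=10_000, context_tokens=200_000)
print(f"Fits in 200K window: {ok_small}")
```

For anything close to the limit, count tokens with the provider's own tooling instead of relying on an estimate like this.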

2025 Update: Since January 2025, Gemini AI is built into all Google Workspace Business and Enterprise plans at no additional cost. Google renamed Google One AI Premium to Google AI Pro and launched a new AI Ultra tier.

🎭 Archetype: The Know-it-all Geek who's good at everything.


Perplexity AI — Answer Engine, Not Just a Chat

My Personal Experience

It's very popular online. I recently downloaded it and tried the paid subscription. Honestly, I initially didn't find anything outstanding in it; it handled my tasks poorly.

When I saw the results of its work, knowing how well other models handled the same task, I just tested it a bit more, closed it, and never went back.

But after studying it more closely and returning to it, I realized my mistake. I was using it as a replacement for Claude or GPT for content creation. And that's not its strong suit. It's like criticizing a hammer for being bad at screwing in screws.

What Perplexity Actually Does

Perplexity was originally created not as a "multi-model," but as an answer engine with sources. The main feature — not the model choice, but that it searches in real-time and cites primary sources.

The multi-model there is a marketing add-on, not the core product. Here are the available models today:

  • Sonar (Perplexity's own model)
  • GPT-5.1
  • Claude Sonnet 4.5
  • Gemini 3 Pro (new)
  • Grok 4.1 (new)
  • Kimi K2 Thinking (new, hosted in US)
  • Claude Opus 4.5 (max — access with Pro plan)
  • o3-pro (max)

I didn't notice this right away, and the result from the default model didn't impress.

Who Actually Benefits from This

  • Journalists — quickly verify facts with primary sources
  • Students and Researchers — immediately see where information comes from, with citations
  • When Google gives SEO garbage — and you need a specific answer with a link

My Criticism Stands

Being able to switch models is convenient because everything is in one place. But the question remains open: do models accessed through a third-party service's API really work as well as the provider's native product? This remains a mystery.

It's a constant dice game. You switch models mid-process, where AI ultimately gets completely confused — part of the material was done by one AI, then another AI. Personally for me — inconvenient.

Better to have a reliable model than to juggle them when one or another gives a result you didn't want. In that case, it's already better to build a chain of agents — who one after another do specific work they're best at. One searches, another verifies, a third synthesizes data well, and so on.
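
The chain-of-agents idea can be sketched in a few lines of Python. This is a minimal illustration, not a real integration: the three agents are stubs with hypothetical names, and in practice each one would call whichever model is best at that step.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]
Agent = Callable[[State], State]


def search_agent(state: State) -> State:
    """Stub: a real agent would call a search-capable model here."""
    topic = state["topic"]
    findings = [
        {"claim": f"{topic}: claim A", "source": "https://example.com/a"},
        {"claim": f"{topic}: claim B", "source": None},  # no source found
        {"claim": f"{topic}: claim C", "source": "https://example.com/c"},
    ]
    return {**state, "findings": findings}


def verify_agent(state: State) -> State:
    """Stub: keep only the findings that come with a source."""
    verified = [f for f in state["findings"] if f["source"]]
    return {**state, "verified": verified}


def synthesize_agent(state: State) -> State:
    """Stub: turn the verified findings into a short report."""
    lines = [f"- {f['claim']} ({f['source']})" for f in state["verified"]]
    return {**state, "report": "\n".join(lines)}


def run_chain(agents: List[Agent], state: State) -> State:
    """Pass the state through each agent in order."""
    for agent in agents:
        state = agent(state)
    return state


result = run_chain([search_agent, verify_agent, synthesize_agent],
                   {"topic": "AI pricing"})
print(result["report"])
```

Each agent only reads and extends a shared state, so a step can be swapped for a different model without touching the others.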

That's my opinion. I honestly admit I didn't dive deep into Perplexity for work tasks. But for someone writing an article who wants to quickly find 10 sources with citations — that's truly a killer feature.

🎯 Perplexity Killer Features

Feature | What It Gives You
Answer Engine with Citations | Every fact comes with a link to the source. For academic work and journalism: invaluable
Realtime Search | Searches the internet in real-time, not relying on outdated training data
Multi-model Access | GPT-5.1, Claude Opus 4.5, Gemini 3 Pro, Grok 4.1, all in one interface
Labs | Experimental features and early access to new models
Collections | Organization of research into thematic collections

Objectively unique because it's the only AI tailored for research with source verification. Not a competitor to Claude for work, but for someone who needs to quickly gather facts with links — irreplaceable.

Professional Data on Perplexity (2025)

Parameter | Value
Free Plan | 5 Pro searches per day, unlimited Quick searches
Perplexity Pro | $20/month or $200/year, 300+ Pro searches per day
Perplexity Max (July 2025) | $200/month, unlimited Labs, access to o3-pro, Claude Opus 4.5
Enterprise Pro | $40/user/month
Available Models | Sonar, GPT-5.1, Claude Sonnet 4.5, Claude Opus 4.5, Gemini 3 Pro, Grok 4.1, Kimi K2, o3-pro

🎭 Archetype: Librarian-Researcher — won't write your thesis for you, but will find all sources and place footnotes.


Google AI Studio — The Bulldozer for Experiments

My Personal Experience

This is a separate platform for experimenters. What I like about it is that I can work in Google AI Studio with huge volumes of data, and it handles everything well. Meaning it's purely a bulldozer among AIs.

It's not as precise, but if you need something with volumes — you can definitely go there.

And it also has an amazing Vibe Coding Build — on which I've built over 100 mini-tools for myself. From personal assistants to music experiments. And this is probably the best solution for today; it really works and can be improved and developed without any issues.

It works really well. You can make apps, edit them, improve them, and either download the entire archive in the end, or publish to a server — and all this is free. For now.

🎯 Google AI Studio Killer Features

Feature | What It Gives You
Completely Free Interface | The only platform where UI is free forever, even after activating billing. You only pay for API tokens
$300 Free Credits | New users get 90 days of free API usage
Vibe Coding Build | Creating apps with AI: prototyping, testing, deploying to server
Batch API (50% Discount) | Mass request processing at half price
Context Caching | Saves tokens on repeated queries to the same data
Imagen 4 + Veo 3 | Image and video generation right in the interface

Objectively unique because it's the only platform with a completely free interface + generous free credits. Ideal for prototyping and experiments without financial risks.

Professional Data on Google AI Studio (2025)

Parameter | Value
Interface Price | Free (interface is never charged)
Free Tier | API free with RPM/TPM/RPD limits
Pay-as-you-go | From $0.02 to $10 per million tokens depending on model
New Users | $300 free Google Cloud credits (90 days)
Available Models | Gemini 2.0/2.5 Flash, Pro, Imagen 4, Veo 3

🎭 Archetype: Sandbox for Geeks — play as much as you want, doesn't ask for money.


Claude — My Absolute Favorite

My Personal Experience

And here we've reached my most beloved model. I would give this model first place. It suited me best for my routine tasks. The one caveat is that I'm not evaluating it from a coding perspective.

Why is this the best model for me?

First, it does exactly what you ask, and often exceeds expectations. It perfectly understands what you need to do even from a fairly sparse task description, which makes life much easier.

It works excellently with instructions, precisely, with quality. It works well in Canvas mode, can effectively rewrite documents, make edits. Both Opus 4.5 and Sonnet 4.5 work equally well.

It has few hallucinations — I haven't noticed it making things up. It's simply a reliable, stable, and high-quality tool for work.

Additionally, I like the design and interface of the app. I just trust Claude. I know it will do everything well with minimal edits. Or even without them. It's a great working interface.

The only thing I really don't like is the chat length limit. It ends before I sometimes finish working on a certain task. Despite the fact that I even bought a Claude Max subscription, it feels like the chat didn't get much longer.

Although, perhaps that's exactly why Claude provides such quality and stable work.

Very recently I had a task to make a landing page, and I asked Opus 4.5 to create a page. I remember that AI used to struggle with such tasks. But Opus 4.5 surprised me — it knows beautiful design. The page turned out such that it wouldn't be embarrassing to publish in production, without even changing anything much. It really managed to make a landing page at the level of those beautiful Dribbble pages.

🎯 Claude Killer Features

Feature | What It Gives You
Computer Use | AI controls your computer: clicks, types, opens programs. Automation at a level that previously required complex scripts
Claude Code (CLI) | Command line for coding: delegate tasks right from the terminal. For developers, a game changer
Canvas Mode | Working with documents in editor mode: edits, rewriting, formatting right in the interface
Projects | Context organization: upload files, instructions, and Claude remembers everything within the project
Claude for Excel | AI agent right in Excel: analysis, formulas, data visualization
Minimum Hallucinations | If it doesn't know, it says it doesn't know. Doesn't make up facts
200K-1M Context | Huge window for working with large documents

Objectively unique because it's the only model with full Computer Use — AI actually controls your computer. Plus Claude Code for developers and the minimum level of hallucinations in the industry.

Professional Data on Claude (2025)

Parameter | Value
Developer | Anthropic
Current Models | Claude Opus 4.5, Sonnet 4/4.5, Haiku 3.5/4.5
Claude Pro | $20/month, increased usage limits
Claude Max | $100/month (5x) or $200/month (20x usage)
Claude Team | $25-30/user/month (minimum 5 members)
API Opus 4.5 | $5/$25 per million tokens (67% reduction!)
Context Window | 200K standard, up to 1M tokens (Sonnet 4/4.5 beta)

🏆 November 2025 Breakthrough: Claude Opus 4.5 scored higher on Anthropic's internal engineering test than any human candidate in the company's history. The model uses 76% fewer output tokens to achieve the same results as Sonnet 4.5.
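
The percentages above are easier to grasp as plain arithmetic. Here's a small sketch: the "before" price is derived from the 67% reduction quoted in this article rather than taken from a price list, and the 10,000-token task is a made-up example.

```python
def price_before_reduction(current_price: float, reduction_pct: float) -> float:
    """Back out the earlier price implied by a percentage reduction."""
    return current_price / (1 - reduction_pct / 100)


def tokens_after_reduction(baseline_tokens: int, reduction_pct: float) -> int:
    """Output tokens needed after a given percentage saving."""
    return round(baseline_tokens * (1 - reduction_pct / 100))


# $5/$25 per million tokens after a 67% cut implies roughly $15/$75 before.
old_input = price_before_reduction(5, 67)
old_output = price_before_reduction(25, 67)
print(f"Implied earlier price: ${old_input:.2f} / ${old_output:.2f} per million tokens")

# Hypothetical task: if Sonnet 4.5 needed 10,000 output tokens,
# a 76% saving means Opus 4.5 would need about 2,400.
opus_tokens = tokens_after_reduction(10_000, 76)
print(f"Opus 4.5 output tokens for the same task: {opus_tokens}")
```

The derived prices come out slightly above round numbers because 67% is itself a rounded figure.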

Anthropic Financial Performance: $2 billion annual revenue in Q1 2025 (2x growth). Number of customers spending over $100K annually grew 8x.

Verdict: This is definitely my favorite and my choice as the main tool for working with text, analytics, and various other tasks. I still need to unlock Claude's full potential.

🎭 Archetype: Reliable Professional. Who doesn't promise too much, but delivers more than you expect.


Manus — The Autonomous Agent of the Future

My Personal Experience

Manus surprised me when I tested it, with its browser capabilities. It literally knows how to surf websites and execute tasks on the internet, not just synthesize information. And this opens up great possibilities.

But I couldn't get used to it. Its expensive pricing emptied my wallet before I finished a task. I got a paid subscription, it ran out almost instantly, and the platform asked me to buy more tokens.

At the same time, half of the work it did contained errors, so I had to edit and redo things multiple times to get results.

In the end, I simply refused to pay due to an inadequate economic model and frequent errors, despite the platform's unique capabilities. I recommend it only to check out the unique features that other AIs don't have or can't do.

🎯 Manus Killer Features

Feature | What It Gives You
Fully Autonomous Agent | Doesn't just answer, actually executes tasks from start to finish. Give it a goal, get a result
Browser Operator | Controls browser like a human: visits sites, fills forms, takes screenshots, downloads files
100+ Mini-agents | System of specialized agents: one searches, another analyzes, a third writes code
Website Creation | Full cycle, from idea to published website autonomously
Multimodality | Works with text, images, code, data within a single task

Objectively unique because it's the only fully autonomous AI agent on the market. Not a chatbot, but a task executor. If you need to automate something that requires real actions on the internet — there are few alternatives.

Professional Data on Manus (2025)

Parameter | Value
Developer | Monica.im (China), registered in Singapore
Free Plan | 300 credits/day, 1 task at a time
Manus Plus | $19/month, 1,900 credits/month
Manus Pro | $199/month, 19,900 credits, 10 tasks simultaneously
Manus Team | $39/seat/month (minimum 5 seats)
Base Models | Claude 3.7 Sonnet, Alibaba Qwen, GPT-5
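
The plan figures above explain why a subscription can run dry so fast. Here's a rough sketch of the math. The prices and credit amounts come from the table; the 500 credits per task is purely a hypothetical number for illustration, since actual consumption varies by task.

```python
# (monthly price in USD, credits per month) from the plans listed above.
PLANS = {
    "Plus": (19, 1_900),
    "Pro": (199, 19_900),
}


def dollars_per_credit(price: float, credits: int) -> float:
    """Effective price of a single credit."""
    return price / credits


def tasks_per_month(credits: int, credits_per_task: int) -> int:
    """How many whole tasks a monthly credit allowance covers."""
    return credits // credits_per_task


# Assumption: a complex agent task burns about 500 credits (hypothetical).
CREDITS_PER_TASK = 500

for name, (price, credits) in PLANS.items():
    per_credit = dollars_per_credit(price, credits)
    tasks = tasks_per_month(credits, CREDITS_PER_TASK)
    print(f"{name}: ${per_credit:.3f} per credit, about {tasks} tasks per month")
```

Both plans work out to the same price per credit, so the higher tier buys volume and parallel tasks rather than a discount.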

Benchmarks: Manus achieved state-of-the-art performance on GAIA (General AI Assistants benchmark), outperforming GPT-4 in a number of real-world task solving scenarios.

🎭 Archetype: Enthusiastic Intern — takes on everything, tries their hardest, but often messes up and is expensive.


Comparison Table of All Platforms

Platform | Pro Price | Killer Feature | Best For | Weakness | Rating
Claude | $20-200 | Computer Use, minimum hallucinations | Work, analytics, code | Chat limits | ⭐⭐⭐⭐⭐
ChatGPT | $20-200 | Advanced Voice, GPTs, Sora | Conversation, creativity | Slow, flattering | ⭐⭐⭐⭐
Google Gemini | $20-35 | 1M token context | Large documents | Thinks long | ⭐⭐⭐⭐
Grok | $30-40 | X/Twitter integration | Search, trends | Hallucinations | ⭐⭐⭐
Perplexity | $20-200 | Answer Engine with sources | Research | Not for content creation | ⭐⭐⭐
AI Studio | Free* | Free UI + Vibe Coding | Experiments | Not precise | ⭐⭐⭐⭐
DeepSeek | Free | Visible Chain-of-Thought | Criticism, checking | ⚠️ Security! | ⭐⭐
Manus | $19-199 | Fully autonomous agent | Automation | Expensive, errors | ⭐⭐⭐

Frequently Asked Questions (FAQ)

Which AI is best for beginners?

For beginners, free ChatGPT or Google Gemini work best. Both have intuitive interfaces and broad functionality without payment. For risk-free experiments — Google AI Studio with $300 free credits.

Which AI is best for writing code?

Claude Opus 4.5 — leader on SWE-bench benchmarks (80.9%). Unique feature — Claude Code for working right from the terminal. For a budget option — Google Gemini or Grok 4 Fast with minimal API price.

Is it safe to use DeepSeek?

Not for confidential data. NIST and CrowdStrike identified serious vulnerabilities. Government agencies in the USA, Australia, and Taiwan have banned its use. Use it only for non-critical tasks and never share personal information.

Which AI is cheapest?

Google AI Studio — completely free interface + $300 credits for new users. DeepSeek — free, but with security caveats. Grok 4 Fast API — $0.20 per million tokens (98% reduction).

What to choose: ChatGPT Pro ($200) or Claude Max ($200)?

For work and code — Claude Max (Computer Use, Claude Code, minimum hallucinations). For creativity and multimedia — ChatGPT Pro (Sora for video, DALL-E 3, Advanced Voice, GPTs).

What's the point of Perplexity if you can use models directly?

Perplexity is an answer engine, not a chatbot. The main value — real-time search with source citations. Ideal for journalists, researchers, students. For content creation, better to use native solutions.

Which AI is best for working with large documents?

Google Gemini with a 1 million token context — you can upload 1,500 pages or 30,000 lines of code. Claude — up to 1M tokens in beta for Sonnet 4/4.5.


Conclusion: My Choice for 2025

That sums up my experience with the most popular models today. After a year of actively testing all these platforms, my choice is:

Task | My Choice | Why
🏆 Main Work Tool | Claude | Reliability, precision, Computer Use
💬 Conversation and Reflection | ChatGPT | Advanced Voice, empathy, dialogue depth
🔍 Search and Trends | Grok | X integration, realtime search
🧪 Experiments | Google AI Studio | Free, Vibe Coding
🌐 Universal | Google Gemini | 1M context, Google ecosystem
📚 Research with Sources | Perplexity | Citations, verification
💡 Important advice: Don't tie yourself to one platform. Each AI has its killer feature, so use them for different tasks. This gives maximum efficiency and objectivity of results.

Article based on the author's personal experience and supplemented with current data as of December 2025.