If you only use AI a few times a week, you almost certainly don’t need to pay for any of these. The free tiers from Google, Anthropic, and OpenAI are all genuinely usable for occasional questions, drafts, and summaries. The case for paying $20 a month starts when you use one of these tools nearly every day and the limits start interrupting work you actually need to finish.
Who this is for
This guide is for people who use a general-purpose AI chatbot most days: writers, marketers, analysts, founders, lawyers, consultants, students, and the engineers and PMs who use chat for non-coding work. If your primary use case is software engineering, see our coding-agent guide instead; the answer there isn’t the same. If you need cited research more than you need open chat, jump to Perplexity below.
Our pick: ChatGPT
ChatGPT is still the chatbot we’d recommend to most people, but the reasoning has shifted in 2026. It isn’t that ChatGPT is the best at any single thing. Claude beats it on writing and constraint-following in our blind grading, Gemini beats it on Workspace integration and audio analysis, and Perplexity beats it on citation quality. ChatGPT wins because the Plus tier at $20 is the only $20 subscription that cleared every task in our six-task multimodal bench without being upsold to a higher plan.
Plus has stayed at $20 a month since ChatGPT first went paid in early 2023, and OpenAI keeps adding to it. As of testing, that included GPT-5.5 (which replaced GPT-5.4 as the default on April 23, 2026), Sora video, the Codex coding agent, Agent Mode, Deep Research, Advanced Voice, Canvas, and ChatGPT Images 2.0 with multilingual in-image text. The trade-off is that the free tier got worse: ads now appear under responses on Free and Go in the US, the free model is GPT-5.3 Instant with a hard 10-messages-per-five-hours cap, and most of the features above are paywalled.
The honest cons. In our long, constraint-heavy prompts, ChatGPT dropped explicit requirements more often than Claude did, especially when we layered five or more constraints in a single prompt. Deep Research on Plus is capped at 10 sessions a month, and analysts who run multiple reports a day will hit that ceiling fast. And the new Pro $100 tier, while a real bargain for heavy users (5× Plus limits and the GPT-5.5 Pro model), splits the lineup in a way that makes the consumer pricing harder to reason about than it used to be.
The runner-up: Claude
If your work is mostly text (drafts, briefs, contracts, edits, research synthesis), Claude is the chatbot to pay for. It produces the most natural, least formulaic prose in our blind grading, and it had the highest constraint-pass rate in our test on multi-part prompts. Pro is $20 a month, or $17 on annual, and gives you Sonnet 4.6 as the default plus limited access to Opus 4.7, with a 1M-token context window on both models at no surcharge.
Claude is also the chatbot we’d choose for clients with real privacy needs. On paid tiers, Anthropic doesn’t train on your data by default, and Team and Enterprise are contractually protected. The Free, Pro, and Max consumer tiers require an explicit opt-in for training as of August 2025.
The catches. Claude can’t generate images, which was a task we wanted to do often enough in our test that it mattered. Pro’s exact usage limits aren’t published, and we hit five-hour cooldowns more often on Claude than on ChatGPT during the same workload. And while Claude’s voice mode exists, ChatGPT’s is meaningfully more polished today. If image and voice are core to your daily use, this isn’t the pick.
The Google-stack pick: Gemini
Google AI Pro at $19.99 a month is the right answer if your week lives in Gmail, Docs, Sheets, Slides, and Drive. The plan includes Gemini 3.1 Pro at the 1M-token context window, 5 TB of cloud storage, Gemini Code Assist, and bundled features like Veo 3.1 video and Nano Banana for image generation. The Gemini app is integrated directly into the Workspace apps you already use, which removes a lot of copy-paste friction we saw in the ChatGPT and Claude workflows.
Gemini also won our multimodal subtests for audio and video analysis. It transcribed and gave useful, specific notes on a 90-second video clip and a 12-minute audio recording, where ChatGPT was slower and Claude couldn’t do the audio task at all. The free tier is the most generous in the category and now includes Gemini 3.5 Flash with Google Search, image generation, and Gemini Live voice mode.
What lost points. In our blind grading on open-ended writing, Gemini’s outputs felt the most generic of the three majors, and voice mode was the most robotic. Google moved Gemini app limits to a “compute-based” system in May 2026 that refreshes every five hours up to a weekly cap, which is honest about the underlying mechanics but harder to plan around than a flat message count. And several headline features (Gemini 3 Pro in AI Mode, Project Mariner, Gemini Spark) are still US-only or sit behind the $99.99 AI Ultra tier.
The cited-research pick: Perplexity
Perplexity is a different shape from the other four. It’s built around the answer engine: every reply links to its sources by default, and the UI is closer to a search engine than a chatbot. Pro at $20/month gives you 20 research queries per day, model switching across frontier models, file uploads, $5/month of Sonar API credits for developers, and unlimited basic search.
The case for Perplexity is straightforward. If you mostly use AI to look things up, draft based on real sources, or fact-check your own work, the inline citations save real time. In our 20-question accuracy test, Perplexity had the highest citation quality and the fewest hallucinations of any tool we ran, which is what we expected. The free Comet browser, which Perplexity dropped the paywall on in March 2026, makes it easy to point the agent at the page you’re already reading and ask questions about it.
What it isn’t good at. Open-ended writing, planning, and brainstorming all read flatter than they do on ChatGPT or Claude; the model wants to retrieve when you wanted it to reason. Multi-model orchestration (Model Council, the feature that runs your query against Claude, GPT, and Gemini in parallel) is gated behind the $200 Max tier. And the multimodal features (image, voice, video) are visibly thinner than ChatGPT’s.
The one we’d skip for general use: Grok
We tested Grok at the SuperGrok tier ($30/month) and the X Premium+ bundle ($40/month). Grok has a real advantage in live data from X, and if your daily work involves social-media monitoring or breaking news, that’s a feature the others can’t match. SuperGrok includes unlimited chat, DeepSearch, Big Brain mode, voice, and Grok Imagine for image and video.
Two problems kept Grok out of our recommendation for general use. First, the price-to-quality ratio is the worst in the category: SuperGrok costs 50% more than ChatGPT Plus, Claude Pro, or Google AI Pro, and underperformed all three on the writing and instruction-following tasks in our blind grading. Full Grok 4.3 access is only confirmed at the $300/month SuperGrok Heavy tier; lower tiers get it in staged rollouts and the UI doesn’t tell you which variant answered.
Second, the image-generation safety record. Reuters and other outlets reported on a late-2025-into-2026 wave of Grok Imagine outputs creating non-consensual deepfakes and child sexual abuse material, leading to investigations of xAI in seven countries. xAI has tightened restrictions since, but until the picture is clearer we can’t recommend Grok for any use case that involves generating images of people.
How to choose
The decision tree is shorter than the comparison tables make it look. If you want one $20 subscription that does the most things competently, pick ChatGPT. If your work is writing and document analysis and you care about careful instruction-following, pick Claude. If you live in Google Docs and Gmail, pick Google AI Pro. If you mostly want cited research, pick Perplexity. We wouldn’t pay for more than one of these at a time, and if you’re testing, all four of those have a free tier that’s worth a week before you commit.