Three-Way Thematic Analysis Comparison: Human vs ChatGPT vs Claude

Dataset Overview

Distribution of tweets by source group (Emergency Response vs Community) across each classifier. The human dataset provides user-type classifications; both AI classifiers applied thematic coding across the same source groups.

Cross-Tabulation: ChatGPT Themes × Claude Themes

Heatmap of 615 tweets classified by both systems. Each cell shows how many tweets received a given CGPT theme (rows) and Claude theme (columns). Darker cells indicate stronger co-assignment. Hover a cell for details.

Sankey Flow: ChatGPT → Claude Theme Assignments

Flow diagram showing how tweets classified into ChatGPT themes (left) were re-classified by Claude (right), for the 615 matched tweets. Width of each band is proportional to tweet count.

Convergence and Divergence Network

Force-directed network where ChatGPT themes (teal) and Claude themes (purple) are nodes. Edges connect themes that share tweets (line thickness = tweet count). Tightly connected pairs signal conceptual convergence; isolated nodes signal unique thematic capture.

Theme Frequency by Group and Classifier

Comparison of theme frequency distributions across Emergency Response and Community tweets for both AI classifiers. Toggle between classifiers and groups using the tabs.

ChatGPT — All

Claude — All

ChatGPT — EM

Claude — EM

ChatGPT — Community

Claude — Community

Sentiment Profiles Across Classifiers and Groups

Side-by-side sentiment distributions reveal how the two classifiers conceptualise emotional tone. Claude uses a 6-category scheme with finer granularity; ChatGPT uses a 5-category scheme. Human group (EM vs Community) is shown for each.

Topical Category Comparison

Both classifiers also assigned topical categories (distinct from narrative themes). ChatGPT used 7 categories; Claude used 10. Parallel bar charts show coverage emphasis differences.

Human Classification: User-Type Distribution

Human coders classified tweet authors by account type (organization, individual, feedbased). This maps to the source-group distinction both AI classifiers used. The EM/Community split drives thematic variance in both AI outputs.