Deconstructing Superman: A Data-Driven Screenplay Analysis
30 analytics from James Gunn's 132-page screenplay — parsed line by line, word by word. What the data reveals about character, pacing, and the writer's craft.
There's a version of Superman that lives in data. Not the mythology, not the iconography — the actual words on the page. James Gunn's 132-page screenplay for Superman is a document you can parse, count, weigh, and measure. And when you do, patterns emerge that no casual read would catch.
We ran the full script through our analytics pipeline: 21,550 words, 243 scenes, 62 speaking roles. Here's what the numbers say.
The DNA of the Script
The first thing the data tells you: this is not a talky movie. Only 29.8% of the screenplay's words are dialogue — compared to the 45-55% you'd expect from a typical drama. The remaining 70.1% is action and description. Gunn lets the camera do the talking.
And here's the kicker: Lois Lane speaks more than Superman. She owns 16.8% of all dialogue. Clark and Superman combined (they're listed separately in the script, reflecting the dual-identity structure) account for 20.4%.
How Characters Sound
Every character has a linguistic fingerprint. Gunn writes Superman terse, almost monosyllabic — just 4.6 words per line. He's a man of action, not speeches. Meanwhile, Mr. Terrific has the richest vocabulary (67.1% unique words), fitting for the "third smartest man alive." Lois, who talks the most, naturally repeats more (50.1%).
| Character | Avg Words/Line | Vocab Richness | Questions Asked |
|---|---|---|---|
| Lois Lane | 4.3 | 50.1% | 22.2% |
| Lex Luthor | 4.5 | 56.7% | 8.0% |
| Superman | 4.6 | 52.0% | 10.8% |
| Clark Kent | 4.3 | 58.1% | 8.7% |
| Jimmy Olsen | 4.4 | 63.3% | 16.1% |
| Mr. Terrific | 4.5 | 67.1% | 13.9% |
| Guy Gardner | 4.6 | 71.4% | 21.8% |
| Robot 4 | 4.7 | 70.0% | 4.3% |
| Engineer | 4.7 | 76.8% | 10.0% |
| Eve | 4.7 | 69.7% | 0.0% |
Lois asks the most questions — 22.2% of her lines end in a question mark. She's a journalist. Lex barely questions anything (8.0%). He already has all the answers.
First and Last Lines
A character's bookend lines often reveal their arc. Superman opens with pain and closes with peace. Lex's last line is literally cut off mid-sentence.
Longest Single Speeches
Guy Gardner's 59-word rant is the longest unbroken speech. Superman's longest is 55 words — about his parents' message from Krypton. The most emotional moments get the most words.
Pacing and Structure
The script accelerates. The final quarter (pages 100-132) packs in 94 scenes — nearly double the first quarter's 45. Average scene length drops from 120 words to just 54. The finale is a machine gun of quick cuts.
| Quarter | Pages | Scenes | Avg Words/Scene |
|---|---|---|---|
| Act 1 | 1–33 | 45 | 120 |
| Act 2A | 34–66 | 50 | 118 |
| Act 2B | 67–99 | 54 | 96 |
| Act 3 | 100–132 | 94 | 54 |
Scenes in the final quarter — nearly double any other act. The script doesn't build to a climax. It detonates.
The rhythm alternates between talky and quiet — pages where dialogue drops to 0% mark pure visual action sequences. Classic Gunn.
Emotional Shape
The screenplay's emotional temperature follows a classic "into the abyss and back" arc. It opens warm, dips into darkness around pages 86-90 and 106-110 (the darkest moments), then climbs back to hope by the finale.
Day vs. Night — Literally
Gunn uses lighting as structure. Act 2B flips to 38 night scenes vs. 12 day — the "darkest before dawn" moment, literally. Act 3 snaps back to almost entirely daylight: 88 day scenes, 0 night.
| Quarter | Day Scenes | Night Scenes |
|---|---|---|
| Q1 (p.1–33) | 38 | 4 |
| Q2 (p.34–66) | 33 | 8 |
| Q3 (p.67–99) | 12 | 38 |
| Q4 (p.100–132) | 88 | 0 |
Punctuation as Emotion
Perfectly balanced urgency and inquiry. The 50 ellipses — those trailing-off moments of uncertainty — pulse between the exclamation points. The script breathes between urgency and hesitation.
The World of the Script
The Comms Hub dominates with 40 scenes — the nerve center of the story. Despite being about a hero who flies, 62.6% of scenes are interiors (152 INT vs. 91 EXT). Most of the drama happens indoors.
Character Connections
Who shares scenes with whom — and who never meets.
| Pair | Shared Scenes |
|---|---|
| Lois + Mr. Terrific | 11 |
| Lex + Superman | 10 |
| Engineer + Lex | 6 |
| Jimmy + Lois | 6 |
| Superman + Mr. Terrific | 6 |
| Clark + Lois | 5 |
| Robot 4 + Superman | 5 |
| Guy + Superman | 4 |
Surprisingly, Lois and Lex never share a scene. Neither do Lex and Clark, or Lex and Jimmy. Gunn keeps the protagonist's worlds separate — the Daily Planet orbit and the Luthor orbit never collide directly.
The Superman Factor
This is the fan-service data. Identity, powers, Krypto, and the DC universe references.
Identity: Who Is He?
"Superman" is used 348 times. "Clark" just 90. "Kal-El" appears only twice. The public persona overwhelms the private one — that's the tension of the story.
Power Usage
Flight dominates everything — 74 references. Heat vision only appears twice. Gunn writes a Superman who flies and punches. He's not a Swiss Army knife of powers.
Krypto Tracker
74 mentions across 39 scenes — Krypto appears in 16% of all scenes. He's not a gimmick. He's a co-lead.
DC Universe Name Drops
Every named character, place, or artifact from DC lore mentioned in the script. Ultraman (58 mentions) is clearly the big bad. Metamorpho (43) has a substantial presence.
Writer's Craft
The technical choices that reveal Gunn's style.
Action Verbs
"Looks" (94) and "sees" (54) dominate — this is a script about watching, observing, witnessing. "Flies" (38) is third. Gunn's Superman is defined by his gaze before his power.
Four of the top eight verbs are about vision: looks, sees, watches, stares. Before Superman acts, he observes. Gunn directs from the page through the character's eyes.
Readability
Flesch-Kincaid grade level: dialogue reads at a 2.4 grade level (extremely simple), action at 5.6. Gunn writes crystal-clear prose — no purple language, no overwritten descriptions. A second-grader could follow the dialogue. That's not a weakness. That's precision.
What the Data Tells Us
Zoom out from the individual metrics and a portrait of the screenplay emerges:
Gunn writes Superman as a visual story. 70% action, terse dialogue, a hero defined by verbs of seeing. This isn't a movie about what Superman says — it's about what he witnesses and how he responds.
The structure accelerates deliberately. The script doubles its scene density in the final act while halving scene length. It doesn't build to a climax — it detonates into one.
Identity is the engine. "Superman" appears nearly four times as often as "Clark." The script itself reflects the character's central tension: the public persona drowns out the private one.
Krypto isn't fan service — he's structure. 16% scene presence makes him a narrative throughline, not a cute distraction.
Gunn keeps worlds separate. Lois never meets Lex. Clark never meets Lex. The Daily Planet orbit and the Luthor orbit run in parallel, colliding only through Superman — the bridge between both worlds.
The numbers don't tell you whether a screenplay is good. But they tell you what a screenplay is. And what this screenplay is, at its core, is a story about a man who looks, who flies, and who speaks as little as possible — told in a script that accelerates like a heartbeat.
This analysis was generated by parsing James Gunn's Superman screenplay (132 pages) using custom scripts. The screenplay was broken into structured data — scenes, dialogue blocks, action lines, and parentheticals — then analyzed for 30 distinct metrics. Character names "Clark" and "Superman" are kept separate to reflect how the script itself treats them as distinct personas. All data reflects the screenplay text only — not the final film.