Page 2 of 2   <      

Amazon's Vital Statistics Show How Books Stack Up

Discussion Policy
Comments that include profanity or personal attacks or other inappropriate comments or material will be removed from the site. Additionally, entries that are unsigned or contain "signatures" by someone other than the actual author will be removed. Finally, we will take steps to block users who violate any of our posting standards, terms of use or privacy policies or any other policies governing this site. Please review the full rules governing commentaries and discussions. You are fully responsible for the content that you post.

How, you may be wondering, will Text Stats enhance your life? Say you must choose between two best-selling novels: "The Kite Runner" by Khaled Hosseini and "The Curious Incident of the Dog in the Night-Time" by Mark Haddon. Both are in paperback; both sell for about $10.

"Kite Runner" is 384 pages long; "Curious Incident" is 240. On Amazon.com, you can read the publisher's synopses and the editorial and reader reviews -- helpful tools for making an intelligent choice.

But now, with Text Stats, you can reduce your decision to utter absurdity. You can learn, for example, which book scores higher on what is known as the Fog Index. Conceived by the late Robert Gunning, an English professor at Oxford University, the index states the number of years of formal education you should have in order to read and comprehend a random passage.

Here's how the index works: It picks a sample -- of 120 words or so -- from the text. It finds the average number of words per sentence, then picks up all the words in the sample that contain three or more syllables. Compound words are ignored; so are verbs that become polysyllabic through tense endings. The first word of each sentence is tossed out, so are proper nouns. Then it takes the polysyllable count, adds it to the average number of words in a sentence, multiplies that number by 0.4 and, voila! The answer is a number that supposedly represents a comprehension grade level.

The Fog Index shows that you should read at a seventh-grade level to digest "Kite Runner" and at ninth-grade level for "Curious Incident." The first novel contains 6 percent complex words, meaning three or more syllables. Five percent of the words in "Curious Incident" are considered complex. There is an average of 1.4 syllables per word in both books.

At 11,702 words per dollar, "Kite Runner" is obviously a better bargain than "Curious Incident," which contains only 6,156 WPD. That is, unless the words in "Curious Incident" are more meaningful, poetic or carefully chosen.

Text Stats still has a few kinks in its system. According to the Fog Index, a simple child's book such as "The Runaway Bunny" requires seventh-grade reading proficiency. And James Joyce's "Ulysses" is said to be easier than 80 percent of other indexed books.

But in its pure form, Text Stats is a triumph of trivialization. By squeezing all the life and loveliness out of poetry and prose, the computer succeeds in numbing with numbers. It's the total disassembling of truth, beauty and the mysterious meaning of words. Except for the Concordance feature, which arranges the 100 most used words in the book into a kind of refrigerator-magnet poetry game. Here's a poem made from the Concordance of Dave Ramsey's "Total Money Makeover: A Proven Plan for Financial Fitness": Emergency. Find first friend. Give kids life. Live myth.

Yes, you heard right: This site is under deconstruction.

Authors! Imagine how Text Stats will help you write books that are, if not better, at least easier to read according to the Fog Index and that offer the reader more words per pound than "Moby-Dick."

Publishers! Who needs editors anymore? If the software can find SIPs, surely it can be programmed to ferret out PCSs (Poorly Constructed Sentences), ORDs (overly romantic drivelings) and DIPs (Dreadfully Implausible Plots).

And readers! You can settle bar bets. Yes, "Ulysses" by James Joyce (9 on the Fog Index) is more complicated than William Faulkner's "The Sound and the Fury" (5.7 on the Fog Index). Yes, Charlotte Bronte provides more words per ounce (13,959 in "Shirley") than her sister Emily (10,444 in "Wuthering Heights"). And, yes, Ernest Hemingway used fewer complex words (5 percent) in his short stories than F. Scott Fitzgerald (9 percent).

That's right! Now you too can sound like a literary insider at Washington cocktail parties. You can throw around statistics and make clever conversation about the hard history books, the long-winded novels, even those thick, heavy, make-you-think philosophy tomes that contain really, really long words. And the beauty of it is, with Amazon's "Search Inside" Text Stats and other features, you won't even have to read them.


<       2


© 2005 The Washington Post Company