Are there acknowledged studies about the relationship of vocabulary and comprehension of English language?

Though I’m not sure whether this question is suitable to EL&U site or not, I’m glad if I can get any input on the relationship of the vocabulary and comprehension of English language, or if you can suggest me other suitable site that I can post this question.

I recently read the book, titled “Read a Million Words (to master English language),” written by Japanese English language scholar at Tokyo Tele-communication University, Kunihide Sakai.

Mr.Sakai cites the following numbers:

  • Vocabulary of 1,000 words allows 80% comprehension of English statement

  • 2,000-word vocabulary allows 86.6 % comprehension of English statement

  • 3,000-word vocabulary allows 90.6 % comprehension of English statement

  • 4,000-word vocabularies allows 92.8 % comprehension of English statement

Though it’s not clear what "English statement" means (maybe I’d do better to ask him directly as well as about the source of number), it’s hardly believable that 3,000 vocabulary level allows 90% comprehension of general statement in English language from my experience of reading English language literatures everyday to look for the subject for posting EL&U.

I know British psychologist and linguist, Charles K. Ogden asserted that we can express everything we want to say with 1,500 words (850 basic English words and their combinations, plus 350 international (foreign) words and 300 from scientific terminology), and promoted Basic English movement during 1930s.

Winston Churchill supported Ogden’s theory and recommended diffusion of Basic English as the international language in his discourse at Harvard University in 1944. However, Ogden's theory is applied only to writing and speaking, not reading and hearing.

Is there any trustworthy study that illuminates the relationship between vocabulary level and ability to comprehend English language statements expressed in day-to-day conversation, literature (not too academic) and daily journals? What is its finding with regards to the relationship between vocabulary and comprehension?


Solution 1:

A person who has a large vocabulary does not necessarily comprehend a higher fraction of all statements than a person with a small vocabulary; that is to say, some people with small vocabularies may understand what people tell them or what they read more quickly and thoroughly than other people with large vocabularies. But it is reasonable to suppose that on the average, and as a general rule, larger vocabulary leads to better understanding.

The trend expressed in the quoted statistics is of plausible form: the first thousand words counting for most understanding, with successively lower return from each increment thereafter.

As to whether a 3000-word vocabulary level typically allows 90% comprehension of general statements in the English language (where, as noted in comments, word and comprehension are nebulous concepts), I find the statistic both plausible and implausible, believable and unbelievable. In favor of the statistic, the vocabularies of typical American English speakers that I encounter seem quite limited; toss out any word not among the top few thousand words and they don't know the meaning of it. Strongly weighing against the statistic are webpages as at testyourvocab.com which shows about 3300 words in the vocabulary of the average 3-year-old, and 20000 to 32000 words in the vocabulary of average people older than 15. A straightdope.com page quotes Stephen Pinker as suggesting the average American high-school graduate knows about 60000 words. In his book The Language Instinct, Pinker says the 45000-word average vocabulary suggested by other researchers rather underestimates vocabulary size.

To recapitulate: (1) vocabulary size is not necessarily a good predictor of language comprehension; (2) nothing in this answer either establishes or disproves any of the statistics mentioned in the question; and (3) I think the terminology in the question is not precise enough to allow definitive answers.