Introduction to the Chinese Single-character Word Database (CSWD)
This online database is based on Liu's doctoral dissertation (Beijing Normal University, 2006) and was constructed by Seth Levine (Johns Hopkins University) and Xiaowei Zhao (University of Richmond).
The main purpose of this database is to help you select appropriate materials for your experiment. You can quickly obtain values for fifteen variables that are considered potentially important to word processing, which spares you from having to collect ratings for each character yourself when you conduct your experiments. All that is left for you to do is to select the variables pertinent to your study and let the database facilitate your research.
To date, the database includes only 2,390 single-character Chinese words (no two-character or multi-character words). It covers almost all the single-character nouns, verbs, and adjectives in modern Chinese. The words were selected from the Language Corpus System of Modern Chinese Studies (LCSMCS; Sun, Huang, Sun, Li, & Xing, 1997). For more details, please read Liu et al. (2007)*.
Each word has 16 categories: PinYin (Chinese pronunciation), grammatical category, word frequency, homophone density, phonological frequency, cumulative frequency, number of components, number of strokes, number of word formations, age of learning in textbook, number of meanings, phonological regularity, age of acquisition, word concreteness, concept familiarity, and imageability.
For more details about the variables and other information please refer to Liu (2006) and Liu, Shu, & Li (2007).
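As a rough illustration of how the variables can be used to select materials, the sketch below filters a handful of entries by grammatical category, number of strokes, and imageability. It is only a sketch under stated assumptions: the field names, the `select_stimuli` helper, the rating scale, and the rows themselves are hypothetical placeholders, not the database's actual export format or real CSWD values.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    """One word with a few of the listed variables (field names are hypothetical)."""
    word: str
    grammatical_category: str
    word_frequency: float
    number_of_strokes: int
    imageability: float


def select_stimuli(entries, category, stroke_range, min_imageability):
    """Keep entries of one category whose stroke count lies in an inclusive
    range and whose imageability rating is at or above a threshold."""
    low, high = stroke_range
    return [
        e for e in entries
        if e.grammatical_category == category
        and low <= e.number_of_strokes <= high
        and e.imageability >= min_imageability
    ]


# Placeholder rows with made-up values; these are NOT real CSWD entries.
entries = [
    Entry("W1", "noun", 120.0, 6, 5.8),
    Entry("W2", "verb", 300.0, 8, 3.1),
    Entry("W3", "noun", 15.0, 14, 6.2),
    Entry("W4", "noun", 80.0, 7, 2.4),
]

# Nouns with 5 to 10 strokes and an imageability rating of at least 5.0.
selected = select_stimuli(entries, "noun", (5, 10), 5.0)
print([e.word for e in selected])  # prints ['W1']
```

The same pattern extends to any of the other variables: add the field to the record and one more condition to the filter.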
If you use data from this database, please cite the hyperlinked reference below.