Conley, P., Hage, D., & Burgess, C. (1998). Large scale databasses of proper names. Paper presented at the 28th annual SCiP conference, Dallas.


Few tools for research in proper names have been available -- specifically, no large scale corpus of proper names. The authors constructed three frequency counts of proper names, one based on U.S. phone book listings, one from the Brown corpus (Kucera & Francis, 1967) and the other derived form a 300 million word database of Usenet text (Burgess & Livesay, 1998). These proper namecounts are freely available online for download. Potentials for these proper name counts range form using the names as stimuli in experiments to using the names as filters in software.