Publications
Things I have written that other people have agreed to print.
Journal Articles and Book Chapters
2012
Underspecification of cognitive status in reference production: Some empirical predictions
TopiCS (Topics in Cognitive Science) Special issue on Production of Referring Expressions: Bridging the Gap between Computational and Empirical Approaches to Reference 4(2), 249-268 (pdf)
2011
Language Preservation: A Case Study in Collecting and Digitizing Machine-Tractable Language Data
Journal of the Chicago Colloquium on Digital Humanities and Computer Science 1(3)
2009
Demonstrative pronouns in natural discourse
Anaphora Processing: Linguistic, Cognitive and Computational Modelling, ed. by Antonio Branco, Tony McEnery, and Ruslan Mitkov. John Benjamins, 351-364 (pdf)
2007
The Grammar-Pragmatics Interface: Essays in Honor of Jeanette K. Gundel.
Pragmatics and Beyond Series. John Benjamins
2005
Pronouns without NP Antecedents: How do we know when a pronoun is referential?
Anaphora Processing: Linguistic, Cognitive and Computational Modelling, ed. by Antonio Branco, Tony McEnery, and Ruslan Mitkov. John Benjamins, 351-364 (pdf)
2004
Mood and modality: Out of the theory and into the fray
Natural Language Engineering Journal 10, 57-89.
2002
Embedding knowledge elicitation and MT systems within a single architecture
Machine Translation 17(4), 271-305
2001
Cognitive Status and Definite Descriptions in English: Why Accommodation is Unnecessary
Journal of English Language and Linguistics 5, 273-295 (pdf)
2000
Statut cognitif et forme des anaphoriques indirects
Verbum 22, 79-102 (pdf)
1997
Pragmatic Determinants of Intonation Contours for Dialogue Systems
International Journal of Speech Technology 1, 109-120 (pdf)
1993
Cognitive status and the form of referring expressions in discourse
Language 69, 274-307 (pdf)
Conference Papers
2011
Underspecification of cognitive status in reference production: Some empirical predictions
Proceedings of the Annual Cognitive Science Society Meeting. Pre-Cog Sci 2011 Workshop: Bridging the Gap between Computational, Empirical and Theoretical Approaches to Reference, 249-268
2009
Linguistic Dumpster Diving: Geographical Classification of Arabic Text
Proceedings of the Chicago Colloquium on Digital Humanities and Computer Science (pdf)
Investigations on Standard Arabic Geographical Classification
Proceedings of the Computational Approaches to Arabic Script-based Languages Workshop (pdf)
User choice as an evaluation metric for web translation services in cross language instant messaging applications
Proceedings of the Machine Translation Summit XII (pdf)
2007
Directly and indirectly anaphoric demonstrative and personal pronouns in newspaper articles
Proceedings of the Sixth Annual Discourse Anaphora and Anaphor Resolution Colloquium. Lagos, Portugal
Understanding Defense Threat Reduction Agency science & technology investment strategies
Proceedings of the 2007 Chemical Biological Information Systems Conference. Austin, TX.
2006
Guarani: A case study in resource development for quick ramp-up MT
Proceedings of the Seventh Biennial Conference of the Association for Machine Translation in the Americas (pdf)
2005
The role of ontologies in a linguistic knowledge acquisition task
Proceedings of The Electronic Metastructure for Endangered Languages Data Workshop on Linguistic Ontologies and Data Categories for Language Resources (pdf)
2004
Demonstrative pronouns in natural discourse
Proceedings of DAARC 2004 (Discourse Anaphora and Anaphor Resolution Colloquium) (pdf)
2003
A Discourse System for Conversational Characters
Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics, ed. by Alexander Gelbukh. Heidelberg: Springer-Verlag, 492-495
2002
Stress and in-focus pronominals
International Conference on Discourse Anaphora and Anaphor Resolution (DAARC2002). Lisbon
2001
Modularity in knowledge elicitation and language processing
Proceedings of the Third Annual High Desert Linguistics Conference, 93-104 (pdf)
2000
MT and topic-based techniques to enhance speech recognition systems for professional translators
Proceedings of CoLing, 1061-1065 (pdf)
1999
Language recognition for mono- and multi-lingual documents
Proceedings of the Vextal Conference, 209-214 (pdf)
Multilingual document language recognition
Proceedings of the Machine Translation Summit VII, 317-323
1998
An autonomous web-based multilingual corpus collection tool
Proceedings of the International Conference on Natural Language Processing and Industrial Applications, 142-148 (pdf)
1997
Topic-Comment Structure, Syntactic Structure and Prosodic Tune
Workshop on Prosody and Grammar in Interaction (pdf)
1995
Pragmatic factors in the production of intonation contours for conversational systems
Proceedings of the Annual International Voice Technologies Conference 14, 55-63 (pdf)
Prosodic tune and information structure
Proceedings of the Annual Meeting of the Canadian Linguistic Association
1994
Accenting phenomena, association with focus, and the recursiveness of focus-ground
Ninth Amsterdam Colloquium (pdf)
A computational model of information structure and intonation. Focus and natural language processing
IBM Working Papers of the Institute for Logic and Linguistics, ed. by Peter Bosch and Rob van der Sandt. Heidelberg: IBM 1, 61-70
1992
Generation of accent in nominally premodified NPs
Proceedings of the International Conference on Computational Linguistics 14, 253–259
1990
Givenness, implicature, and the form of referring expressions in discourse
Berkeley Linguistics Society 16, 442-453
1989
Givenness, implicature, and demonstrative expressions in English discourse
Chicago Linguistic Society 25/2, 89-103
1988
On the generation and interpretation of demonstrative expressions
Proceedings of the International Conference on Computational Linguistics 12, 216–221
Presentations and Invited Talks
2010
Language preservation: A case study in collecting and digitizing machine-tractable language data
Chicago Colloquium for Digital Humanities, 249-268
2005
Annotating the structure of science
Department of Homeland Security Advanced Scientific Computing Program Text Analysis Workshop
2003
Referring expressions in computer-mediated conversations
Vancouver Studies in Cognitive Science Annual Conference
2001
What's the big deal? (The difficulties of machine translation)
University of Maryland Eastern Shore African Language Research Seminar
2000
Interpreting nothingness (computational treatment of ellipsis in Russian)
University of El Paso Linguistics Seminar
A field linguist in a box
Linguistic Exploration Workshop. Annual Meeting of the Linguistic Society of America
Definiteness, accommodation, and cognitive status
Invited paper at the conference on Linguistics and the English Language. Université de Toulouse-Le Mirail.
Working Papers
2006
Data-Centric Computing with the Netezza Architecture
Sandia Report SAND2006-1853 (pdf)
2005
Word sense disambiguation using extremely large databases
Memoranda in Computer and Cognitive Science. New Mexico State University.
Multi-document summarization of enormous clustered data sets
Memoranda in Computer and Cognitive Science. New Mexico State University.
From knowledge elicitation system to teaching tool
Working Paper #04-05, Institute for Language and Information Technologies, University of Maryland Baltimore County.
User-extensible on-line lexicons for language learning
Working Paper #05-05, Institute for Language and Information Technologies, University of Maryland Baltimore County.
The Boas II named entity elicitation system
Working Paper #08-05, Institute for Language and Information Technologies, University of Maryland Baltimore County
1992
BRIDGE: Basic Research on Intonation for Dialogue Generation
University of Edinburgh Department of Linguistics occasional paper