My PhD thesis, October 2003, Abstract, download ps file. or pdf file.

Book chapters
  1. Aminul Islam and Diana Inkpen, "Semantic similarity of short texts", in Nicolas Nicolov, Galia Angelova, and Ruslan Mitkov (eds.), Recent Advances in Natural Language Processing V, John Benjamins Publishing Company (Selected Papers from RANLP 2007), Current Issues in Linguistic Theory, 2009, to appear.
  2. Diana Inkpen, "Traduction automatique et comparaison des synonymes français et anglais'', Les cahiers scientifiques de l'association francophone pour le savoir,  Colloque C-521: Les nouvelles technologies et le traitement automatique des langues au coeur des dispositifs d'apprentissage, 2005, p. 43-52, pdf file
  3. Diana Zaiu Inkpen, Olga Feiguina, and Graeme Hirst, "Generating more-positive or more-negative text'', in Computing Attitude and Affect in Text, Springer, Dordrecht, The Netherlands, 2006, p.187-196, (Selected papers from the Proceedings of the Workshop on Attitude and Affect in Text, AAAI 2004 Spring Symposium), Edited by James G. Shanahan, Yan Qu, Janyce Wiebe. pdf file
  4. Diana Zaiu Inkpen and Graeme Hirst, "Near-synonym choice in natural language generation'', in Recent Advances in Natural Language Processing III, John Benjamins Publishing Company, 2004 (Selected Papers from RANLP 2003), edited by Nicolas Nicolov, Kalina Bontcheva, Galia Angelova, and Ruslan Mitkov, p. 141-152, pdf file
Journal Publications:

  1. Qibo Zhu, Diana Inkpen, Ash Asudeh, "Automatic extraction of translations from web-based bilingual materials", Machine Translation, 21 (3): 139-163, 2008, pdf file. The definitive version is available through SpringerLink
  2. Aminul Islam and Diana Inkpen, "Semantic Text Similarity using Corpus-Based Word Similarity and String Similarity", ACM Transactions of Knowledge Discovery from Data (TKDD), 2(2), 2008, available through the ACM Digital Library
  3. Oana Frunza and Diana Inkpen, "Partial Cognate Disambiguation", Language Resources and Evaluation, Sept 2008, pdf file. The definitive version is available through SpringerLink
  4. Aminul Islam, Diana Inkpen, and Iluju Kiringa, "Applications of Corpus-based Semantic Similarity and Word Segmentation to Database Schema Matching", The International Journal on Very Large Data Bases (VLDB), 17(5): 1293-1320, 2008, available through Springer Link
  5. Diana Inkpen, "A statistical model of near-synonym choice'', ACM Transactions of Speech and Language Processing 4(1): 1-17, January 2007 pdf file. The definitive version is available through the ACM Digital Library
  6. Diana Inkpen and Graeme Hirst, "Building and using a lexical knowledge-base of near-synonym differences'', Computational Linguistics 32(2): 223-262, June 2006, pdf file. The definitive version is available at MIT Press
  7. Alistair Kennedy and Diana Inkpen, "Sentiment Classification of Movie Reviews Using Contextual Valence Shifters'', Computational Intelligence 22(2):110-125, May 2006, pdf file. The definitive version is available at www.blackwell-synergy.com .
  8. Diana Inkpen and Alain Desilets, "Extracting Semantically-Coherent Keyphrases from Speech'', Canadian Acoustics journal 32(3):130-131, special issue of Acoustics Week in Canada, Oct. 2004, pdf file
  9. Diana Inkpen, "Semantic Similarity Knowledge and its Applications", Studia Universitatis Babes-Bolyai Informatica, LII (1), Romania, 2007, pdf file

Publications in international conference and workshop proceedings (refereed, unless otherwise indicated):

  1. Aminul Islam and Diana Inkpen, "Real-Word Spelling Correction using Google Web 1T n-gram Dataset", in Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong, Nov. 2009, pdf file
  2. Muath Alzghool and Diana Inpken, Novel Techniques for Data Fusion in Spontaneous Speech Retrieval, in Proceedings of the Third Workshop on Searching Spontaneous Conversational Speech (SSCS 2009), ACM Multimedia, Beijing, China, Oct 2009, pdf file
  3. Maria Fernanda Caropreso, Diana Inkpen, Shahzad Khan, Fazel Keshtkar, "Automatic Generation of Narrative Content for Digital Games", in Proceedings of the 2009 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-K'09), Dalian, China, Sep. 2009, pdf file
  4. Fazel Keshtkar and Diana Inkpen, "Using Sentiment Orientation Features for Mood Classification in Blogs", in Proceedings of the 2009 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-K'09), Dalian, China, Sep. 2009, pdf file
  5. Aminul Islam and Diana Inkpen. "Managing the Google Web 1T 5-gram Data Set", in Proceedings of the 2009 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-K'09), Dalian, China, Sep. 2009, pdf file
  6. Aminul Islam and Diana Inkpen. "Real-Word Spelling Correction Using the Google Web 1T N-gram Dataset with Backoff", in Proceedings of the 2009 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-K'09), Dalian, China, Sep. 2009, pdf file
  7. Aminul Islam and Diana Inkpen, "Real-Word Spelling Correction Using Google Web 1T 3-grams", in Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP 2009, Sigapore, Aug. 2009, pdf file
  8. Maria Fernanda Caropreso, Diana Inkpen, Shahzad Khan, Fazel Keshtkar, "Visual Development Process for Automatic Generation of Digital Games Narrative Content", in Proceedings of the Workshop on Language Generation and Summarisation (UCNLG+Sum), ACL-IJCNLP 2009, Singapore, Aug. 2009, pdf file
  9. Qibo Zhu, Diana Inkpen, Ash Asudeh. "Inducing translations from officially published materials in Canadian government websites", Machine Translation Summit XII, Ottawa, ON, Canada, Aug. 2009, pp. 176-183, pdf file
  10. Martin Scaiano and Diana Inkpen, "Automatic Frame Extraction from Sentences", in Proceedings of the 22th Canadian Conference on Artificial Intelligence AI 2009, May 2009, Kelowna, BC, Canada, pdf file
  11. Maria Fernanda Caropreso, Diana Inkpen, Shahzad Khan and Fazel Keshtkar, "Novice-Friendly Natural Language Generation Template Authoring Environment", in Proceedings of the 22th Canadian Conference on Artificial Intelligence AI 2009, May 2009, Kelowna, BC, Canada, pdf file
  12. Alexandre Kouznetsov, Stan Matwin, Diana Inkpen, Amir Razavi, Oana Frunza, Morvarid Sehatkar and Leanne Seaward, "Classifying Biomedical Abstracts Using Committees of Classifiers and Collective Ranking Techniques", in Proceedings of the 22th Canadian Conference on Artificial Intelligence AI 2009, May 2009, Kelowna, BC, Canada, pdf file
  13. Muath Alzghool and Diana Inkpen, "Cluster-based Model Fusion for Spontaneous Speech Retrieval", Workshop on Searching Spontaneous Conversational Speech, SIGIR 2008, Singapore, Aug. 2008, pdf file
  14. Oana Frunza and Diana Inkpen. "Representation and classification techniques for clinical data focused on obesity and its co-morbidities", In Proceedings of the Second i2b2 Shared-Task and Workshop Challenges in Natural Language Processing for Clinical Data Obesity Challenge, Washington DC, Aug. 2008.
  15. Oana Frunza and Diana Inkpen. "Textual Information Help in Predicting Functional Properties of the Genes", In Proceedings of ACL BioNLP 2008, Columbus, OH, June 2008, poster, pdf file
  16. Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution, by Leanne Spracklin, Diana Inkpen and Amiya Nayak, in Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco, May 2008, pdf file
  17. Combining Multiple Models for Speech Information Retrieval, by Muath Alzghool and Diana Inkpen, in Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco, May 2008, pdf file
  18. Aminul Islam and Diana Inkpen, "Semantic Similarity of Short Texts", in Proceedings of the International Conference RANLP-2007 (Recent Advances in Natural Language Processing), Borovets, Bulgaria, Sept. 2007, pdf file
  19. Aminul Islam, Diana Inkpen, and Iluju Kiringa, Database Schema Matching using Corpus-based Semantic Similarity and Word Segmentation, in Proceedings of the Fifth International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2007), Vienna, Sept. 2007, pdf file
  20. Muath Alzghool and Diana Inkpen, "Experiments for the Cross Language Speech Retrieval Task at CLEF 2006'', Proceedings of CLEF 2006, Lecture Notes in Computer Science, Springer-Verlag 4730, 2007, p.778-785, pdf file.
  21. Diana Inkpen, "Near-synonym Choice in an Intelligent Thesaurus", in Proceedings of the Human Language Technology Conference / North American chapter of the Association for Computational Linguistics, HLT-NAACL 2007, New York City, NY, April 2007, pdf file
  22. Oana Frunza and Diana Inkpen, "A Tool for Detecting French-English Cognates and False Friends", Traitement Automatique des Langues Naturelles, TALN-2007, Toulouse, France, 2007, pdf file
  23. Md. Aminul Islam, Diana Inkpen, and Iluju Kiringa, "A Generalized Approach to Word Segmentation Using Maximum Length Descending Frequency and Entropy Rate", in A. Gelbukh (Ed.): Proceedings of the Eigth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2007), LNCS 4394, Berlin Heidelberg: Springer-Verlag, p.175-185, 2007, pdf file
  24. Oana Frunza and Diana Inkpen, "Semi-Supervised Learning of Partial Cognates using Bilingual Bootstrapping", in Proceedings of the Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics, COLING-ACL 2006, Sydney, Australia, Aug. 2006. pdf file
  25. Diana Inkpen, Muath Alzghool, Gareth Jones and Douglas Oard, "Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection", in Proceedings of the Human Language Technology Conference / North American chapter of the Association for Computational Linguistics, HLT-NAACL 2006, New York City, NY, June 2006, short paper, p.61-64, pdf file (updated).
  26. Aminul Islam and Diana Inkpen, "Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words", Proceedings of the Fifth Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, May 2006. pdf file
  27. Diana Inkpen, Darren Kipp, and Vivi Nastase, "Machine Learning Experiments for Textual Entailment", The Second PASCAL Challenges Workshop on Recognizing Textual Entailment, Venice, Italy, Apr. 2006, p.10-15, pdf file
  28. Diana Inkpen, Muath Alzghool, and Aminul Islam. "Using various indexing schemes and multiple translations in the CL-SR task at CLEF 2005'', Proceedings of CLEF 2005, Lecture Notes in Computer Science, Springer-Verlag 4022, 2006, pdf file (updated).
  29. Diana Inkpen and Alain Desilets, "Semantic Similarity for Detecting Recognition Errors in Automatic Speech Transcripts'', HLT-EMNLP 2005 (Human Language Technology Conference joint with Conference on Empirical Methods in Natural Language Processing), Vancouver, Canada, Oct. 2005, p.49-56, pdf file
  30. Diana Inkpen, Oana Frunza, and Greg Kondrak. "Automatic Identification of Cognates and False Friends in French and English'', RANLP-2005, Bulgaria, Sept. 2005, p.251-257, pdf file
  31. Alistair Kennedy and Diana Inkpen. Sentiment Classification of Movie and Product Reviews Using Contextual Valence Shifters, in Proceedings of FINEXIN 2005, Workshop on the Analysis of Informal and Formal Information Exchange during Negotiations, Ottawa, May 2005, pdf file
  32. Oana Frunza, Diana Inkpen, and David Nadeau, "A Text Processing Tool for the Romanian Language", in Proceedings of the EuroLAN 2005 Workshop on Cross-Language Knowledge Induction, p.16-22, Romania, July 2005, pdf file
  33. Diana Inkpen and Darren Kipp, "A prototype natural language interface for animation systems'', in Proceedings of the IEEE International workshop on Haptic Audio Visual Environments and their Applications Haptic Audio Visual Environments and their Applications (HAVE 2004), Oct. 2004, pdf file
  34. Diana Zaiu Inkpen, Olga Feiguina, and Graeme Hirst, "Generating more-positive or more-negative text'', in Proceedings of the Workshop on Attitude and Affect in Text, Technical Report SS-04-07, AAAI Spring Symposium, March 2004, Stanford University. Download ps file or pdf file
  35. Diana Zaiu Inkpen and Graeme Hirst. "Near-Synonym Choice in Natural Language Generation.'' RANLP-2003, p.204-211, Bulgaria, Sept. 2003. Download ps file or pdf file
  36. Diana Zaiu Inkpen and Graeme Hirst. "Automatic sense disambiguation of the near-synonyms in a dictionary entry.'' Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2003), Mexico City, Feb. 2003, LNCS 2588, Springer-Verlag, 2003, p.265-281, Download ps file or pdf file
  37. Diana Zaiu Inkpen and Graeme Hirst. "Acquiring Collocations for Lexical Choice between Near-Synonyms.'' ACL 2002 Workshop on Unsupervised Lexical Acquisition, Philadelphia, 2002. Download ps file or pdf file
  38. Diana Zaiu Inkpen and Graeme Hirst. "Building a lexical knowledge-base of near-synonym differences.'' Workshop on WordNet and Other Lexical Resources, Second meeting of the North American Chapter of the Association for Computational Linguistics, Pittsburgh, June 2001. Abstract (HTML) Download: Adobe PDF file (95 Kb); PostScript file (106 Kb)
  39. Diana Zaiu Inkpen and Graeme Hirst. "Experiments on extracting knowledge from a machine-readable dictionary of synonym differences.'' In: Gelbukh, Alexander (editor), Computational Linguistics and Intelligent Text Processing (Proceedings, Second Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, February 2001), Lecture Notes in Computer Science 2004, Berlin: Springer-Verlag, 2001, p.264-278. Abstract (HTML) Download: Adobe PDF file (123 Kb); Compressed PostScript file (1431 Kb)

  40. A. Ferencz, T. Ratiu, M. Ferencz, T. Kovacs, I. Nagy, D. Zaiu, "ROMVOX: Text-to-Speech Synthesis of Romanian", 9th International Workshop on Natural Language Generation, Niagara-on-the-Lake, Ontario, Canada, August 1998
  41. I.A. Letia, M. Joldos, C. Cenan, D. Zaiu, A. Andreica, "Decision trees and rule induction in simulated soccer agents", In A. Drogoul, M. Tambe, T. Fukuda (eds), Collective Robotics, Lecture Notes in Computer Science 1456, Springer-Verlag, p.110-122, 1998
  42. I.A. Letia, M. Joldos, C. Cenan, D. Zaiu, A. Andreica, "State/action behavioral classifiers for simulated soccer players", in Proceedings of the 2nd RoboCup Workshop, (ICMAS-98), Paris, France, p.151-164, July 1998
  43. Doina Tatar, Diana Zaiu, "Unification Based and Object-Oriented Based Approaches to Grammars", Logical Aspects of Computational Linguistics (LACL'97), p.65-69, Nancy, France, Sept. 1997
  44. A. Ferencz, R. Arsinte, I. Nagy, T. Ratiu, M. Ferencz, G. Toderean, D. Zaiu, "Experimental Implementation of Pitch-Syncronuous Synthesis Method for the ROMVOX Text-to-Speech System", Vol. 5, p. 24-39, EuroSpeech'97, Rhodes-Greece, September 1997
  45. Diana Zaiu, "Romanian Morphology using PC-Kimmo", in Proceedings of the International Workshop SPEECH and COMPUTER (SPECOM'97), p.25-30, Cluj- Napoca, Romania, October 1997
  46. Diana Zaiu, "Modelling HPSG in ALE for the Romanian Language", CONTI'96, International Conference on Technical Informatics, p.1-8, Timisoara, Romania, November 1996
  47. Attila Ferencz, Radu Arsinte, Teodora Ratiu, Maria Ferencz, Diana Zaiu, Gavril Toderean, "Experimental Implementation of the LPC-MPE (Multi-Pulse Excitation) Synthesis Method for the ROMVOX Text-to-Speech System", SPECOM'96, p.159-164, St. Petersburg, Russia, Oct. 1996
  48. Attila Ferencz, Diana Zaiu, Teodora Ratiu, M. Ferencz, "An experimental Text-to-Speech System for Romanian Language", SACCS'95, International Workshop, p.241-245, Iasi, October 1995
  49. Diana Zaiu, Attila Ferencz, Teodora Ratiu, M. Ferencz, "Grapheme to Phoneme Conversion Algorithms in Text-to-Speech Systems", Intelligent Computer Communication, International Workshop, p.188-191, Cluj-Napoca, June 1995
  50. Tudor Muresan, Sanda Cherata, Alina Muresan, Diana Zaiu, Codruta Zdrenghea, "PROLOG Prototype for Romanian-English Reversible Translation", International Conference on Technical Informatics, p.118-122, Timisoara, November 1994
Summer Schools Attended:

Back to Diana Inkpen's Page