CSI 5386: NLP
Project Description

Project Proposal (2-3 pages) Due: After the reading week
Final Project Report Due: At the end of the exam period
Project presentation: Last class
Demo (optional during presentation)



Introduction

In this project, you are expected (1) to select a particular area of NLP that interests you, (2) to conduct a literature search on this area, (3) to focus on a specific problem in the area you selected, and (4a) to design and implement a novel learning scheme or (4b) to extend an existing scheme to deal with the problem you have identified. Alternatively (4c), you can compare the performance of different existing schemes on the specific problem you have identified in (1), (2) and (3) and on different corpora.

It is important to start working on this project early. I suggest that you start reading the textbook, some of its suggested follow-up material, conference proceedings, journals, and papers available from the Web, early enough to settle quickly on a subject of interest to you. I will be available for discussions both before the project proposal is due and after that, during the development of your research.

In order to help you select a topic, here is a list of project suggestions though you are more than welcome to propose your own idea.

 

Sources of datasets and project ideas:

·       SemEval

·       CLEF

·       Kaggle (search for text data)

·       TREC

Other project suggestions