CSCI 544 — Applied Natural Language Processing


Research Project


Updates (April 18):

(April 17):


Overview

The research project is an in-depth activity that will be carried out in teams of four. The project can be on any aspect of natural language processing. You will formulate a research question, identify resources and tools to address the question, implement and evaluate a system that uses these resources and tools, demonstrate the system, and write up a report.

Procedure

Proposal structure

The proposal describes your plan for the research project, and will serve as the skeleton for the final report. As a plan it is subject to change and does not represent a firm commitment, but it should show that you’ve thought through the relevant aspects of your research. The proposal should be a document of about 500 words, written in English in good academic style. The structure of the document should be as follows.

The proposal should be written after you have received some feedback about the general direction of your project on the Blackboard thread. You will receive written feedback about your proposal, which should help you with writing the final report; however, feedback on the proposal might take some time, so don’t delay collecting your data and implementing your system while waiting for comments on your proposal. For feedback on specific issues that arise with the project, use the Blackboard thread.

Code demonstrations

Each team will meet with the instructor and the TAs for a brief demonstration to show how their code works. Code demonstrations take place on Monday, April 18, in SAL 213. Timed sessions were arranged individually with each team.

Final report

The final report describes the research you have done, reporting on the method and results, relating the research to other work in the field, and offering conclusions and directions for future work. The report should be about 2000 words long, not counting the references; reports that substantially exceed this length will be penalized. The structure is similar to the proposal, but with more detail, and two additional sections following the method section.

The six main content sections (introduction, materials, procedure, evaluation, results, and discussion) carry equal weight. Therefore, they should be of similar lengths – this means reserving about 300–350 words for each section. This is only a general guideline, as you may find that some sections require more text than others. However, if you find you have more to say than fits within the length requirement, then you’ll need to concentrate on the more important aspects of your project.

When giving examples of text in languages other than English, please use the following multi-line format, to make the examples readable to English speakers. Below is an example for how to present a sentence in Hindi.

1. The original text in its native script:किसने दवाई को खरीदा
2. A transcription into Latin script:kisne davaaii ko khariidaa
3. A word-by-word gloss:whoERG medicine ACC bought
4. A translation into English:‘Who bought the medicine?’

Line 2 is not needed if the language natively uses a version of the latin script. Also, the line numbers and explanations on the left are not needed in the report.

Grading

The grade for the assignment will be broken down as follows.

The research project counts for 30% of the overall course grade.