CS590D/STAT598M Data Mining
Fall 2007: Project information

The semester project is a significant undertaking that will allow you to experience the entire process of data mining. You will choose a dataset, a task, apply one or more models/algorithms to the data, and evaluate the modeling results.

Choose an area (data, model, or algorithm) that is interesting to you, with a project scope that is likely to be doable in a semester. The only broad restriction on topic choice is that it must be a data mining application. It is not necessary to design/code your own data mining algorithm, but you can if you choose. If you choose to use existing software and modeling techniques, you will need to compare more than one model in your evaluation and explore reasons why one model performs best.

Here are a couple of ideas for projects. They need to be fleshed out with more detail; the list should be viewed as ideas for inspiration. If none of these interests you, feel free to propose your own topic.

Proposal: Due Sept 24

Before a project is undertaken, the key idea must be approved by the instructor. Send the instructor a few paragraphs by email describing (briefly):

Final report: Due Dec 7

The final report should be a 6-8 page report that includes: