comp3425 Data Mining
Maximum of 10 pages, excluding cover sheet, bibliography and |
|
A4 margin, at least 11 point type size, use of typeface, margins |
|
You are expected to write in a style appropriate to a professional report. You may refer to http://www.anu.edu.au/students/learningdevelopment/writing-assessment/report-writing for some stylistic advice. You are expected to use the question and sub-question numbering in this assignment to identify the relevant answers in your report.
No particular layout is specified, but you should use no smaller than 11 point typeface and stay within the maximum specified page count. Page margins, heading sizes, paragraph breaks and so forth are not specified but a professional style must be maintained. Text beyond the page limit will be treated as non-existent.
This is a single-person assignment and should be completed on your own. Make certain you carefully reference all the material that you use, although the nature of this assignment suggests few references will be needed. It is unacceptable to cut and paste another author's work and pass it off as your own. Anyone found doing this, from whatever source, will get a mark of zero for the assignment and, in addition, CECS procedures for plagiarism will apply.
No particular referencing style is required. However, you are expected to reference conventionally, conveniently, and consistently. References are not included in the page limit. Due to the context in which this assignment is placed, you may refer to the course notes or course software where appropriate (e.g. “For this experiment Rattle was used”), without formal reference to original sources, unless you copy text or images which always requires a formal reference to the source.
An assessment rubric is provided. The rubric will be used to mark your assignment. You are advised to use it to supplement your understanding of what is expected for the assignment and to direct your effort towards the most rewarding parts of the work.
(a) In your own words, briefly describe the purpose and means of data collection.
3. Association mining: What factors affect satisfaction with the country’s future?
mining to find out which factors might be indicative of a person’s response to A1.
(a) Generate association rules, adjusting min_support and min_confidence parameters as you need.
What parameters do you use? Bearing in mind we are looking for insight into what factors affect A1,
find 3 interesting rules, and explain both objectively and subjectively why they are interesting.
(b) Comment on whether, in general, association mining could be a useful technique on this data.
4. Study a very simple classification task
(a) This should be a very easy task for a learner. Why? Hint: Think how Opinionated is defined.
(a) Explain which you chose of a regression tree or neural net and justify your choice.
6. More Complex Classification
and its cluster model for k-means to answer the following.
8. Qualitative Summary of Findings (Hint: approx 1/2 page)
Assessment Rubric COMP3425 Data Mining
- 2022-03-04
- 2022-03-04
- 2022-03-04
- 2022-03-04