
Data Mining Final Assessment
Quiz by Yashaswini B M
Tag the questions with any skills you have. Your dashboard will track each student's mastery of each skill.
Which of these is correct about data mining?
The classification or mapping of a class using a predefined class or group is called:
What is the analysis conducted for uncovering some interesting statistical correlations between various associated-attribute-value pairs called?
__________ are the data objects that don’t comply with the general model or behaviour of the available data:
The primary use of data cleaning is:
The classification of the Data Mining System consists of:
The class under study in Data Characterization is known as:
What does the letter K stand for in the K-nearest neighbour algorithm?
What does noise mean in data mining?
Which process is used to rescaling the values of data into a range from 0 to 1?
What is often considered the early methods to identify patterns in data?
Bayesian classifiers is
Classification is
Classification accuracy is
Naive prediction is
Data mining application domains are
What is the main limitation of the Naive Bayesian classification algorithm?
Which one of the following statements about the K-means clustering is incorrect?
The analysis performed to uncover the interesting statistical correlation between associated -attributes value pairs are known as the _______.
Confidence is the conditional probability that a randomly selected transaction will include all the items in the consequent given that the transaction includes all the items in the antecedent.
One major shortcoming of association analysis is that the support confidence framework often-
Which data mining method is used to analyse transaction data?
In a transaction data set where we have three variables:Customer Identification
Product bought.
Order of product purchase.
What model role must we assign to (2) ie; Product bought.
In data mining a decision tree is a predictive model?
Without proper data processing it is very difficult to select the appropriate model for the data.