Implementation of The Data in Rapidminer
Implementation of The Data in Rapidminer
Implementation of The Data in Rapidminer
Dataset
This data in this study uses data from Kaggle entitled Book Data. This data
consists of 460 pieces, with data on book titles, publisher names and publisher years.
This data has attributes or features that describe each example.
When importing data, note that each table column is assigned the correct type
attribute. If given the wrong attributes, the end result will also be wrong.
a) Binominal: data type used for data types that only have two types, such as
YES / NO, or 0/1.
b) Polynominal: the type of data used for data types that have more than 2 types.
c) REAL: used for data types that have a decimal in the number.
d) Integer: used for data types that use numbers for data, without any decimals.
For each data, the following are the types of data used for training data and testing
data :
1.Book Titles : Polynominal
2.Publisher Names : Polynominal
3.Publisher Years : Integer
At the data book processing stage, the accuracy of each publisher, the name of the
book, and the year of publication will be shown. where cross validation will show
accurate results from the titles of books that are most interested in to those that are
least in demand from year to year
Modeling is a stage that directly involves data mining techniques, using the decision
tree technique to describe the book title from the year it was published, the name of
the book and also where the book was published.
The decision tree will also specify the existing book data by describing how many
books have been published from a publisher according to the book title listed.