Data Mining Java code for Data Preprocessing

Đang Thực Hiện

Mô tả

In this project, the students are to implement data pre-processing techniques and apply them to a gene expression dataset.

The dataset contains 62 samples collected from colon-cancer patients. 40 of the samples are labeled as ”negative” and 22 are labeled as ”positive.” Each tuple (row) in the dataset is a sample containing the readings for the genes, and the class (which is the last column) of the sample. Each gene is an attribute. The columns are separated by ”,”, which is a commonly used format in data mining. We will refer to the genes as G0, ..., GN, assigned in the left-to-right order as given in the original file.

You will write a C++ or Java program to handle the following two tasks:

Task 1. Task 2.

Discretize the data using equi-density binning with 3 bins for each of the first k attributes.

Use the entropy-based binning method to discretize all genes and to select the top-k genes, ranked in decreasing information gain order. Use 3 bins for each gene. Information gain for three bins is a generalization of the two-bins case (based on size-weighted entropy). To get three bins you should first divide the range of a given attribute into two bins and then divide one of the two bins into two more bins. The two splits should maximize the size-weighted entropy gain for the three intervals. (You should select between the two splits (one for the left interval and one for the right interval) as the the second split based on size-weighted entropy gain.)

IF POSSIBLE I HAVE A SIMILAR JAVA PROJECT THAT REQUIRES MODIFICATION OF THE CODE A LITTLE TO MEET THE GIVEN REQUIREMENTS

Kỹ năng: Khai thác dữ liệu, Java

Xem thêm: java code extract linkedin data, java project data mining, java code encrypt decrypt data, java code read usb data, java crawler data mining, code tree based data mining algorithm java, website data extract source code java, source code online update data java, source code association rule data mining, java code yahoo finance data, java code process forex data, data processing source code java, java code write file mysql database using poi, cnet code association rules data mining, java redirect data code, java send data usb

Mã Dự Án: #13118265

Đã trao cho:

dreaminfotechno

Hello, We have good experience of PHP,HTML,HTML5,CSS, Ajax,jquesry,JavaScript,Mysql, Graphic Design,Website Design,Bootstrap,wordpress,Codeigniter, open cart , E-commerce etc. We also have good experience in J2E Thêm

$50 USD trong 1 ngày
(4 Đánh Giá)
1.7

2 freelancer đang chào giá trung bình $30 cho công việc này

shahiddar

Hello, My name is shahid from Kashmir Over the last 7 years, I have worked for several clients. Joined Freelancer with over 7 years rich experience in the field.I have successfully completed more than 1000 projects in Thêm

$10 USD trong 0 ngày
(6 Đánh Giá)
4.3