Skip to content
/ m2n Public

Additional information about the experiments in the Neurocomputing paper "Noise detection in the meta-learning level"

Notifications You must be signed in to change notification settings

lpfgarcia/m2n

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

M2N

The experiments are divided in two steps. The first is related to the base-level and the second to the meta-level performance. To run the measures we need some R packages and the DCoL library.

Technical Requirements

R version 3.1.2 -- "Pumpkin Helmet"

DCoL Library: http://dcol.sourceforge.net/

Packages: CORElearn, e1071, FNN, foreign, infotheo, rpart, R.utils, rrcov and mvpart.

Set the experiments

Before start to run the measures we need the binary of the DCoL library. Download the code of DCoL Library at http://dcol.sourceforge.net/ and follow the instructions to compile the code in the README file. After that, put the executable (binary file) inside the m2n folder with the name "dcol".

Open the R inside the m2n folder.

Install the packages:


install.packages(c("CORElearn", "e1071", "FNN", "foreign", "infotheo", 
"rpart", "R.utils", "rrcov", "mvpart"))

Base-level

In the base-level folder we have the datasets used in this work, the measures to characterize the datasets and some complementary information about the base-level.

How to load the datasets?

The easiest way to load the datasets (arff files) is with the packages RWeka or foreign:

# load the iris dataset
require("foreign")
data = read.arff("base-level/database/iris.arff");

How to run the measures?

Load the source codes:

# load the complexity measures
source("base-level/measures/complexity.r")

# load the characterization measures
source("base-level/measures/characterization.r")

Run the functions with the data set as parameter:

# characterization measures
characterization(iris);

# complexity measures
complexity(iris);

Meta-level

In the meta-level folder we have the meta-datasets in the rand and pairwise folder and some complementary information about the base-level.

How to load the datasets?

The easiest way to load the datasets (.RData files) is with the dget function:

# load the iris dataset with random noise
data = dget("meta-level/database/rand/iris.RData");

Contact

Luis Paulo Faina Garcia - lpfgarcia [at] gmail.com

University of São Paulo - Campus São Carlos

References

[1] K. Bache, M. Lichman, UCI machine learning repository, http://archive.ics.uci.edu/ml (2013).

[2] A. Orriols-Puig, N. Maciá, T. K. Ho, Documentation for the data complexity library in C++, Tech. rep., La Salle - Universitat Ramon Llull (2010).

[3] M. Reif, A comprehensive dataset for evaluating approaches of various meta-learning tasks, in: First International Conference on Pattern Recognition and Methods (ICPRAM), 2012.

[4] R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria (2014). URL http://www.r-project.org/

About

Additional information about the experiments in the Neurocomputing paper "Noise detection in the meta-learning level"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages