Above: How your final Process panel will look in RapidMiner after you do this Module.
1/5
Import data into RapidMiner.
Getting your data into RapidMiner is usually the first task you need to start your analysis. In this tutorial, you will learn how to import files into the central storage of RapidMiner, called the Repository.
|
ACTIVITY |
|
|
|
|
|
|
EXPLANATION |
|
|
|
|
The Repository panel, in the upper left corner by default, is the place to store all your data, processes, and
|
2/5
Add data to the process.
|
ACTIVITY |
|
|
|
|
|
|
EXPLANATION |
|
|
|
|
When you drag data from the repository into the process, it transforms into a data-loading operator (in this case, Retrieve Altoona Crime Rates). Data is not actually loaded (or delivered at the round output ports of each operator) until you run the process, which is what we do in the next few steps.
|
3/5
Create a connection in the process for results.
|
ACTIVITY |
|
|
|
|
|
|
EXPLANATION |
|
|
|
|
Only data which is delivered to one of the result ports (“res”) on the right can be seen after the execution of the process. If your process does not have at least one connection to a result port, you won’t see any results when you execute it!
|
4/5
Execute the process.
|
ACTIVITY |
|
|
|
|
|
|
EXPLANATION |
|
|
|
|
Once run, you automatically switch to the Results
|
5/5
Inspect data using summary statistics.
Congratulations, you have just imported your first data set! You will import data from now on in the same way. Even just simply importing data in RapidMiner gives you a lot of useful information. See the Challenge questions below for some examples.
|
CHALLENGE |
|
|
|
|
Using the
|