This example illustrates mining data where the build data is in two tables that must be joined before building the model.
For an overview of text mining, see Text Mining.
This example uses two tables for input:
MINING_DATA_BUILD_VCUSTOMER_COMMENTS, which contains customer comments arranged according to CUST_IDNote: The table CUSTOMER_COMMENTS is not available; you cannot create an activity that can be run.
This example adds complex additional data to a table. For more information about adding complex additional data, see "Complex Additional Data" in Chapter 3 - Overview of Mining Activity Guides in the Oracle Data Mining Tutorial; the tutorial includes screen shots that illustrate the process.
The table CUSTOMER_COMMENTS has the following structure:
CUST_ID, the primary key, type NUMBER, size 22COMMENTS, type VARCHAR2, size 4000COMMENTS are customer comments. There will be several different comments for each customer.
We want to predict customers for whom the value of AFFINITY_CARD is 1.
These are the steps to build a classification model:
MINING_DATA_BUILD_V. This is a view used by the Oracle Data Mining sample programs.Classification as the Function Type (this is the default), and select Support Vector Machine as the Algorithm.MINING_DATA_BUILD_V. Check Join additional data with case table. Select CUST_ID as the Unique Identifier. For all other choices, use the defaults. Click Next.CUSTOMER_COMMENTS in the Available Tables list and move it to the Selected Tables list. Click Edit to define the relationship.CUST_ID for both Case Table Column and Related Table Column.One to Many for Relationship Type; you select One to Many because there are several comments for each customer. In Transactional Column Mappings, click New to define a new mapping.COMMENTS as the Value Column. Change the Mining Type to text. Verify that Data is Sparse is checked. Accept all other defaults.
Note: You must change the mining type from categorical to text for text mining to take place. If the dropdown menu does not appear, you have selected an algorithm that does not support text mining.
AFFINITY_CARD as the Target. Note that the Mining Type of COMMENTS is text.
Click OK.
DEMO_TEXT_COMPLEX_JOIN.DEMO_TEXT_COMPLEX_JOIN is displayed. Note that the activity has all of the steps of an activity that builds a Support Vector Machine model plus the steps Text and Test(Text). In these additional steps, Oracle Data Miner does all of the processing required to prepare the text column for mining. The Options for these steps support advanced text mining features such as customized stoplists.Copyright © 2008, Oracle. All rights reserved.