Download | - View accepted manuscript: A Practical Data-Driven Framework for Parallel Data Mining (PDF, 265 KiB)
|
---|
Author | Search for: Yang, Chunsheng; Search for: Létourneau, Sylvain |
---|
Format | Text, Article |
---|
Conference | 9th World Multi-Conference on Systemics, Cybernetics and Informatics (WMSCI 2005), July 10-13, 2005, Orlando, Florida, USA |
---|
Subject | parallel data mining; feature extraction; model evaluation or testing; JavaParty |
---|
Abstract | In many practical applications, data mining results must be quickly delivered. To achieve the required efficiency, without sacrificing the quality of the results, practitioners are now looking at ways to parallelize the most computationally expensive steps of the data mining process. Realizing that a complete rewriting of existing sequential programs into parallel ones is often too tedious and expensive, we propose a framework which re-uses existing sequential programs to perform parallel data mining on a computer cluster. The proposed framework relies on the JavaParty system and can be used to parallelize both Java and non-Java programs. This paper details the framework, illustrates the implementation, and presents early experimental results showing the benefits of the approach. |
---|
Publication date | 2005 |
---|
In | |
---|
Language | English |
---|
NRC number | NRCC 47440 |
---|
NPARC number | 5764990 |
---|
Export citation | Export as RIS |
---|
Report a correction | Report a correction (opens in a new tab) |
---|
Record identifier | 7fc04822-e48f-4a7c-aa10-6c38d4fc218e |
---|
Record created | 2009-03-29 |
---|
Record modified | 2020-10-09 |
---|