Storming Media: Pentagon Reports and DocumentsPentagon Reports: Fast. Definitive. Complete.     
New Account »
Forgot Password?
Advanced Search »

ComputersComputer Programming and Software

Robust Order Statistics Based Ensembles for Distributed Data Mining

Authors: Kagan Tumer; Joydeep Ghosh; TEXAS UNIV AT AUSTIN DEPT OF ELECTRICALAND COMPUTER ENGINEERING
 
Abstract: This chapter is rooted in the ensemble framework and shows how order statistics can be used in the design of a "meta-learner" that examines the outputs of multiple distributed classifers and provides a final decision. Order statistics is one of the key tools of robust statistics, tailored to handling data with outliers. in a distributed data mining scenario in which there is wide variability among the individual classifers because of the underlying quality of the local data that they examine, a meta-learner should be able to tolerate a few outlier classifer results. The robust properties of order statistics based approaches such as median filtering and m-estimators (Arnold, Balakrishnan, and Nagaraja 1992), have been observed in many disciplines. Thus they are an obvious candidate for meta-learning in such environments. ANNOTATION: Reprint: Robust Order Statistics Based Ensembles for Distributed Data Mining

Limitations: APPROVED FOR PUBLIC RELEASE
Pages: 27
Report Date: 2001
Contract Number: DAAG55-98-0-0230, DAAD19-99-1-
Report Number: A458593
Keywords relating to this report:
CLASSIFICATION
DATA BASES
INFORMATION RETRIEVAL
ORDER STATISTICS
REPRINTS
Adobe PDF - $18.95
Printed Format - $29.95
Please check the box for the format you wish to order.
Shipping Terms
About Electronic Delivery

Email This Abstract