Abstract:Anomaly detection method was used for calibration data concentration significantly different from other data objects. In this paper, the multiphase clustering aimed at resolving the import of noise data and the lack of the attributive sample, and changing the traditional passive learning of bayes for active learning ways to structure the superior performance classifier. In the preprocessing stage, a clustering algorithm based on density is introduced to handle noise data. And the output of the densitybased clustering algorithm can be used as the input of Kmeans, which responsible for handling the training samples with absent values. At classification time, we introduce adaboost algorithm into naive bayes to generate a more effective classifier.