Data mining highly multiple time series of astronomical observations
Huang, Fuchun (2004) Data mining highly multiple time series of astronomical observations. In: Data mining V: data mining, text mining and their business application. Zanasi, A, Ebecken, N. F. F and Brebbia, C. A, eds. WIT Press, Southampton, UK, pp. 375-382.
Abstract
This is a case study of data mining a large data set of astronomical interest. Our first concern is the outliers apparently existing in the data set. We used a robust method to do curve fitting and identify outliers, and estimated the occurrence intensity of outliers. We find that the occurrence intensity of outliers varies considerably over time. Besides, we designed a test which led to rejection of the hypothesis that all observation series are independent of each other. Combining this fact with our estimation of the occurrence intensity of outliers we believe there are common factors transiently acting on many series of observations. Additionally, we analyse gaps in time series and summarise simple but possibly interesting characteristics of data from a methodological viewpoint of data mining.
Dimensions Badge
Altmetric Badge
Additional Information | This paper has been presented at The Data Mining Conference in 2004 was held in Malaga, Spain. |
Item type | Book Section |
URI | https://vuir.vu.edu.au/id/eprint/1624 |
DOI | DOI: 10.2495/DATA040361 |
ISBN | 1-85312-722-9 |
Subjects | Historical > RFCD Classification > 240000 Physical Sciences Historical > Faculty/School/Research Centre/Department > School of Engineering and Science |
Keywords | data mining, highly multiple time series, loess, MACHO project, nonparametric curve fitting, outliers |
Citations in Scopus | 0 - View on Scopus |
Download/View statistics | View download statistics for this item |