Classification of Precipitation Data Based on Smoothed Periodogram

Park, Man-Sik;Kim, Hee-Young;

doi:10.5351/KJAS.2008.21.3.547

The Korean Journal of Applied Statistics (응용통계연구)

Volume 21 Issue 3
/
Pages.547-560
/
2008
/
1225-066X(pISSN)
/
2383-5818(eISSN)

The Korean Statistical Society (한국통계학회)

DOI QR Code

Classification of Precipitation Data Based on Smoothed Periodogram

평활된 주기도를 이용한 강수량자료의 군집화

Park, Man-Sik (Dept. of Preventive Medicine, Korea University) ;
Kim, Hee-Young (Dept. of Preventive Medicine, Korea University)

박만식 (고려대학교 의과대학 의학통계학교실) ;
김희영 (고려대학교 의과대학 의학통계학교실)

Published : 2008.06.30

https://doi.org/10.5351/KJAS.2008.21.3.547 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

It is well known that spectral density function determines auto-covariance function of stationary time-series data and smoothed periodogram is a consistent estimator of spectral density function. Recently, Kim and Park (2007) showed that smoothed- periodogram based distances performs very well for the classification. In this paper, we introduce classification methods with smoothed periodogram and apply the approaches to the monthly precipitation measurements obtained from January, 1987 through December, 2007 at 22 locations in South Korea.

스펙트럼 밀도함수(spectral density function)는 시계열 자료가 정상성(stationarity)을 만족하는 경우에 주파수 영역(frrqllrnFr domain)에서 시계열 자료의 자기공분산함수(auto-covariance function)을 결정짓는 함수이고, 평활된 주기도(smoothed periodogram)는 스펙트럼 밀도함수의 일치 추정량(consistent estimator)이 됨이 잘 알려져 있다. 본 연구에서는 시계열 자료를 평활된 주기도를 이용하여 군집화하는 방법을 소개한다. 최근 김희영과 박만식 (2007)의 연구에 의하면 이 거리는 정상시계열들을 효율적으로 분류하고 있음을 알 수 있다. 본 연구는 시계열 자료를 분류하는데 사용된 기존의 거리들을 간략히 소개하고, 우리나라 22개 지역에서 1987년 1월부터 2007년 12월까지 측정한 월별 강수량 자료를 대상으로 평활된 주기도 거리를 이용하여 지역을 군집화한다.

Keywords

References

고정웅, 백희정, 권원태 (2005) 한반도 우기의 강수 특성과 지역 구분, Asia-Pacific Journal of Atmospheric Sciences, 41, 101-114
김성렬, 양진석 (1995) 한국의 온대 저기암성 강수지역 구분, <한국지역지리학회지>, 1, 45-60
김희영 , 박만식 (2007). Clustering time-series based on frequency domain, <한국통계학회 추계학술발표회 논문집>,73.
문영수 (1990). 클러스터분석에 의한 한국의 강수지역 구분, Asia-Pacific Journal of Atmospheric Sciences, 26, 203-215
이동규, 박정균 (1999) 군집 분석을 이용한 남한의 여름철 강수지역구분, Asia-Pacific Journal of Atmospheric Sciences, 35, 511-518
이승호 (1993) 계량적 분석에 의한 한국의 강수지역구분, <지역과 환경>, 11, 1-15
Bartlett, M. S. (1946). On the theoretical specification and sampling properties of auto- correlated time series, Supplement to the Journal of the Royal Statistical Society, 8, 27-41 https://doi.org/10.2307/2983611
Brockwell, P. J. and Davis, R. A. (1991). Time Series: Theory and Methods, Springer- Verlag, New York
Caiado, J., Crato, N. and Pena, D. (2006). A periodogram-based metric for time series classification, Computational Statistics & Data Analysis, 50, 2668-2684 https://doi.org/10.1016/j.csda.2005.04.012
Corduas, M. and Piccolo, D. (2008). Time series clustering and classification by the autoregressive metric, Computational Statistics and Data Analysis, 52, 1860-1872 https://doi.org/10.1016/j.csda.2007.06.001
Fu, T. C., Chung, F. L., Ng, V. and Luk, R. (2001). Pattern discovery from stock time series using self-organizing maps, KDD 2001 Workshop on Temporal Data Mining, August 26-29, San Francisco, 27-37
Galeano, P. and Pena, D. (2000). Multivariate analysis in vector time series, Resenhas, 4, 383-403
Goldstein, D. R., Ghosh, D. and Conlon, E. M. (2002). Statistical issues in the clustering of gene expression data, Statistica Sinica, 12, 219-240
Kakizawa, Y., Shumway, R. H. and Taniguchi, M. (1998). Discrimination and clustering for multivariate time series, Journal of the American Statstical Association, 93, 328-340 https://doi.org/10.2307/2669629
Kalpakis, K., Gada, D. and Puttagunta, V. (2001). Distance measures for effective clustering of ARIMA time series, In Proceedings of the 2001 IEEE international conference on data mining, 273-280
Liao, T. W. (2005). Clustering of time series data-a survey, Pattern Recognition, 38, 1857-1874 https://doi.org/10.1016/j.patcog.2005.01.025
Maharaj, E. A. (2000). Clustering of time series, Journal of Classification, 17, 297-314 https://doi.org/10.1007/s003570000023
Pattarin, F., Paterlini, S. and Minerva, T. (2004). Clustering financial time series: An application to mutual funds style analysis, Computational Statistics & Data Analysis, 47, 353-372 https://doi.org/10.1016/j.csda.2003.11.009
Piccolo, D. (1990). A distance measure for classifying ARIMA models, Journal of Time Series Analysis, 11, 153-164
Shumway, R. H. (2003). Time-frequency clustering and discriminant analysis, Statistics & Probability Letters, 63, 307-314 https://doi.org/10.1016/S0167-7152(03)00095-6

Cited by

Threshold Modelling of Spatial Extremes - Summer Rainfall of Korea vol.27, pp.4, 2014, https://doi.org/10.5351/KJAS.2014.27.4.655
Categorical time series clustering: Case study of Korean pro-baseball data vol.27, pp.3, 2016, https://doi.org/10.7465/jkdi.2016.27.3.621

The Korean Journal of Applied Statistics (응용통계연구)

Classification of Precipitation Data Based on Smoothed Periodogram

평활된 주기도를 이용한 강수량자료의 군집화

Abstract

Keywords

References

Cited by

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)