DOI QR코드

DOI QR Code

Design of Dynamic Buffer Assignment and Message model for Large-scale Process Monitoring of Personalized Health Data

개인화된 건강 데이터의 대량 처리 모니터링을 위한 메시지 모델 및 동적 버퍼 할당 설계

  • Received : 2015.10.13
  • Accepted : 2015.12.11
  • Published : 2015.12.31

Abstract

The ICT healing platform sets a couple of goals including preventing chronic diseases and sending out early disease warnings based on personal information such as bio-signals and life habits. The 2-step open system(TOS) had a relay designed between the healing platform and the storage of personal health data. It also took into account a publish/subscribe(pub/sub) service based on large-scale connections to transmit(monitor) the data processing process in real time. In the early design of TOS pub/sub, however, the same buffers were allocated regardless of connection idling and type of message in order to encode connection messages into a deflate algorithm. Proposed in this study, the dynamic buffer allocation was performed as follows: the message transmission type of each connection was first put to queuing; each queue was extracted for its feature, computed, and converted into vector through tf-idf, then being entered into a k-means cluster and forming a cluster; connections categorized under a certain cluster would re-allocate the resources according to the resource table of the cluster; the centroid of each cluster would select a queuing pattern to represent the cluster in advance and present it as a resource reference table(encoding efficiency by the buffer sizes); and the proposed design would perform trade-off between the calculation resources and the network bandwidth for cluster and feature calculations to efficiently allocate the encoding buffer resources of TOS to the network connections, thus contributing to the increased tps(number of real-time data processing and monitoring connections per unit hour) of TOS.

ICT 힐링플랫폼은 만성질환 예방을 목적으로 하며 개인의 생체신호 및 생황습관 등의 정보에 기반을 둔 질환 조기 경보를 목표로 한다. 이를 위한 2-step 개방형 시스템(TOS)에는 힐링플랫폼과 개인건강데이터 저장소간의 중계가 설계되었으며 데이터 처리과정을 실시간으로 전송(모니터링)하기 위한 대량 커넥션 기반의 publish/subscribe(pub/sub) 서비스가 고려되었다. 그러나 TOS pub/sub의 초기 설계에서는 커넥션 메시지를 deflate 알고리즘으로 인코딩하기 위해, 커넥션의 유휴(idle) 여부 및 메시지의 종류에 상관없이 동일한 버퍼를 할당한다. 본 논문의 동적 버퍼 할당은 다음과 수행된다. 우선 각 커넥션의 메시지 전송 유형을 큐잉하고, 각 큐는 tf-idf를 통해 특징(feature)추출 연산 후 벡터로 변환하여 k-means 클러스터에 입력하여 군집을 생성한다. 특정 군집으로 분류된 커넥션은 해당 군집의 자원 테이블에 따라 자원을 재할당 한다. 이때 각 군집의 센트로이드(centroid)는 해당 군집을 대표하는 큐잉 패턴을 사전에 선택하여 자원참조 테이블(버퍼 크기별 인코딩 효율)로 도출한다. 제안된 설계는 TOS의 인코딩 버퍼 자원을 네트워크 커넥션에 효율적으로 배분하기 위해, 군집 및 특징 연산을 위한 연산 자원과 네트워크 대역폭 간의 trade-off를 수행함으로써 TOS의 tps(단위 시간당 실시간 데이터 처리 모니터링 연결수)를 높이는데 활용할 수 있다.

Keywords

References

  1. JeMin Song, MyungSic Kim, KyeongJa Jeong and MoonSun Shin, "RBAC-based health care service platform for individual recommended health information service", Journal of the Korea Academia-Industrial cooperation Society(JKAIS) Vol.15, No.3, pp.1740-1748, 2014. https://doi.org/10.5762/KAIS.2014.15.3.1740
  2. Seokjin Im, Hee-Joung Hwang, "Development of Smart Health Client based on Real-Time Health Information Sharing Framework", The Journal of The Institute of Internet, Broadcasting and Communication (JIIBC) Vol.14, No.3, pp.131-137, 2014.6. https://doi.org/10.7236/JIIBC.2014.14.3.131
  3. Kim, Il Kon. "Health Innovation for Aging Society." Healthcare informatics research Vol.21, No.2 pp.63-66. 2015. https://doi.org/10.4258/hir.2015.21.2.63
  4. Frank E, Gillett "The Personal Cloud" Forrester Research, 2009.
  5. Dropbox Developers, https://www.dropbox.com/developers
  6. International Organization for Standardization. Health informatics - Person-owned document repository for PHR applications and health information exchange. Geneva: International Organization for Standardization; 2015. (ISO/AWI TR 20055).
  7. Roy Fielding, "Architectural Styles and the Design of Network-based Software Architectures", Dissertation of Doctor of Philosophy in Information and Computer Science, University of California, IRVINE, 2000.
  8. YoungJun Jeon, HeeJoung Hwang "Design of Two-Step Open System for Personalized Health Data Access", The Journal of The Institute of Internet, Broadcasting and Communication (JIIBC) Vol.15, No.4, pp.177-183, 2015.8. https://doi.org/10.7236/JIIBC.2015.15.4.177
  9. Vertx framework http://vertx.io/
  10. Rajaraman, Anand, and Jeffrey D. Ullman. "Mining of massive datasets", Vol. 77. Cambridge: Cambridge University Press, 2012.
  11. Luhn, Hans Peter. "A statistical approach to mechanized encoding and searching of literary information", IBM Journal of research and development Vol.1. No.4, pp.309-317, 1957. https://doi.org/10.1147/rd.14.0309
  12. MacQueen, James. "Some methods for classification and analysis of multivariate observations", Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. Vol.1. No.14. 1967.
  13. Hazelcast: In-Memory Data Grid https://hazelcast.com/
  14. Mahout framework http://mahout.apache.org/