当前位置:网站首页>Thesis learning -- Analysis and Research on similarity query of hydrological time series
Thesis learning -- Analysis and Research on similarity query of hydrological time series
2022-07-01 07:38:00 【Graduate students are not late】
List of articles
Write it at the front :《 hydrological 》;2009 year ;
author : Li Wei 、 Sun Honglin

1 Abstract
- Hydrological time series similarity query , It can be used for Rain flood process prediction 、 Environmental evolution analysis 、 Analysis of hydrological process law Other aspects .
- The most direct application is , Answer questions often asked in flood control command :“ The current hydrological process is equivalent to the same process in which period in history ”
- Introduce the theory and technology of data warehouse and data mining .
2 introduction

3 Problem description
Traditional time series similarity search , It mainly emphasizes precise matching , But in data mining applications , Because of the huge amount of data , Generally, it is based on approximate matching “ Approximate search ”.
The key work of hydrological time series similarity mining is :
Division of subsequences . In the National Hydrological Database , Flood engineering has been divided according to the theory of runoff generation , Form an excerpt of various elements .
however , In the daily value class , It needs to be divided according to the type of problem to be solved , We need to make the partition rules It conforms to the hydrological theory , And suitable for computer processing .Sequence feature extraction . Generally, the sequence is transformed , For example, Fourier transform 、 Wavelet transform or piecewise average mapping to feature space .
Determination of similarity measure . For hydrological processes , Different hydrological processes have different characteristics . Therefore, according to the characteristics of hydrological process , Determine the appropriate similarity measures .
4 Theoretical methods
Similarity query of hydrological time series , The data objects to be processed are based on hydrological data , The process can be divided into two main stages : Query preparation stage and Similarity query stage .
Query preparation stage . Include Data preprocessing And Feature extraction of time series .
① In any data mining task , Data preprocessing is one of the essential key tasks , Data preprocessing in this model involves data integration 、 Data purification 、 Data selection and sequence regularization transformation ;
② Pattern representation of time series is a prerequisite for time series data mining , It is one of the key problems of hydrological time series similarity mining , Its effect directly affects the results of data mining .Similarity query stage . Users submit query requests , Based on the pattern representation, the system performs pattern matching according to the similarity measurement , And display the results visually to users .
Pattern matching ( Similarity measure )+ Pattern representation of time series It is also called the two cornerstones of time series similarity query .
5 Piecewise linear representation based on feature points
Time series pattern representation :
This article USES : Piecewise linear representation based on feature points , As a pattern representation of time series .(PLR)For the time series with obvious periodicity and frequent fluctuations of short-term patterns , It can effectively realize data compression , So as to grasp the change characteristics of the overall pattern of time series .
An example of segmentation is shown in the figure below :

5.1 Piecewise linear representation

5.2 Definition of characteristic points

6 Similarity measure of time series
The definition of similarity measure of time series should meet the following conditions :
(1) Similarity measures allow for imprecise matching , Support multiple deformations of time series ;
(2) The calculation of similarity measure must be efficient ;
(3) Similarity measures should support fast indexing ;
(4) Similarity measure can be applied to other data mining fields , Such as clustering and classification of time series 、 Frequent pattern discovery and exception discovery, etc ;Common similarity measures are :Minkowski distance 、 Dynamic time bending distance 、 Longest common substring, etc .
6.1 Dynamic pattern matching distance (DPM)
- DPM Distance is not calculated based on matching between points , They are matched by patterns .
- advantage : The definition of patterns is very flexible ; The average length of the pattern is generally much larger than 1, The dimension reduction of time series is realized ( The number of patterns in time series is much smaller than the length of time series )
6.2 Algorithm steps
Defining patterns . Extracting pattern features from time series , Transform time series into feature space , Get the pattern representation of the time series .
For piecewise linear representations , A pattern is an interpolated segment of a time series field , It can be characterized by the length of the line segment 、 Slope, etc ;Define the distance between patterns
边栏推荐
- I bet on performance and won the CTO of the company. I want to build Devops platform!
- Saving db4i depth camera pictures with MATLAB
- Those high-frequency written tests and interview questions in [Jianzhi offer & Niuke 101] - linked list
- 1286_FreeRTOS的任务优先级设置实现分析
- Is it safe to buy funds on the brokerage account
- redisson看门狗机制,redisson看门狗性能问题,redisson源码解析
- 浅谈CVPR2022的几个研究热点
- [programming compulsory training 3] find the longest consecutive number string in the string + the number that appears more than half of the times in the array
- 【mysql学习笔记25】sql语句优化
- Apple账号密码自动填充
猜你喜欢

Basic knowledge of MATLAB

Jax's deep learning and scientific computing

2022制冷与空调设备运行操作国家题库模拟考试平台操作

iNFTnews | 从《雪崩》到百度“希壤”,元宇宙30年的16件大事
![[lingo] solve quadratic programming](/img/4d/3f7de69943f29a71c4039299c547f7.png)
[lingo] solve quadratic programming

2022 mobile crane driver test exercises and online simulation test

運維管理系統,人性化操作體驗

LeetCode+ 71 - 75

2022年茶艺师(中级)复训题库及答案

Will Internet talents be scarce in the future? Which technology directions are popular?
随机推荐
[Shenzhen IO] precise Food Scale (some understanding of assembly language)
C language implementation [minesweeping game] full version (implementation source code)
【微服务|openfeign】Feign的日志记录
【技能】创建.bat快速打开网页
Oracle创建自增id
下载Xshell和Xftp
【推荐系统】美团外卖推荐场景的深度位置交互网络DPIN的突破与畅想
kubernetes资源对象介绍及常用命令(二)
Reply and explanation on issues related to "online training of network security education in 2022"
Stepsister becomes stepmother, son breaks off relationship with himself, and musk, the world's richest man, why is it so miserable?
Image style migration cyclegan principle
Redisson uses the full solution - redisson official document + comments (Part 2)
如何制作专属的VS Code主题
Cadence OrCAD Capture “网络名”相同,但是未连接或连接错误的解放方案之nodename的用法
Minecraft 1.16.5模组开发(五十一) 方块实体 (Tile Entity)
运维管理有什么实用的技巧吗
2022年流动式起重机司机考试练习题及在线模拟考试
The computer has a network, but all browser pages can't be opened. What's the matter?
Is it safe and reliable for Huatai Securities to open an account? How to open Huatai Securities Account
LeetCode+ 71 - 75