Paper notes: Analysis and Research on Similarity Query of Hydrological Time Series
2022-07-01 07:38:00 【Graduate students are not late】
Foreword: the paper appeared in the journal 《Hydrology》 in 2009.
Authors: Li Wei, Sun Honglin

1 Abstract
- Similarity query over hydrological time series can be used for rainfall-flood process prediction, environmental evolution analysis, analysis of the laws governing hydrological processes, and other purposes.
- The most direct application is answering a question often asked in flood-control command: "Which period in history had a process equivalent to the current hydrological process?"
- The paper brings in the theory and techniques of data warehousing and data mining.
2 Introduction

3 Problem description
Traditional time series similarity search mainly emphasizes exact matching. In data mining applications, however, the volume of data is so large that the search is generally an "approximate search" based on approximate matching.
The key tasks in hydrological time series similarity mining are:
- Division of subsequences. In the National Hydrological Database, flood processes have already been divided according to runoff-generation theory, forming excerpts of the various elements. Daily-value data, however, has to be divided according to the type of problem being solved, and the partition rules must both conform to hydrological theory and be suitable for computer processing.
- Sequence feature extraction. The sequence is generally transformed and mapped into a feature space, for example by a Fourier transform, a wavelet transform, or piecewise averaging.
- Determination of the similarity measure. Different hydrological processes have different characteristics, so an appropriate similarity measure has to be chosen according to the characteristics of the process at hand.
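To make the feature-extraction step concrete, here is a minimal sketch of the piecewise-average mapping mentioned above. The function name `piecewise_average`, the equal-width chunking, and the sample values are illustrative assumptions, not details taken from the paper.

```python
from typing import List, Sequence


def piecewise_average(series: Sequence[float], n_segments: int) -> List[float]:
    """Map a series to n_segments values, each the mean of one contiguous chunk."""
    n = len(series)
    if not 1 <= n_segments <= n:
        raise ValueError("n_segments must be between 1 and len(series)")
    features = []
    for k in range(n_segments):
        # Integer boundaries spread any remainder evenly across the chunks.
        start = k * n // n_segments
        end = (k + 1) * n // n_segments
        chunk = series[start:end]
        features.append(sum(chunk) / len(chunk))
    return features


# Hypothetical daily flow values, reduced from 8 points to 4 features.
daily_flow = [10.0, 12.0, 30.0, 34.0, 20.0, 16.0, 11.0, 9.0]
print(piecewise_average(daily_flow, 4))  # [11.0, 32.0, 18.0, 10.0]
```

Each feature summarizes one chunk, so a later comparison works on 4 values instead of 8; the same idea scales to long daily-value records.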
4 Theoretical methods
In similarity query of hydrological time series, the data objects being processed are hydrological data, and the process can be divided into two main stages: the query preparation stage and the similarity query stage.
- Query preparation stage. This includes data preprocessing and feature extraction of the time series.
  ① Data preprocessing is an essential task in any data mining effort. In this model it covers data integration, data cleaning, data selection, and normalization of the sequences.
  ② Pattern representation of the time series is a prerequisite for time series data mining and one of the key problems of hydrological time series similarity mining; its quality directly affects the mining results.
- Similarity query stage. The user submits a query request; working on the pattern representation, the system performs pattern matching according to the similarity measure and presents the results to the user visually.
Pattern matching (the similarity measure) and pattern representation of the time series are known as the two cornerstones of time series similarity query.
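The notes do not say which normalization the paper applies during preprocessing. As one common possibility, the sketch below uses z-score normalization, which removes differences in level and scale so that two processes are compared by shape. The function name `z_normalize` and the sample values are assumptions made for illustration.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def z_normalize(series: Sequence[float]) -> List[float]:
    """Shift a series to zero mean and scale it to unit standard deviation."""
    mu = mean(series)
    sigma = pstdev(series)
    if sigma == 0:
        # A constant series has no shape to compare; return all zeros.
        return [0.0 for _ in series]
    return [(x - mu) / sigma for x in series]


# Two hypothetical flood hydrographs with the same shape but different magnitude.
small_flood = [1.0, 2.0, 3.0, 2.0, 1.0]
large_flood = [10.0, 20.0, 30.0, 20.0, 10.0]
print(z_normalize(small_flood))
print(z_normalize(large_flood))
```

After normalization the two series are numerically almost identical, so a later distance computation reflects the shape of the process and not its absolute magnitude.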
5 Piecewise linear representation based on feature points
Time series pattern representation:
The paper uses a piecewise linear representation (PLR) based on feature points as the pattern representation of the time series. For time series with obvious periodicity and frequent short-term fluctuations, it compresses the data effectively while capturing how the overall pattern of the series changes.
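As a rough illustration of the idea, the sketch below picks feature points and joins them with straight segments. The paper's own definition of feature points is not reproduced in these notes, so the code assumes the simplest choice: the two endpoints plus every strict local maximum or minimum. The function names and sample values are hypothetical.

```python
from typing import List, Sequence, Tuple


def feature_points(series: Sequence[float]) -> List[int]:
    """Indices of the endpoints plus every strict local maximum or minimum."""
    if not series:
        return []
    last = len(series) - 1
    idx = [0]
    for i in range(1, last):
        prev_diff = series[i] - series[i - 1]
        next_diff = series[i + 1] - series[i]
        if prev_diff * next_diff < 0:  # the series changes direction at i
            idx.append(i)
    if last > 0:
        idx.append(last)
    return idx


def to_segments(series: Sequence[float], idx: Sequence[int]) -> List[Tuple[int, float]]:
    """Describe each segment between consecutive feature points as (length, slope)."""
    return [
        (b - a, (series[b] - series[a]) / (b - a))
        for a, b in zip(idx, idx[1:])
    ]


# Hypothetical water-stage readings: rise, fall, rise.
stage = [1.0, 2.0, 4.0, 3.0, 1.0, 2.0, 5.0]
points = feature_points(stage)
print(points)                      # [0, 2, 4, 6]
print(to_segments(stage, points))  # [(2, 1.5), (2, -1.5), (2, 2.0)]
```

Seven readings are reduced to three segments, which is the compression effect described above; a real implementation would likely also filter out extrema caused by small fluctuations.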
An example of segmentation is shown in the figure below:

5.1 Piecewise linear representation

5.2 Definition of characteristic points

6 Similarity measure of time series
A similarity measure for time series should satisfy the following conditions:
(1) It allows imprecise matching and supports multiple deformations of the time series;
(2) It must be efficient to compute;
(3) It should support fast indexing;
(4) It can be applied to other data mining tasks, such as clustering and classification of time series, frequent pattern discovery, and anomaly discovery.
Common similarity measures include the Minkowski distance, the dynamic time warping (DTW) distance, and the longest common substring.
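For reference, two of the measures named above can be sketched in a few lines. This is a textbook-style illustration and not the paper's implementation; the function names and the sample series are assumptions, and the DTW variant shown uses the absolute difference as the point cost with no warping-window constraint.

```python
from typing import Sequence


def minkowski(a: Sequence[float], b: Sequence[float], p: float = 2.0) -> float:
    """Minkowski distance between two equal-length series (p=1 Manhattan, p=2 Euclidean)."""
    if len(a) != len(b):
        raise ValueError("series must have the same length")
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)


def dtw(a: Sequence[float], b: Sequence[float]) -> float:
    """Dynamic time warping distance using absolute difference as the point cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] is the cheapest alignment of the first i points of a with the first j of b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# The same hypothetical flood peak, shifted by one time step.
flood_a = [0.0, 1.0, 3.0, 1.0, 0.0]
flood_b = [0.0, 0.0, 1.0, 3.0, 1.0]
print(minkowski(flood_a, flood_b, p=1))  # 6.0
print(dtw(flood_a, flood_b))             # 1.0
```

The point-to-point Minkowski distance penalizes the time shift heavily, while DTW aligns the two peaks and reports a small distance. That tolerance for deformation along the time axis is what condition (1) asks for.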
6.1 Dynamic pattern matching distance (DPM)
- The DPM distance is not computed from point-to-point matches; the series are matched pattern by pattern.
- Advantages: the definition of a pattern is very flexible; and because the average pattern length is generally much greater than 1, the number of patterns is much smaller than the length of the series, so the representation also reduces the dimensionality of the time series.
6.2 Algorithm steps
- Define the patterns. Extract pattern features from the time series and transform the series into the feature space to obtain its pattern representation. For a piecewise linear representation, a pattern is a line segment interpolated over an interval of the time series, and it can be characterized by features such as the segment's length and slope.
- Define the distance between patterns.
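The notes stop before giving the paper's pattern distance, so the sketch below is only one plausible reading of the two steps above. It assumes a pattern is a (length, slope) pair, assumes a weighted absolute difference as the distance between two patterns, and assumes the dynamic matching works like dynamic time warping applied to patterns instead of points. All names, weights, and sample values are hypothetical.

```python
from typing import Sequence, Tuple

Pattern = Tuple[float, float]  # (segment length, segment slope)


def pattern_distance(p: Pattern, q: Pattern,
                     w_len: float = 0.5, w_slope: float = 0.5) -> float:
    """Assumed distance between two patterns: weighted difference of their features."""
    return w_len * abs(p[0] - q[0]) + w_slope * abs(p[1] - q[1])


def dynamic_pattern_distance(ps: Sequence[Pattern], qs: Sequence[Pattern]) -> float:
    """Align two pattern sequences with a DTW-style recurrence over patterns."""
    inf = float("inf")
    n, m = len(ps), len(qs)
    # cost[i][j] is the cheapest matching of the first i patterns of ps with the first j of qs.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = pattern_distance(ps[i - 1], qs[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


# Hypothetical pattern sequences for the current process and a historical one.
current = [(2, 1.5), (2, -1.5), (2, 2.0)]
historic = [(3, 1.0), (2, -1.5), (2, 2.0)]
print(dynamic_pattern_distance(current, historic))  # 0.75
```

Because the recurrence runs over a handful of patterns and not over every data point, the table it fills is much smaller than in point-based DTW, which reflects the dimension-reduction advantage noted in 6.1.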