当前位置:网站首页>Thesis learning -- Analysis and Research on similarity query of hydrological time series
Thesis learning -- Analysis and Research on similarity query of hydrological time series
2022-07-01 07:38:00 【Graduate students are not late】
List of articles
Write it at the front :《 hydrological 》;2009 year ;
author : Li Wei 、 Sun Honglin
1 Abstract
- Hydrological time series similarity query , It can be used for Rain flood process prediction 、 Environmental evolution analysis 、 Analysis of hydrological process law Other aspects .
- The most direct application is , Answer questions often asked in flood control command :“ The current hydrological process is equivalent to the same process in which period in history ”
- Introduce the theory and technology of data warehouse and data mining .
2 introduction
3 Problem description
Traditional time series similarity search , It mainly emphasizes precise matching , But in data mining applications , Because of the huge amount of data , Generally, it is based on approximate matching “ Approximate search ”.
The key work of hydrological time series similarity mining is :
Division of subsequences . In the National Hydrological Database , Flood engineering has been divided according to the theory of runoff generation , Form an excerpt of various elements .
however , In the daily value class , It needs to be divided according to the type of problem to be solved , We need to make the partition rules It conforms to the hydrological theory , And suitable for computer processing .Sequence feature extraction . Generally, the sequence is transformed , For example, Fourier transform 、 Wavelet transform or piecewise average mapping to feature space .
Determination of similarity measure . For hydrological processes , Different hydrological processes have different characteristics . Therefore, according to the characteristics of hydrological process , Determine the appropriate similarity measures .
4 Theoretical methods
Similarity query of hydrological time series , The data objects to be processed are based on hydrological data , The process can be divided into two main stages : Query preparation stage and Similarity query stage .
Query preparation stage . Include Data preprocessing And Feature extraction of time series .
① In any data mining task , Data preprocessing is one of the essential key tasks , Data preprocessing in this model involves data integration 、 Data purification 、 Data selection and sequence regularization transformation ;
② Pattern representation of time series is a prerequisite for time series data mining , It is one of the key problems of hydrological time series similarity mining , Its effect directly affects the results of data mining .Similarity query stage . Users submit query requests , Based on the pattern representation, the system performs pattern matching according to the similarity measurement , And display the results visually to users .
Pattern matching ( Similarity measure )+ Pattern representation of time series It is also called the two cornerstones of time series similarity query .
5 Piecewise linear representation based on feature points
Time series pattern representation :
This article USES : Piecewise linear representation based on feature points , As a pattern representation of time series .(PLR)For the time series with obvious periodicity and frequent fluctuations of short-term patterns , It can effectively realize data compression , So as to grasp the change characteristics of the overall pattern of time series .
An example of segmentation is shown in the figure below :
5.1 Piecewise linear representation
5.2 Definition of characteristic points
6 Similarity measure of time series
The definition of similarity measure of time series should meet the following conditions :
(1) Similarity measures allow for imprecise matching , Support multiple deformations of time series ;
(2) The calculation of similarity measure must be efficient ;
(3) Similarity measures should support fast indexing ;
(4) Similarity measure can be applied to other data mining fields , Such as clustering and classification of time series 、 Frequent pattern discovery and exception discovery, etc ;Common similarity measures are :Minkowski distance 、 Dynamic time bending distance 、 Longest common substring, etc .
6.1 Dynamic pattern matching distance (DPM)
- DPM Distance is not calculated based on matching between points , They are matched by patterns .
- advantage : The definition of patterns is very flexible ; The average length of the pattern is generally much larger than 1, The dimension reduction of time series is realized ( The number of patterns in time series is much smaller than the length of time series )
6.2 Algorithm steps
Defining patterns . Extracting pattern features from time series , Transform time series into feature space , Get the pattern representation of the time series .
For piecewise linear representations , A pattern is an interpolated segment of a time series field , It can be characterized by the length of the line segment 、 Slope, etc ;Define the distance between patterns
边栏推荐
- 如何制作专属的VS Code主题
- 2022年流动式起重机司机考试练习题及在线模拟考试
- Conscience Amway universal wheel SolidWorks model material website
- What information does the supplier need to know about Audi EDI project?
- Basic knowledge of MATLAB
- [recommendation system] breakthrough and imagination of deep location interactive network dpin for meituan takeout recommendation scenario
- Alibaba OSS postman invalid according to policy: policy condition failed: ["starts with", "key", "test/"]
- 微软宣布开源 (GODEL) 语言模型聊天机器人
- PWN攻防世界int_overflow
- The programmer of Beipiao posted a post for help late at night: I am lonely when my girlfriend is gone
猜你喜欢
2022 tea master (intermediate) recurrent training question bank and answers
【深圳IO】精确食品称(汇编语言的一些理解)
【目标检测】目标检测界的扛把子YOLOv5(原理详解+修炼指南)
Warm congratulations on the successful listing of five elements hehe liquor
Redisson utilise la solution complète - redisson Documents officiels + commentaires (Partie 1)
热烈祝贺五行和合酒成功挂牌
C# 读写自定义的Config文件
We found a huge hole in MySQL: do not judge the number of rows affected by update!!!
【编程强训】删除公共字符(哈希映射)+组队竞赛(贪心)
Mysql与Redis一致性解决方案
随机推荐
运维管理系统,人性化操作体验
ONES 创始人王颖奇对话《财富》(中文版):中国有没有优秀的软件?
奥迪AUDI EDI 项目中供应商需要了解哪些信息?
Minecraft 1.16.5模组开发(五十一) 方块实体 (Tile Entity)
weback5基础配置详解
【微服务|openfeign】Feign的日志记录
热烈祝贺五行和合酒成功挂牌
Eigen矩阵运算库快速上手
redisson看门狗机制,redisson看门狗性能问题,redisson源码解析
[R language] age sex frequency matching select samples for case-control study, and perform frequency matching on age and sex
2022 tea master (intermediate) recurrent training question bank and answers
Apple account password auto fill
kubernetes资源对象介绍及常用命令(二)
Eigen matrix operation Library
Inftnews | from "avalanche" to Baidu "xirang", 16 major events of the meta universe in 30 years
Alibaba OSS postman invalid according to policy: policy condition failed: ["starts with", "key", "test/"]
【推荐系统】美团外卖推荐场景的深度位置交互网络DPIN的突破与畅想
[microservice openfeign] feign's log record
Ctfhub port scan (SSRF)
Autosar 学习记录(1) – EcuM_Init