当前位置:网站首页>November 22, 2021 [reading notes] - bioinformatics and functional genomics (Section 5 of Chapter 5 uses a comparison tool similar to blast to quickly search genomic DNA)
November 22, 2021 [reading notes] - bioinformatics and functional genomics (Section 5 of Chapter 5 uses a comparison tool similar to blast to quickly search genomic DNA)
2022-06-30 07:38:00 【Muyiqing】
- 5.5 Use something similar to BLAST A quick search of the genome using the comparison tool DNA
- demand : With the genome DNA The number of databases is growing , Contrast requires more and more tools
- Can be found in the genome DNA Exons found in
- Consider the genome when comparing DNA Contains sequencing errors
- There are corresponding algorithms to solve the problem that the genomes of related species are deleted in the comparison 、 repeat 、 The problem of inversion or displacement
- There are corresponding algorithms to solve DNA Small differences between sequences , Such as SNP site
- Use the standard set to evaluate the effect of genome alignment
- When using a sequential evolutionary random model (ROSE) The software package ships a simulated sequence set for testing , You can get the global comparison tool LAGAN The highest sensitivity , Local comparison tool ( Such as BLASTZ) The comparison in the compilation section is more accurate
- PatternHunter: Discontinuous seeds improve sensitivity
- PatternHunter Add a mismatch site between the matching sites , Improved speed and sensitivity ( Two models are described )
- Match as 1, Mismatch as 0 For example , The model format is as follows :
- BLASTN:11111111111
- PatternHunter:110100110010101111( Another kind 11101001010011011)
- reason : Very few clips are shared between adjacent seed matches , It makes the matching more independent than using the continuous seed model
- BLASTZ and MegaBLAST The isologous protein search algorithm also uses this strategy
- Icon

- Icon
- BLASTZ
- Compare the human and mouse genomes DNA Sequence .
- function :
- Pedigree specific sporadic repeats are removed from both sequences
- use 12 A word length match does not allow empty spaces to extend it , When the score exceeds a certain threshold , The extension will allow vacancies , namely 1110100110010101111
- For sections adjacent to the successful comparison, the second step is repeated with a lower ( More sensitive ) Word length , such as 7.
- have access to UCSC visualization BLASTZ Compared genome sequences
- Icon

- Icon
- have access to UCSC visualization BLASTZ Compared genome sequences
- Enredo and Pecan( A little )
- Ensemble For multiple sequence alignment , The comparison results are more accurate than other software based on other criteria
- MegaBLAST And discontinuities MegaBLAST
- MegaBLAST:NCBI Optimized for fast comparison of long DNA Tools for querying sequences , The default word length is 28, Adjustable to 256, Increase the running speed .
- Small word length , High sensitivity , Low running speed .
- The output similarity percentage threshold can be defined
- Corresponding match and mismatch scores can be defined
- Discontinuous MegaBLAST Tools for comparing more distant related genome sequences .
- Icon

- Icon
- MegaBLAST:NCBI Optimized for fast comparison of long DNA Tools for querying sequences , The default word length is 28, Adjustable to 256, Increase the running speed .
- class BLAST Comparison tool (BLAT)
- A very fast genome DNA Search tools
- BLAT Put the entire genome DNA The database is broken down into word indexes , These words contain all the non overlapping words in the genome 11-mers.
- BLAT The database indexing strategy used is also SSAH2 And subsequent MegaBLAST use
- Other properties :
- BLAST The extension is fired when two matches occur ,BLAT Multiple matches are required ;
- BLAT The main purpose is to find the data similar to the query sequence 95% The above matches
- BLAT Will search the boundaries of introns and exons , In essence, it establishes a model of gene structure .
- Query case , Icon

- Query case , Icon
- LAGAN
- Double sequence alignment
- Icon

- Icon
- The global double sequence alignment is carried out in three steps
- 1. In the two sequences, a local alignment is first generated to identify a set of anchors , Allow multiple short imprecise word matches instead of long exact word matches ;
- 2. Generate a rough global map , A collection containing the largest anchors sorted by score ;
- 3. Calculate the final global comparison , Limited to the priority areas defined by the rough map .
- Double sequence alignment
- SSAHA2
- SSAHA2 take DNA The database is converted to a hash table with fixed word length , Double sequence alignment can quickly find a match in the hash table .
- demand : With the genome DNA The number of databases is growing , Contrast requires more and more tools
- Welcome to join the group , Or add VX:bbplayer2021, Invite in

边栏推荐
- Tencent and Fudan University "2021-2022 yuan universe report" with 102 yuan universe collections
- STM32 register on LED
- Ad\dxp how to solve the problem of not knowing the schematic Library
- Dynamic memory management
- STM32 infrared communication
- PMIC power management
- ADC basic concepts
- Application of stack -- using stack to realize bracket matching (C language implementation)
- Proteus catalog component names and Chinese English cross reference
- 24C02
猜你喜欢

Raspberry pie 4B Getting Started Guide

Cubemx completes STM32F103 dual serial port 485 transceiver transmission

Record the problem that the system file cannot be modified as an administrator during the development process

2021 private equity fund market report (62 pages)

Deloitte: investment management industry outlook in 2022
![Arm debug interface (adiv5) analysis (I) introduction and implementation [continuous update]](/img/30/375860665aa1cc761adffc0e782744.jpg)
Arm debug interface (adiv5) analysis (I) introduction and implementation [continuous update]

Efga design open source framework fabulous series (I) establishment of development environment

Minecraft 1.16.5 module development (50) guide book

RT thread kernel application development message queue experiment

Research Report on search business value in the era of big search in 2022
随机推荐
Is it safe to open a stock account by mobile phone? What do I need to prepare for opening an account?
Use of ecostruxure (2) IEC61499 to establish function blocks
Investment and financing analysis report of Supply Chain & logistics industry in 2021
Processes, jobs, and services
2021 China Enterprise Cloud index insight Report
How to quickly delete routing in Ad
The most convenient serial port screen chip scheme designed at the charging pile in China
记录开发过程中无法使用管理员身份修改系统文件问题
C51 minimum system board infrared remote control LED light on and off
Final review -php learning notes 9-php session control
1、 Output debugging information: makefile file debugging information $(warning "tests" $(mkfile\u path)); makefile file path
Final review -php learning notes 11-php-pdo database abstraction layer
Video player (I): process
Introduction notes to pytorch deep learning (10) neural network convolution layer
4diac getting started example
DXP shortcut key
Implementation of binary search in C language
Use of ecostruxure (3) creating composite function blocks
Research Report on search business value in the era of big search in 2022
期末複習-PHP學習筆記6-字符串處理




