SOAP alignment result file format
2022-06-30 15:03:00 【Pan Gao】
Preface
For more, please visit my personal blog.
SOAP is a short-read alignment program developed by BGI (software home page). Unfortunately, that website is no longer maintained.
I recently took over an old project (from 2012) that uses SOAP for alignment, so I am summarizing here what I found online about the program's result files.
SOAP-format files can be opened with a plain text editor. Part of one looks like this:
CL100152537L1C001R001_82 TTATAAATAAAACTCCCATCTCCCTGGGACAGAGC FFEGGEFGDGGGFGFGGFGGGFGGF;@BAFF;[email protected] 18 a 35 + chr8 89537925 0 35M 35
CL100152537L1C001R001_100 AGAAAACACTCCCTCAGGGAAGTGCCAGCCCTCCT >[email protected]?DGFGGEGGFGB7?7FAAF>GF9BBGFGGGF 1 a 35 + chr11 65819516 1 G->15A2 35M 15G19
...
From left to right, the columns are:
- ID: the read's identifier.
- Read sequence: if the read aligns to the reverse strand of the reference, it is reverse-complemented so that it is reported on the forward strand.
- Quality: the quality values of the sequence, in the same order as the sequence. If the read was reverse-complemented, the quality values are reversed along with it.
- Number of hits: how many alignments the read has. Reads that do not align are omitted.
- a/b: paired-end flag showing which input file the read came from.
- Length: the read length. If the read was trimmed in order to align, this is the length of the trimmed fragment.
- +/-: whether the read aligns to the forward (+) or reverse (-) strand of the reference.
- Chromosome: name of the reference chromosome.
- Position: position of the read's first base on the chromosome, counting from 1.
- Number of mismatches: defaults to 0.
- Mismatch details: G->15A2 describes one mismatch at offset +15 from the alignment start (counting from 0), where the reference has G, the read has A, and the quality value is 2. As the first sample line shows, this column is absent when there are no mismatches.
- Match summary: 35M means 35 bases were aligned.
- Alignment details: 15G19 means the first 15 bases match, the 16th base (offset +16 on the reference, counting from 1) is a mismatch where the reference has G, and the remaining 19 bases match.
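The column layout above can be turned into a small parser. The following is only a sketch under stated assumptions, not an official SOAP tool: the names `SoapRecord`, `Mismatch`, and `parse_soap_line` are made up for illustration, columns are assumed to be separated by whitespace, and every mismatch is assumed to occupy its own column between the mismatch count and the last two columns. The quality string in the usage example is a placeholder, not real data.

```python
import re
from typing import List, NamedTuple


class Mismatch(NamedTuple):
    ref_base: str   # base on the reference
    offset: int     # 0-based offset from the alignment start
    read_base: str  # base on the read
    quality: int    # quality value reported for the read base


class SoapRecord(NamedTuple):
    read_id: str
    seq: str
    qual: str
    hits: int
    pair_flag: str  # "a" or "b"
    length: int
    strand: str     # "+" or "-"
    chrom: str
    pos: int        # 1-based position of the first base
    n_mismatch: int
    mismatches: List[Mismatch]
    match_summary: str      # e.g. "35M"
    alignment_details: str  # e.g. "15G19"


# Pattern for one mismatch entry such as "G->15A2".
MISMATCH_RE = re.compile(r"^([A-Za-z])->(\d+)([A-Za-z])(-?\d+)$")


def parse_mismatch(token: str) -> Mismatch:
    m = MISMATCH_RE.match(token)
    if m is None:
        raise ValueError(f"unrecognized mismatch entry: {token!r}")
    ref_base, offset, read_base, quality = m.groups()
    return Mismatch(ref_base, int(offset), read_base, int(quality))


def parse_soap_line(line: str) -> SoapRecord:
    f = line.split()
    if len(f) < 12:
        raise ValueError("expected at least 12 columns")
    # Assumption: whatever sits between the mismatch count (column 10)
    # and the last two columns is the list of mismatch entries.
    mismatches = [parse_mismatch(tok) for tok in f[10:-2]]
    return SoapRecord(
        read_id=f[0], seq=f[1], qual=f[2], hits=int(f[3]),
        pair_flag=f[4], length=int(f[5]), strand=f[6], chrom=f[7],
        pos=int(f[8]), n_mismatch=int(f[9]), mismatches=mismatches,
        match_summary=f[-2], alignment_details=f[-1],
    )


if __name__ == "__main__":
    # Second sample line from above, with a placeholder quality string.
    line = ("CL100152537L1C001R001_100 AGAAAACACTCCCTCAGGGAAGTGCCAGCCCTCCT "
            + "I" * 35 + " 1 a 35 + chr11 65819516 1 G->15A2 35M 15G19")
    rec = parse_soap_line(line)
    print(rec.chrom, rec.pos, rec.mismatches)
```

Taking the last two columns from the end of the line, rather than from fixed indices, keeps the sketch working for lines with zero mismatches, where the mismatch-details column is missing.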