当前位置:网站首页>The combination of deep learning model and wet experiment is expected to be used for metabolic flux analysis
The combination of deep learning model and wet experiment is expected to be used for metabolic flux analysis
2022-07-05 08:49:00 【Python code doctor】
background
On the road of protein structure transformation , Need powerful computer programming assistance , In the field of deep learning , Found a powerful tool , So study how to combine with wet experiment .
Tools
DeeplearningApproach Anconda python And various dependent packages Computer Wet experimental data
Realize the idea
1. First , Good configuration DeeplearningApproach, How to configure see another article 《 Deep learning predicts enzyme activity parameters, improves enzyme constraint model construction, and builds ab initio environment 》.
2. Use the mutation sites with existing data to predict , For example, I want to predict the sequence of a protein L59,W60,Y153,R416 Site mutation to alanine (A), So I need to configure my tsv file , I wrote a script for the mutation here , The code is as follows :
import copy
def sort(a,b,c,d):
for n,i in enumerate(a):
if n+1 == b: # Determine whether it is the site I want
d = a[n] # Assign this site of the original sequence to d
a = copy.deepcopy(a) # Here you need a deep copy , Otherwise it will change WT Sequence , Of course, if it is the same site, it can be predicted as others 19 Amino acids can be used without
a[n] = c # This site of the original sequence is replaced by the site I need ,
i = a[n] # Extract the changed amino acids
break
else:
a[n] = i
a = ''.join(a)
# print(' Has already put the first %d Amino acids %s Change it to %s, The output sequence is as follows :%s'%((n+1),d,i,a))
print(' Enter the substrate name '+' Enter the chemical formula of the substrate '+a) # The difference between a plus sign and a comma
def main():
# Manual input
# while True:
# a = list(input(' Please enter the amino acid sequence :'))
# b = int(input(' Please select the amino acid site to be modified :'))
# c = input(' Please select the single letter abbreviation of amino acid to be changed :')
# sort(a,b,c,d=None)
aa_list = ['A','G','V','L','I','P','F','Y','W','S','T','C','M','N','Q','D','E','K','R','H']
pro = ' Input amino acid sequence '
pro_list = []
for p in pro:
pro_list.append(p)
for i in aa_list:
sort(pro_list,231,i,d=None)
if __name__ == '__main__':
main()
3. By extraction WT,L59,W60,Y153,R416 Of Kcat Value to draw a column chart ( Of course, it's OK to look without drawing , Just a few data ), Compare with the existing experimental data :



Deep excavation
1) How to use it DeeplearningApproach characteristic ?
Make one tsv library , This library contains unit point mutations and double site mutations of all amino acids near the active site of the protein , And WT Of Kcat It's worth comparing , Screen beneficial mutations for experiments .
2) Can you make DeeplearningApproach More suitable for a certain exact experiment , Make it Kcat The value is more accurate , And then use it Kcat Do metabolic flux analysis , Maybe we can start from two aspects :
- Adjust the parameters , To optimize the fitting / Under fitting phenomenon , Do some super parameter optimization , Make it more appropriate to the existing experimental results .
- Using a large amount of experimental data ( At least one million ) Retraining the model , Get a model with strong specificity ( It is almost impossible to achieve , Just a guess )
Be careful
The experimental data are fictitious , To introduce the method .
边栏推荐
- Basic number theory - fast power
- Hello everyone, welcome to my CSDN blog!
- 696. 计数二进制子串
- LLVM之父Chris Lattner:为什么我们要重建AI基础设施软件
- Guess riddles (4)
- 12、动态链接库,dll
- Use arm neon operation to improve memory copy speed
- 【日常训练--腾讯精选50】557. 反转字符串中的单词 III
- kubeadm系列-02-kubelet的配置和启动
- [Niuke brush questions day4] jz55 depth of binary tree
猜你喜欢
Typescript hands-on tutorial, easy to understand
Ros-10 roslaunch summary
Guess riddles (5)
资源变现小程序添加折扣充值和折扣影票插件
Guess riddles (2)
Guess riddles (3)
猜谜语啦(9)
Daily question - input a date and output the day of the year
Programming implementation of ROS learning 5-client node
Halcon clolor_ pieces. Hedv: classifier_ Color recognition
随机推荐
Bit operation related operations
Guess riddles (4)
Run menu analysis
Digital analog 2: integer programming
Redis实现高性能的全文搜索引擎---RediSearch
GEO数据库中搜索数据
Numpy 小坑:维度 (n, 1) 和 维度 (n, ) 数组相加运算后维度变为 (n, n)
Halcon shape_ trans
STM32 lights up the 1.8-inch screen under Arduino IDE
Redis implements a high-performance full-text search engine -- redisearch
Solutions of ordinary differential equations (2) examples
Program error record 1:valueerror: invalid literal for int() with base 10: '2.3‘
Task failed task_ 1641530057069_ 0002_ m_ 000000
Halcon wood texture recognition
Search data in geo database
Several problems to be considered and solved in the design of multi tenant architecture
MPSoC QSPI flash upgrade method
golang 基础 —— golang 向 mysql 插入的时间数据和本地时间不一致
The first week of summer vacation
[牛客网刷题 Day4] JZ35 复杂链表的复制