Biological sequence intelligent analysis platform blog (1)
2022-06-11 07:06:00 【Boiled wine cos】
This project was written entirely by our team members, and we plan to publish a paper on it.
Background (Introduction)
The following outlines the biological and computational background of the project.
Bioinformatics is an important frontier discipline spanning the life sciences and the natural sciences. It relies on the analysis of biological sequences to make sense of large volumes of biological data. Before sequencing technology matured, data volumes were small and manual processing was sufficient. Today, with the number of biological sequences growing massively, manual processing has fallen far behind computational processing. At the same time, rapid progress in machine learning, deep learning, and natural language processing keeps producing new methods for analyzing biological sequences.
This also places higher demands on biologists: to extract the information behind biological sequences quickly and efficiently, they have to learn to program before they can use cutting-edge data analysis tools. There is therefore a clear demand for a platform or toolkit that accepts biological sequences and automates their analysis, prediction, and the related visualization.
However, limited by many factors, traditional machine learning methods are not accurate enough and cannot handle imbalanced datasets. Deep learning algorithms from natural language processing have also been widely introduced into biological sequence analysis. One example is the Bidirectional Encoder Representations from Transformers (BERT) model, which is built on the attention-based Transformer architecture and has achieved strong results on most natural language processing tasks. Papers presented at the AAAI conference have also shown that BERT can perform well in biological sequence analysis.
Finally, based on the requirements and analysis above, we developed a web server based on the BERT model. Compared with currently available tools, our server platform has the following main advantages:
- As far as we know, ours is the first BERT-based web platform for binary classification of sequences, and it provides download and visualization of the analysis results.
- The server can handle imbalanced datasets.
- The server can also generate sequence representations and visualizations for use in follow-up analysis.
- Unlike other machine-learning-based platforms, the workflow of our server can be treated as a black box. We provide an end-to-end service: users upload their sequences and get the results, without having to set any method-specific machine learning parameters.
- Our deep learning model transfers well, which lets us adapt it quickly to other follow-up tasks.
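The post does not say how the server deals with imbalanced datasets. One common option, shown here purely as an assumption and not as our actual implementation, is to weight each class inversely to its frequency so that the rare class contributes more to the training loss. The function name `balanced_class_weights` and the toy labels are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return {label: weight} with weight = n_samples / (n_classes * count).

    Rare classes get weights above 1, frequent classes get weights below 1.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {
        label: n_samples / (n_classes * count)
        for label, count in counts.items()
    }


# Hypothetical toy data: 8 negative sequences and 2 positive ones.
labels = [0] * 8 + [1] * 2
weights = balanced_class_weights(labels)
print(weights)  # {0: 0.625, 1: 2.5}
```

In a PyTorch training loop, weights like these could be converted to a tensor ordered by class index and passed as the `weight` argument of `torch.nn.CrossEntropyLoss`.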

Division of labor
The biological sequence intelligent analysis platform group held its kickoff meeting on September 20, 2021. The meeting mainly covered task allocation and the work schedule.
The Python code for the AI part of the project is mainly handled by jinjunru, the back-end interfaces by guochangrui, and the front end by Jiang Yi. chenchaoyi and fengjiuxin are responsible for the research for the paper, and the graduate student yinchenglin is in charge of writing it. Because there is no ready-made code the project can build on, all of the code has to be written by hand. We all started working on the second working day, and we plan to finish the code in three to four weeks and deploy it to our lab's servers for use worldwide.

The technology stack used in the project includes Python, PyTorch, Spring Boot, React, Ant Design (antd), a CDN, and more. This requires a large body of knowledge, and the members of our group need to learn it in a short time.
The laboratory provides computers for some team members, experimental equipment, and a discussion room. Knowledge of AI algorithms, the various web technologies, and related topics comes mainly from the students' self-study. After discussion, the team decided to use a C/S (client-server) architecture for the overall framework of the model and the front-end and back-end code.
Python-side architecture
Training uses PyTorch; storage is handled by the back end.
The required packages, i.e. requirements.txt (PyTorch is installed from PyPI under the name torch):
torch
matplotlib
seaborn
transformers
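Before a BERT-style model can be trained with these packages, uploaded sequences have to be turned into tokens. The post does not describe the upload format or the tokenization scheme, so the following is only a minimal sketch under two assumptions: sequences arrive as FASTA text, and each sequence is split into overlapping k-mers. The helper names `parse_fasta` and `to_kmers` are hypothetical, and the sketch uses only the standard library.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) pairs."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there is one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:].strip(), []
        elif header is not None:
            # Sequence lines may be wrapped; normalize case and collect them.
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def to_kmers(sequence, k=3):
    """Split a sequence into overlapping k-mers with stride 1."""
    if k <= 0:
        raise ValueError("k must be positive")
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


# Hypothetical upload containing two short sequences.
example = ">seq1 demo\nMKTA\nYIAK\n>seq2\nacgt\n"
for name, seq in parse_fasta(example):
    print(name, " ".join(to_kmers(seq, k=3)))
```

The space-separated k-mer strings could then be handed to whatever tokenizer the model uses. The value of k and the choice of k-mers at all are illustrative, not something the project has settled on in this post.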