CADD course learning (6) -- obtaining existing virtual compound libraries (DrugBank, ZINC)
2022-07-05 07:24:00 【Stunned flounder (】
Introduction to the DrugBank database
DrugBank is a bioinformatics and cheminformatics database provided by the University of Alberta. It is a unique resource that combines detailed drug data with comprehensive drug target information.
The recently released DrugBank version 5.1.9 (released 2022-01-03) contains 13577 drug entries, including 2634 approved small-molecule drugs, 1377 approved biotech (protein/peptide) drugs, 131 nutraceuticals, and 6375 experimental drugs. In addition, 5241 non-redundant protein sequences (drug targets, enzymes, transporters, carriers) are associated with these drug entries. Each DrugCard entry contains more than 200 data fields, half of them devoted to drug/chemical data and the other half to drug target or protein data.
DrugBank's most distinctive feature is its support for comprehensive and complex searches. Combined with DrugBank's visualization software, these tools allow scientists to easily retrieve new drug targets, compare drug structures, study drug mechanisms, and explore new drugs.
Introduction to the ZINC database
ZINC is a free database of commercially available compounds for virtual screening. It contains more than 13 million purchasable compounds in 3D format. ZINC is provided by the Shoichet Laboratory in the Department of Pharmaceutical Chemistry at the University of California, San Francisco (UCSF).
ZINC is a database of small-molecule structures. It holds a large number of small-molecule compounds that are available on the market, which makes testing drug properties very convenient for drug research and development: there is no need to design a synthetic route to obtain a compound before testing its activity. With the development of computing technology, more and more computer-aided drug design approaches have accelerated drug screening. After screening a large number of molecules from ZINC, the compounds predicted to be active can be purchased directly through the supplier links that ZINC provides, so that their in vitro activity can be determined quickly and conveniently.
The free ZINC database contains compound data from ChemBridge, Enamine, PubChem, and many other sources. You can download all of the data for free, or download the data of a single supplier.
ZINC includes a fragment library, a drug-like library, a drug library, a natural products library, and others. Each compound record carries information such as the supplier, the number of rotatable bonds, and the numbers of hydrogen-bond acceptors and donors. Users can download a specified sub-library according to their needs and use it for virtual screening.
ZINC20
ZINC keeps growing. ZINC20 now includes 1.4 billion compounds, 1.3 billion of which come from 310 product catalogs of 150 companies. These compounds follow the 90/90/90 rule: more than 90% of the catalogs are updated every 90 days, and more than 90% of the compounds can be purchased. Newer datasets include 10^10 molecules and have not been added to ZINC.
To study the molecular diversity of on-demand libraries and physical screening collections, the authors analyzed two aspects: scaffold diversity and molecular shape. They computed Bemis-Murcko scaffolds for ZINC's on-demand library (most of which comes from Enamine REAL) and for several public physical screening libraries (the NIH small-molecule library MLSMR, the UCSF small-molecule library SMDC, and ZINC's in-stock Ro4 compounds), and counted the number of compounds per scaffold.
The results show that more than 97% of the compounds cannot be found in the ZINC in-stock set, and the number of new scaffolds increases almost linearly with the number of molecules. While the number of scaffolds increases 16-fold, the number of molecules in the on-demand library is 88 times that of the ZINC in-stock set. After classifying the molecular shapes in each library with the NPMI method, the molecules of the on-demand library also turn out to be more diverse in shape than those of the physical screening libraries, with a marked increase in the number of disc-like (such as a benzene ring) and sphere-like (such as adamantane) molecules.
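The per-scaffold counting described above can be sketched in a few lines of Python. This is only an illustration, not the authors' pipeline: it assumes the Bemis-Murcko scaffold of every compound has already been computed with a cheminformatics toolkit and written as a canonical SMILES string, so that equal strings mean equal scaffolds. The function name scaffold_stats, the compound IDs, and the tiny example libraries are all hypothetical.

```python
from collections import Counter


def scaffold_stats(library, reference):
    """Summarize scaffold diversity of `library` relative to `reference`.

    Both arguments are iterables of (compound_id, scaffold_smiles) pairs.
    Scaffolds are assumed to be precomputed canonical SMILES strings.
    """
    # Number of compounds that share each scaffold in the library.
    lib_counts = Counter(scaffold for _, scaffold in library)
    # Set of scaffolds present in the reference (e.g. in-stock) collection.
    ref_scaffolds = {scaffold for _, scaffold in reference}
    # Scaffolds that appear in the library but not in the reference.
    novel = [s for s in lib_counts if s not in ref_scaffolds]
    n_scaffolds = len(lib_counts)
    return {
        "n_compounds": sum(lib_counts.values()),
        "n_scaffolds": n_scaffolds,
        "novel_scaffold_fraction": len(novel) / n_scaffolds if n_scaffolds else 0.0,
        "compounds_per_scaffold": lib_counts,
    }


# Hypothetical toy data: benzene, pyridine and adamantane scaffolds.
on_demand = [
    ("d1", "c1ccccc1"),
    ("d2", "c1ccccc1"),
    ("d3", "c1ccncc1"),
    ("d4", "C1C2CC3CC1CC(C2)C3"),
]
in_stock = [("s1", "c1ccccc1")]

stats = scaffold_stats(on_demand, in_stock)
print(stats["n_compounds"], stats["n_scaffolds"], stats["novel_scaffold_fraction"])
```

Comparing scaffolds as plain strings only works if both libraries were canonicalized by the same toolkit with the same settings; otherwise identical scaffolds can be counted as different.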
Search
Download
Select the range to download
Download method:
1. In ZINC, select a range of molecular weight and logP, choose the smi format, and download; this produces the file ZINC-downloader-2D-smi.wget.
2. Download wgetwin-1531-binary, extract it, and click the wget.exe file.
3. Add the wgetwin-1531-binary directory to the PATH system environment variable (try not to include Chinese characters in the directory path).
4. Put the ZINC-downloader-2D-smi.wget file and wget.exe in the same directory.
5. Open a cmd window and run: wget.exe -i ZINC-downloader-2D-smi.wget
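If installing wget on Windows is inconvenient, the same download can be approximated with the Python standard library. The sketch below is an assumption-laden alternative, not part of the original procedure: it assumes the downloader file is plain text in which every file to fetch appears as an http(s) URL (either one URL per line or embedded in a wget command line). The function names, the output directory zinc_smi, and the example.org URLs in the sample are hypothetical.

```python
import re
import urllib.request
from pathlib import Path

# Matches any http(s) URL up to the next whitespace character.
URL_PATTERN = re.compile(r"https?://\S+")


def extract_urls(text):
    """Return the URLs found in the downloader file, in order, without duplicates."""
    return list(dict.fromkeys(URL_PATTERN.findall(text)))


def download_all(wget_file, out_dir="zinc_smi"):
    """Download every URL listed in `wget_file` into `out_dir` (needs network access)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    urls = extract_urls(Path(wget_file).read_text())
    for url in urls:
        # Name the local file after the last path component of the URL.
        target = out / url.rsplit("/", 1)[-1]
        if not target.exists():  # skip files fetched in an earlier run
            urllib.request.urlretrieve(url, str(target))
    return urls


# Hypothetical sample showing the two line styles the parser accepts.
sample = (
    "mkdir -pv BA && wget http://example.org/2D/BA/BAAA.smi -O BA/BAAA.smi\n"
    "http://example.org/2D/BA/BAAB.smi\n"
)
print(extract_urls(sample))
```

To run the actual download, call download_all("ZINC-downloader-2D-smi.wget") from the directory that contains the file. Files that already exist are skipped, so an interrupted run can simply be restarted.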
Introduction to the ChEMBL database
ChEMBL is a free online database developed by the European Bioinformatics Institute (EBI). It collects bioactivity data for a wide range of targets and compounds from a large body of literature, and gives medicinal chemists a convenient platform for querying the bioactivity data of targets or compounds. As of October 29, 2019, the database held 12482 targets, 1.879 million compounds, and about 15.5 million bioactivity records.
With this database, users can quickly look up the compounds reported so far for a given target together with their activity data, and can also look up which targets a compound has been tested against and the results of those tests. The data come from the published literature, so they are relatively reliable and can be traced back to their source. Users can therefore save a great deal of time otherwise spent reading literature and collecting compound data, obtain accurate compounds and their biological data quickly, and speed up drug design and drug development.
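The target-to-compound lookup described above can also be done programmatically. The following is a minimal sketch that assumes ChEMBL's public REST web services at https://www.ebi.ac.uk/chembl/api/data and their activity resource filtered by target_chembl_id; the function names are hypothetical, CHEMBL203 is used only as an example target ID, and the response field names reflect my understanding of that service and should be checked against its documentation.

```python
import json
import urllib.parse
import urllib.request

# Base address of the ChEMBL web services (assumed, see the lead-in).
CHEMBL_API = "https://www.ebi.ac.uk/chembl/api/data"


def activity_url(target_chembl_id, limit=20):
    """Build the query URL for activity records measured against one target."""
    query = urllib.parse.urlencode(
        {"target_chembl_id": target_chembl_id, "limit": limit}
    )
    return f"{CHEMBL_API}/activity.json?{query}"


def fetch_activities(target_chembl_id, limit=20):
    """Return (compound ID, activity type, value, units) tuples (needs network access)."""
    with urllib.request.urlopen(activity_url(target_chembl_id, limit)) as resp:
        payload = json.load(resp)
    return [
        (
            record.get("molecule_chembl_id"),
            record.get("standard_type"),
            record.get("standard_value"),
            record.get("standard_units"),
        )
        for record in payload.get("activities", [])
    ]


# Only the URL is built here; call fetch_activities("CHEMBL203") to query the service.
print(activity_url("CHEMBL203", limit=5))
```

Replacing the target_chembl_id filter with molecule_chembl_id gives the reverse lookup, that is, the targets a given compound has been tested against.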
Natural products and traditional Chinese medicine ingredients database
Marine natural products database :http://mc3d.qnlm.ac/
TCMSP, the Traditional Chinese Medicine Systems Pharmacology database and analysis platform: https://old.tcmsp-e.com/tcmsp.php
Natural products database :http:/harmdata.ncmicn/virtualcompound/index.asp