当前位置:网站首页>CADD course learning (6) -- obtain the existing virtual compound library (drugbank, zinc)
CADD course learning (6) -- obtain the existing virtual compound library (drugbank, zinc)
2022-07-05 07:24:00 【Stunned flounder (】
CADD Course study (6)-- Get the existing virtual compound library (Drugbank、ZINC)
Drugbank Database introduction
DrugBank database DrugBank It is a bioinformatics and chemical informatics database provided by the University of Alberta , It is a unique bioinformatics and chemical informatics resource , It combines detailed drug data with comprehensive drug target information .
Recently released DrugBank edition 5.1.9,2022-01-03 edition ) contain 13577 Drug entries , These include 2634 An approved small molecule drug 、1377 Approved Biotechnology ( protein / peptide ) medicine 、131 Nutrients and 6375 An experimental drug . Besides ,5241 A non elemental protein ( Drug target / enzyme / transporter / carrier ) Sequences are associated with these drug entries , Every DruaCard The entry contains 200 Multiple data fields , Half of them are used for drugs / Chemical data , The other half is used for drug target or protein data .
DuoBank The biggest feature is that it supports comprehensive and complex search , combination DrugBank Teachable software , These tools allow scientists to easily detect elements, compare drug structures with new drug matching targets 、 Study drug mechanism and explore new drugs .
ZINC Database introduction
ZINC Database a free database of commercially available compounds for virtual screening .ZINC Contains more than 1300 Ten thousand species 3D Format of the purchasable compound .ZINC Located at the University of California, San Francisco (UCSF) Department of Pharmaceutical Chemistry Shoichet Provided by laboratory .
ZINC Database is a small molecular structure database , There are a large number of small molecular compounds on the market in this database, which provides a very convenient drug property test for drug research and development , There is no need to design a synthetic route to obtain small molecular compounds before testing the activity of related drugs . Especially with the development of computing technology, more and more computer-aided drug design schemes have accelerated the process of drug screening . Through ZINC After screening a large number of molecules in the database, the screened compounds that may be active can be directly passed ZINC Provide the connection to find suppliers to buy small molecule compounds , So as to conveniently and quickly determine the in vitro activity of drugs .
ZINC The free database contains ChemBridge、Enamine and PubChem And many other compound data , You can download all of them for free and download the data of a single supplier .
ZINC The database includes a fragment library 、 Generic drug library 、 Drug bank 、 Natural products warehouse, etc , These compounds contain suppliers 、 Information about the number of rotatable bonds, hydrogen bond receptors and donors 、 According to customer needs , Download the row virtual filter of the specified database .
ZINC20
ZINC The scale of is expanding ,ZINC20 Now it includes 14 Billion compounds , among 13 Billion from 150 Companies in total 310 Product catalogs . these The compound satisfies 90/90/90 The rules , More than 90% Every 90 Update every day and 90% The above compounds can be purchased . The new datasets include 1010 Molecules , Not added to ZlNC in .
In order to study the molecular diversity in on-demand library and physical screening platform , The author carried out experiments from two aspects: skeleton diversity and molecular shape . Yes ZINC Customize the library on demand ( Most of it comes from Enamine REAL) And several other public physical screening libraries (NIH Small molecule library MLSMR,UCSF Small molecule library SMDC,ZIN Of Ro4 Compound inventory ) Calculation Bemis-Murcko Skeleton and count the number of compounds in each skeleton .
The results show that , More than 97% Compounds of cannot be found in ZINC Found in inventory , The number of new skeletons increases almost linearly with the number of molecules . When the number of skeletons increases 16 Times , The number of molecules in the on-demand library is ZINC Inventory 88 times . Use NPMI Methods after classifying the molecular shapes of each library , The molecules of the on-demand library are also more diverse in structure than the physical screening library , Discoid ( Such as benzene ring ) And spherical ( Such as adamantane ) The number of molecules increased significantly .
Search for
download
Select the scope to download
Download method :
1. stay ZINC Select a certain molecular weight and logP Data of nature range , download smi, get ZINC-downloader-2D-4mi.wget File worker
2. download wgetwin-1531-binary And extract the , Click on wget.exe file 93. Set up wgetwin-1531-binary Is in the system environment variable PATH A member of the variable ( Try not to include Chinese in the catalogue );
4. hold ZINC-downloader-2D smi.wget Document and wget.exe Put the files in the same directory ;
5. open cmd window ,wget.exe -i ZINC-downloader-2D-smi.wget
ChEMBL Database introduction
ChEMBL The database is the European Bioinformatics Institute (European Bioinformatics Institute,EB1) Developed an online Free database , It collects bioactivity data of various targets and compounds from a large number of literatures , It provides a very convenient platform for pharmaceutical chemists to query the bioactivity data of targets or compounds . By 2019 year 10 month 29 Japan , The database collects a total of 12482 A target ,187.9 10000 compounds , share 15500 Ten thousand pieces of bioactivity information .
Through this database , Users can quickly query the current reported compounds and their activity information of a target , You can also query which targets of a compound to do a biological activity test and its data . These data are from various reported literatures , The data is relatively reliable , And can trace the source , Query the source of the data . Through this database , Users can save a lot of time in consulting literature and collecting compound data , Quickly obtain accurate compounds and their biological data , Further accelerate the speed of drug design and drug development .
Natural products and traditional Chinese medicine ingredients database
Marine natural products database :http://mc3d.qnlm.ac/
TCMSP Pharmacology database and analysis platform of traditional Chinese medicine system :https//old tcmsp-e.com/tcmsp.php
Natural products database :http:/harmdata.ncmicn/virtualcompound/index.asp
边栏推荐
- Eclipse project recompile, clear cache
- Microservice registry Nacos introduction
- [tf1] save and load parameters
- 2022.06.27_ One question per day
- Solve tensorfow GPU modulenotfounderror: no module named 'tensorflow_ core. estimator‘
- MySQL setting trigger problem
- Machine learning Seaborn visualization
- I 用c I 实现队列
- Rough notes of C language (2) -- constants
- Binary search (half search)
猜你喜欢
CADD课程学习(5)-- 构建靶点已知的化合结构(ChemDraw)
Literacy Ethernet MII interface types Daquan MII, RMII, smii, gmii, rgmii, sgmii, XGMII, XAUI, rxaui
Chapter 2: try to implement a simple bean container
Today, share the wonderful and beautiful theme of idea + website address
第 2 章:小试牛刀,实现一个简单的Bean容器
SOC_ SD_ DATA_ FSM
IPage能正常显示数据,但是total一直等于0
Jenkins reported an error. Illegal character: '\ufeff'. Class, interface or enum are required
2022年PMP项目管理考试敏捷知识点(7)
When jupyter notebook is encountered, erroe appears in the name and is not output after running, but an empty line of code is added downward, and [] is empty
随机推荐
Miracast技术详解(一):Wi-Fi Display
Basic series of SHEL script (III) for while loop
2022 PMP project management examination agile knowledge points (7)
Application of MATLAB in Linear Algebra (4): similar matrix and quadratic form
Anaconda navigator click open no response, can not start error prompt attributeerror: 'STR' object has no attribute 'get‘
How can Oracle SQL statements modify fields that are not allowed to be null to allow nulls?
Idea to view the source code of jar package and some shortcut keys (necessary for reading the source code)
借助 Navicat for MySQL 软件 把 不同或者相同数据库链接中的某数据库表数据 复制到 另一个数据库表中
Word import literature -mendeley
[vscode] recommended plug-ins
ImportError: No module named ‘Tkinter‘
What is soda?
[tf1] save and load parameters
Simple operation of running water lamp (keil5)
氫氧化鈉是什麼?
[software testing] 02 -- software defect management
Ugnx12.0 initialization crash, initialization error (-15)
What is sodium hydroxide?
Now there are HTML files and MVC made with vs (connected to the database). How can they be connected?
Hdu1231 maximum continuous subsequence (divide and conquer or dynamic gauge or double pointer)