elastic stack
2022-07-03 02:02:00 【Cucumber stained with blood】
I. What is a search engine
- A system that, based on user needs, uses algorithms to retrieve data and present it to the user.
II. Inverted index
- A piece of text is segmented into terms according to certain rules, and the relationship between each term and the unique identifier (ID) of the data that contains it is recorded.
- Example: if we search jd.com for the keyword "mobile phone", the client sends the query to the server, and phones of many brands and names come back: Huawei phones, Nokia phones, Apple phones, and so on. The server has already split names like "xx mobile phone" into terms, so it looks up the term and returns the matching results to the client.
III. elasticsearch concepts
ES is short for Elasticsearch, an open-source, highly scalable, distributed full-text search engine. It can store and retrieve data in near real time, scales out to hundreds of servers, and handles PB-scale data.
IV. Core concepts of elasticsearch
- Index: where es stores data; analogous to a database in a relational database.
- Mapping: defines the type of each field; analogous to a table schema in a relational database.
- Document: the smallest unit of data in es, usually shown in JSON format; a document is equivalent to a row in a relational database.
- Inverted index: text is segmented into terms according to certain rules, and the relationship between each term and the data's unique identifier (ID) is recorded.
- Type: a type is like a table:
  1. In es5, one index could contain multiple types.
  2. In es6, one index can contain only one type.
  3. In es7, type has been removed; the default is _doc.
V. elasticsearch use cases
- Original link: https://blog.csdn.net/laoyang360/article/details/52227541
- Summary: es plays the role of "search engine" in the overall architecture. Data from the database must be synchronized to es in real time so that clients can query it.
VI. elasticsearch official website
https://www.elastic.co/
VII. Installing es
Version: elasticsearch-7.4.0. es depends on a jdk, but the es distribution bundles a jdk matched to its version, so there is no need to install one separately; using the bundled jdk is recommended.
1. Upload and unzip
tar xvf elasticsearch-7.4.0-linux-x86_64.tar.gz -C ~/APP
2. Modify the configuration file
cat << EOF >> /home/finance/APP/elasticsearch-7.4.0/config/elasticsearch.yml
# elasticsearch cluster name; change it to something meaningful
cluster.name: elasticsearch
# node name; elasticsearch assigns a random one by default, so set a meaningful name for easier management
node.name: node-1
# 0.0.0.0 allows access from outside the host
network.host: 0.0.0.0
# es http port
http.port: 9200
# required for the initial master election when bootstrapping the cluster
cluster.initial_master_nodes: ["node-1"]
# data directory
path.data: /home/finance/data/es7
# log directory
path.logs: /home/finance/logs/es7
EOF
2.1 Other configurable parameters
| Parameters | explain |
|---|---|
| cluster.name | Cluster name; defaults to elasticsearch. Change it to something meaningful. |
| node.name | Node name; es assigns a random name by default. Specify a meaningful one for easier management. |
| path.conf | Path to the configuration files; for tar/zip installs it defaults to the config folder under the es root, for rpm installs to /etc/elasticsearch. |
| path.data | Path(s) for index data; defaults to the data folder under the es root. Multiple paths can be given, separated by commas. |
| path.logs | Path for log files; defaults to the logs folder under the es root. |
| path.plugins | Path for plugins; defaults to the plugins folder under the es root. |
| bootstrap.memory_lock | Set to true to lock the memory es uses and avoid swapping. |
| network.host | Sets both bind_host and publish_host; 0.0.0.0 allows external access. |
| http.port | HTTP port for external services; defaults to 9200. |
| transport.tcp.port | Port for communication between cluster nodes. |
| discovery.zen.ping.timeout | Timeout for node discovery; defaults to 3s. Increase it on high-latency networks. |
| discovery.zen.minimum_master_nodes | Minimum number of master-eligible nodes; the formula is (master_eligible_nodes / 2) + 1. For example, with 3 master-eligible nodes, set it to 2. |
3. Optimize system parameters
Run the following as root.
# raise the open-file and process limits
cat << EOF >> /etc/security/limits.conf
* soft nofile 65536
* hard nofile 65536
* soft nproc 4096
* hard nproc 4096
EOF
# raise the maximum number of memory map areas and open files
cat << EOF >> /etc/sysctl.conf
vm.max_map_count=655360
fs.file-max=655360
EOF
# apply the sysctl settings
sysctl -p
4. Optimize jvm Parameters
# Defaults to 1g; changed to 512m here, or adjust to fit your workload
vim config/jvm.options
-Xms512m
-Xmx512m
5. Configure environment variables and start
For security reasons, es refuses to start as the root user.
echo 'export PATH=$PATH:/home/finance/APP/elasticsearch-7.4.0/bin' >> /etc/profile.d/elasticsearch.sh
source /etc/profile.d/elasticsearch.sh
# start-up
elasticsearch -d
# verify; you can also check via a web browser
curl -I localhost:9200
VIII. es index operations
1. HTTP request methods
- GET: request a specific resource (retrieve page information; returns an entity body).
- POST: submit data to the specified resource for processing (form submission, file upload); may create a new resource or modify an existing one.
- PUT: upload content to the specified resource location (the content of the specified resource is replaced by the data the client sends).
- HEAD: same as a GET request except that the response body is not returned; used to fetch the headers.
- DELETE: ask the server to delete the resource identified by the request URL.
- TRACE: echo back the request received by the server; used for testing and diagnostics.
- OPTIONS: return the HTTP methods the server supports for a specific resource, letting clients probe the server's capabilities.
2. Common es requests
Using postman as the client:
GET: read
PUT: create
POST: update
DELETE: delete
2.1 Add index

2.2 Look at the index

2.3 Delete index

2.4 Close index

2.5 Open the index

IX. es cluster distributed architecture
1. Cluster deployment
1.1 Host layout
| cluster name | node name | IP | PORT |
|---|---|---|---|
| elasticsearch | node-1 | 192.168.100.101 | 9200 |
| elasticsearch | node-2 | 192.168.100.102 | 9200 |
| elasticsearch | node-3 | 192.168.100.103 | 9200 |
1.2 Configuration changes
node-1
# Cluster name
cluster.name: elasticsearch
# The name of the node
node.name: node-1
# binding IP Address
network.host: 0.0.0.0
# Specify the service access port
http.port: 9200
# Transport port for inter-node communication
transport.tcp.port: 9300
# Cluster seed hosts
discovery.seed_hosts: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Master-eligible nodes for the initial cluster bootstrap
cluster.initial_master_nodes: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Enable cross domain access support , The default is false
http.cors.enabled: true
## Cross domain access allowed domain names , Allow all domain names
http.cors.allow-origin: "*"
# Data directory location
path.data: /home/finance/data/es7
# Log path
path.logs: /home/finance/logs/es7
node-2
# Cluster name
cluster.name: elasticsearch
# The name of the node
node.name: node-2
# binding IP Address
network.host: 0.0.0.0
# Specify the service access port
http.port: 9200
# Transport port for inter-node communication
transport.tcp.port: 9300
# Cluster seed hosts (all nodes listen on transport port 9300)
discovery.seed_hosts: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Master-eligible nodes for the initial cluster bootstrap
cluster.initial_master_nodes: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Enable cross domain access support , The default is false
http.cors.enabled: true
## Cross domain access allowed domain names , Allow all domain names
http.cors.allow-origin: "*"
# Data directory location
path.data: /home/finance/data/es7
# Log path
path.logs: /home/finance/logs/es7
node-3
cluster.name: elasticsearch
# The name of the node
node.name: node-3
# binding IP Address
network.host: 0.0.0.0
# Specify the service access port
http.port: 9200
# Transport port for inter-node communication
transport.tcp.port: 9300
# Cluster seed hosts
discovery.seed_hosts: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Master-eligible nodes for the initial cluster bootstrap
cluster.initial_master_nodes: ["192.168.100.101:9300", "192.168.100.102:9300","192.168.100.103:9300"]
# Enable cross domain access support , The default is false
http.cors.enabled: true
## Cross domain access allowed domain names , Allow all domain names
http.cors.allow-origin: "*"
# Data directory location
path.data: /home/finance/data/es7
# Log path
path.logs: /home/finance/logs/es7
1.3 Viewing nodes and cluster status
# check cluster health
curl localhost:9200/_cat/health?v
# list cluster nodes
curl localhost:9200/_cat/nodes
1.3.1 Health output fields
- cluster: cluster name
- status: cluster state, one of green (normal), yellow (every primary shard is available but at least one replica is missing; the data is still complete), or red (at least one primary shard is unavailable; data may have been lost)
- node.total: number of online nodes
- node.data: number of online data nodes
- shards: number of active shards
- pri: number of active primary shards; with one replica per primary, shards is normally twice pri
- relo: number of relocating shards; normally 0
- init: number of initializing shards; normally 0
- unassign: number of unassigned shards; normally 0
- pending_tasks: queued tasks such as shard relocations; normally 0
- max_task_wait_time: longest time a task has waited
- active_shards_percent: percentage of active shards; normally 100%
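For scripting, individual fields can be cut out of the one-line `_cat/health` output with awk. A minimal sketch; the sample line below is fabricated for illustration, and in practice you would pipe in `curl -s localhost:9200/_cat/health` instead:

```shell
# hypothetical one-line health response; the columns are
# epoch, timestamp, cluster, status, node.total, node.data, shards, pri, ...
sample="1656800000 12:00:00 elasticsearch green 3 3 6 3 0 0 0 0 - 100.0%"
# column 4 is the status field
status=$(echo "$sample" | awk '{print $4}')
echo "cluster status: $status"
```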
2. Cluster concepts
2.1 Cluster
An elasticsearch cluster is a collection of one or more nodes. Each cluster has a unique name, elasticsearch by default; we can change it via cluster.name. The value matters because a node joins a cluster by its name. Each node also has its own name. Nodes store data, take part in indexing across the cluster, and serve search requests independently.
2.2 Shards
Because ES is a distributed search engine, an index is usually broken into parts whose data is distributed across different nodes; these parts are called shards. ES manages and organizes shards automatically and rebalances shard data when necessary, so users rarely need to worry about shard details.
2.3 Replicas
Before version 7, ES created an index with 5 primary shards by default, each with one replica (since 7.x the default is 1 primary shard). Shard and replica allocation is at the core of a distributed search engine's high availability and fast search responses. Both primary shards and replicas can serve query requests; the only difference is that only the primary shard can handle indexing (write) requests. Replicas matter for search performance, and users can add or remove them at any time. Extra replicas give you more capacity, higher throughput, and stronger failure recovery.
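A quick sanity check on shard counts: an index occupies primaries × (1 + replicas) shards in total. A sketch in shell, using the pre-7.x defaults of 5 primaries with 1 replica each:

```shell
primaries=5   # pre-7.x default number of primary shards
replicas=1    # replicas per primary
# total shards = primary shards * (1 + replicas per primary)
total=$((primaries * (1 + replicas)))
echo "an index with $primaries primaries and $replicas replica(s) occupies $total shards"
```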
X. kibana
1. Deployment
1.1 Download kibana
The kibana version must match the es version, otherwise there will be compatibility problems; this deployment uses the es 7.x series.
# check the es version
elasticsearch -V
# kibana downloads
https://www.elastic.co/cn/downloads/past-releases
1.2 Deploy kibana
tar xvf kibana-7.4.0-linux-x86_64.tar.gz -C APP/
cd APP/
mv kibana-7.4.0-linux-x86_64/ kibana
cat >> /home/finance/APP/kibana/config/kibana.yml << EOF
# enable the Chinese locale
i18n.locale: "zh-CN"
# http port
server.port: 5601
# bind address; do not use 127.0.0.1 here, or the web page will be unreachable from outside
server.host: "192.168.100.101"
# service name
server.name: kibana
# es address
elasticsearch.hosts: ["http://localhost:9200/"]
EOF
1.3 Global variables
# quote the delimiter so $PATH expands at login time rather than when the file is written
cat >> /etc/profile.d/kibana.sh << 'EOF'
export PATH=$PATH:/home/finance/APP/kibana/bin
EOF
source /etc/profile.d/kibana.sh
1.4 Start and verify
kibana >> /home/finance/logs/kibana.log &
# use jobs to confirm it is running
# check that port 5601 is listening
# browse to http://IP:5601
2. management es colony
2.1 modify kibana To configure
# list the es cluster addresses here, then restart kibana
elasticsearch.hosts: ["http://localhost:9200","http://192.168.100.101:9200","http://192.168.100.102:9200"]
XI. Cluster management
1. Shard configuration
If no shard settings are specified when an index is created, the default is 1 primary shard and 1 replica.
1.1 Creating shards
3 primary shards, 1 replica each
Viewing the index's shards and replicas
The primary and replica shards are spread across all three nodes, so if node-1 goes down, the index's shards can still be reached through the other two nodes.
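Creating an index with 3 primaries and 1 replica can be done with a plain curl call. A hedged sketch; the index name `goods` and the host are assumptions, and a running cluster is required:

```shell
# create an index with 3 primary shards, 1 replica each
curl -X PUT "http://localhost:9200/goods" -H 'Content-Type: application/json' -d '
{
  "settings": {
    "number_of_shards": 3,
    "number_of_replicas": 1
  }
}'
# view how the shards are placed across nodes
curl "http://localhost:9200/_cat/shards/goods?v"
```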
2. Split brain
If more than one master is active in a single es cluster, the cluster is said to have split brain.
2.1 Network problems
Due to network latency, jitter, and similar issues, a node can lose its connection to the master; a new master election is then triggered, and split brain can occur.
2.1.1 Avoiding split brain
Increase the discovery.zen.ping.timeout setting; the default is 3s.
2.2 Node load
When a node acts as both master and data node and data traffic is heavy, it may stop responding and appear dead.
Configuration parameters:
# whether the node is master-eligible
node.master: true
# whether the node stores data
node.data: true
2.2.1 Avoiding split brain
Role separation strategy:
- Master-eligible node configuration
# whether the node is master-eligible
node.master: true
# whether the node stores data
node.data: false
- Data node configuration
# whether the node is master-eligible
node.master: false
# whether the node stores data
node.data: true
2.3 jvm garbage collection
When the master node's jvm heap is set too small, heavy jvm garbage collection is triggered and es stops responding.
2.3.1 Avoiding split brain
Raise -Xms and -Xmx in elasticsearch-7.4.0/config/jvm.options.
XII. logstash
1. logstash overview
logstash is a data collection engine, an important bridge between data sources and data analysis/storage tools, with a rich set of filter plugins.
2. logstash components
logstash is made up of three parts: input, filter, and output.
2.1 input plugins
These plugins let Logstash read from specific event sources; by choosing a plugin you choose which data source to pull from.
Available plugins: https://www.elastic.co/guide/en/logstash/current/input-plugins.html
2.2 filter plugins
These plugins filter the data obtained by the input plugin according to specific conditions.
Available plugins: https://www.elastic.co/guide/en/logstash/current/filter-plugins.html
2.3 output plugins
These plugins send event data to a specified destination: data read by input and filtered by filter is written out through an output plugin.
Available plugins: https://www.elastic.co/guide/en/logstash/current/output-plugins.html
2.4 flow chart
