UTF encoding and character set in golang
2022-07-04 21:04:00 【Nanyidao street】
I. UTF Encoding and Character Sets in Golang
1. Character set
A bit is either 1 or 0, and on its own it can never be the letter A. What we can do is define a mapping between characters, such as A-Z, and numbers: for example, 0100 0001 represents A. Collecting these mappings into a character-to-number lookup table gives us what is called a character set.
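The mapping described above can be observed directly in Go, where a character literal is simply a number. This is a minimal sketch; `bitsOf` is a hypothetical helper introduced here for illustration, not something from the original post.

```go
package main

import "fmt"

// bitsOf returns the 8-bit pattern of a character's number.
// Hypothetical helper, for illustration only.
func bitsOf(c byte) string {
	return fmt.Sprintf("%08b", c)
}

func main() {
	fmt.Println('A')              // 65: the number the character set assigns to A
	fmt.Println(bitsOf('A'))      // 01000001
	fmt.Println(string(rune(65))) // A: looking the number up gives the character back
}
```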
2. ASCII character set
ASCII defines only 128 characters; the extended ASCII character sets have 256.
3. GB2312 character set
ASCII does not support Chinese characters, which is why the GB2312 character set was created.
4. Unicode character set
Many characters are still not covered by the character sets above, so a universal character set is needed. This is what the Unicode Consortium provides.
5. Fixed-length and variable-length encoding
5.1 Fixed-length encoding
Suppose we want to represent "eggo世界". We can look up each character's number directly in the Unicode character set, but once we have the numbers, marking where one character ends and the next begins is a separate problem: if the bits are split at the wrong boundaries, they decode to entirely different characters.
Solution: regardless of how long each character's number is, pad every one to the length of the longest, filling the unused high-order bits with 0. This solves the character boundary problem.
New problem: wasted memory. The more symbols the character set contains, the wider the code range becomes, and the more a fixed-length encoding wastes. We need a way to reduce the memory consumption.
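To put a rough number on the waste, the sketch below compares a fixed width of 4 bytes per character (an assumption chosen here because 4 bytes is enough for any Unicode code point) with the size Go actually uses for the same string, which it stores as UTF-8. `fixedWidthSize` is a hypothetical helper, and the sample string follows the example above.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// fixedWidthSize returns the size in bytes of s if every character
// were padded to 4 bytes. Hypothetical helper, for illustration only.
func fixedWidthSize(s string) int {
	return 4 * utf8.RuneCountInString(s)
}

func main() {
	s := "eggo世界"
	fmt.Println(utf8.RuneCountInString(s)) // 6 characters
	fmt.Println(fixedWidthSize(s))         // 24 bytes with fixed-length encoding
	fmt.Println(len(s))                    // 10 bytes as stored by Go (UTF-8)
}
```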
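5.2 Variable-length encoding
Since fixed-length encoding is too wasteful, we use variable-length encoding: small numbers use fewer bytes and large numbers use more bytes.
The scheme is as follows:
[0,127]: one byte, whose leading flag bit is 0
[128,2047]: two bytes; the first byte starts with the flag bits 110 and the second byte starts with the fixed flag bits 10
[2048,65535]: three bytes; the first byte starts with the flag bits 1110 and each of the other two bytes starts with the fixed flag bits 10
01100101: the highest bit is 0, so this is a single byte; removing the flag bit leaves 1100101 (decimal 101), which corresponds to e
11100100 10111000 10010110: the first byte starts with 1110, so this is a three-byte character; removing the flag bits from the three bytes and joining the remaining bits gives 0100 111000 010110 (U+4E16), which is 世, the first character of 世界 ("world")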
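The two worked examples above can be reproduced with a small hand-written decoder. This is only a sketch of the one-, two-, and three-byte cases listed in this section: `decodeOne` is a hypothetical helper, it does not validate the continuation bytes, and real code would normally use `utf8.DecodeRune` from the standard library instead.

```go
package main

import "fmt"

// decodeOne decodes the first character in b using the flag bits
// described above. It returns the code point and the number of bytes
// consumed, or (-1, 0) for patterns this sketch does not cover.
func decodeOne(b []byte) (rune, int) {
	switch {
	case len(b) >= 1 && b[0]&0x80 == 0x00: // 0xxxxxxx
		return rune(b[0]), 1
	case len(b) >= 2 && b[0]&0xE0 == 0xC0: // 110xxxxx 10xxxxxx
		return rune(b[0]&0x1F)<<6 | rune(b[1]&0x3F), 2
	case len(b) >= 3 && b[0]&0xF0 == 0xE0: // 1110xxxx 10xxxxxx 10xxxxxx
		return rune(b[0]&0x0F)<<12 | rune(b[1]&0x3F)<<6 | rune(b[2]&0x3F), 3
	}
	return -1, 0
}

func main() {
	r, n := decodeOne([]byte{0b01100101})
	fmt.Printf("%c U+%04X %d\n", r, r, n) // e U+0065 1

	r, n = decodeOne([]byte{0b11100100, 0b10111000, 0b10010110})
	fmt.Printf("%c U+%04X %d\n", r, r, n) // 世 U+4E16 3
}
```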
6. UTF-8 in detail
UTF-8 is a variable-length encoding that represents each character with 1 to 4 bytes.
The encoding rules are as follows:
1. For a single-byte character, the first bit is 0 and the remaining 7 bits hold the Unicode code point
2. For an n-byte character (n>1), the first n bits of the first byte are 1 and bit n+1 is 0; each of the remaining bytes starts with 10
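The two rules can be turned into a small encoder and checked against Go's standard library. This is a sketch under stated assumptions: `encodeRune` is a hypothetical helper written for this illustration, and it assumes a valid code point (it does not reject surrogates or values above U+10FFFF, which `utf8.EncodeRune` handles).

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// encodeRune encodes r by hand following the rules above.
// Hypothetical helper; assumes r is a valid Unicode code point.
func encodeRune(r rune) []byte {
	switch {
	case r < 0x80: // 1 byte: 0xxxxxxx
		return []byte{byte(r)}
	case r < 0x800: // 2 bytes: 110xxxxx 10xxxxxx
		return []byte{0xC0 | byte(r>>6), 0x80 | byte(r&0x3F)}
	case r < 0x10000: // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
		return []byte{0xE0 | byte(r>>12), 0x80 | byte(r>>6&0x3F), 0x80 | byte(r&0x3F)}
	default: // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
		return []byte{0xF0 | byte(r>>18), 0x80 | byte(r>>12&0x3F),
			0x80 | byte(r>>6&0x3F), 0x80 | byte(r&0x3F)}
	}
}

func main() {
	for _, r := range []rune{'e', 'é', '世', '😀'} {
		// Reference result from the standard library.
		buf := make([]byte, utf8.UTFMax)
		n := utf8.EncodeRune(buf, r)

		got := encodeRune(r)
		fmt.Printf("%c: % X (matches standard library: %v)\n",
			r, got, string(got) == string(buf[:n]))
	}
}
```

For 世 the bytes printed are E4 B8 96, the same three bytes as in the worked example in section 5.2.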