当前位置:网站首页>UTF encoding and character set in golang
UTF encoding and character set in golang
2022-07-04 21:04:00 【Nanyidao street】
One 、UTF Coding and Golang Character set
1. Character set
A bit is either 1 Or 0, In any case, you can't get a letter A, We can take these for example A-Z A mapping relationship between the characters of and numbers , such as 0100 0001 representative A, Then we create a character set , Collect these mapping relationships , Get a character number comparison table , Just Called character set
2.ASCII Character set
ASCII Only 128 character , The extended character set has 256 individual
3.GB2312 Character set
ASCII Chinese characters are not supported , And then there is GB2312 Character set
4.Unicode Character set
There are many characters not included in the above character set , We want to make a general character set ,Unicode This is what the association does
5. Fixed length coding , Variable length coding
5.1 Fixed length coding
If you want to express "eggo The world ", We use it directly Unicode Character sets get their numbers , But how to divide the order after getting the number is another problem , For example, it is randomly divided into " lean to one side "
terms of settlement : No matter how long these characters are , Unify according to the longest boundary , The number of digits is not high enough to fill 0, The character boundary problem is solved ,
The new problem : Waste of memory , And the more symbols in the character set , The larger the coding span ,“ Fixed length coding wastes significantly ”, We have to find a way to solve the problem of memory consumption
5.2 Variable length coding
Fixed length coding is not OK , We use variable length coding , Small numbers use fewer bytes , Large number multi-purpose bytes
The solution is as follows :
[0,127] One byte , The highest flag bit is 0
[128,2047] Two bytes , Highest flag bit 110, There are also fixed flags 10
[2048,65535], Highest flag bit 1110, There are two fixed flag bits 10
01100101, The highest byte is 0, Remove the flag bit , The other corresponding is e
11100100 10111000 1001011 use 1110 start , Remove the three flag bits , The remaining parts are combined , You can get the world " the "
6.UT8 Detailed explanation
UTF-8 It is variable length encoding , It can be used 1~4 Byte representation ,
The coding rules are as follows :
1. For one byte , The first is 0, be left over 7 For use Unicode Coding means
2. about n Bytes (n>1), The first byte of front n Position as 1, Bytes left The first two are 10
边栏推荐
- RFID仓库管理系统解决方案有哪些功能模块
- Common verification rules of form components -1 (continuously updating ~)
- Poster cover of glacier
- HMS Core 统一扫码服务
- 面对同样复杂的测试任务为什么大老很快能梳理解决方案,阿里十年测试工程师道出其中的技巧
- [server data recovery] a case of RAID5 data recovery stored in a brand of server
- 剑指 Offer II 80-100(持续更新)
- 网件r7000梅林系统虚拟内存创建失败,提示USB磁盘读写速度不满足要求解决办法,有需要创建虚拟内存吗??
- 科普达人丨一文看懂阿里云的秘密武器“神龙架构”
- D3.js+Three.js数据可视化3d地球js特效
猜你喜欢
How does the computer save web pages to the desktop for use
D3.js+Three.js数据可视化3d地球js特效
RFID仓库管理系统解决方案有哪些功能模块
hash 表的概念及应用
【1200. 最小絕對差】
colResizable.js自动调整表格宽度插件
The concept and application of hash table
What if the win11 shared file cannot be opened? The solution of win11 shared file cannot be opened
实操自动生成接口自动化测试用例
What if win11u disk refuses access? An effective solution to win11u disk access denial
随机推荐
Gobang go to work fishing tools can be LAN / man-machine
Alibaba testers use UI automated testing to achieve element positioning
Après l'insertion de l'image dans le mot, il y a une ligne vide au - dessus de l'image, et la disposition est désordonnée après la suppression
Automatic insertion of captions in word
Why is TCP three handshakes and four waves
网络命名空间
hash 表的概念及应用
How to solve the problem that win11 cannot write the value to the registry key?
Managed service network: application architecture evolution in the cloud native Era
Foxit pdf editor v10.1.8 green version
Hands on deep learning (III) -- convolutional neural network CNN
Advantages of semantic tags and block level inline elements
实战模拟│JWT 登录认证
PS竖排英文和数字文字怎么改变方向(变竖直显示)
What if the WiFi of win11 system always drops? Solution of WiFi total drop in win11 system
[1200. Minimum absolute difference]
语义化标签的优势和块级行内元素
What if the brightness of win11 is locked? Solution to win11 brightness locking
What if the computer page cannot be full screen? The solution of win11 page cannot be full screen
Poster cover of glacier