当前位置:网站首页>Daily challenges of search engines_ 4_ External heterogeneous resources - Zhihu
Daily challenges of search engines_ 4_ External heterogeneous resources - Zhihu
2020-11-08 07:14:00 【I don't know.】
Write it at the front :
Search engine is an extremely complex system engineering , Search engines don't work wonders , It needs a little bit of polishing . This series records daily problems , In a way that looks at leopards , A little bit to show the charm of search engines .
To the body :
The island effect of mobile ecology is becoming more and more obvious , But they have a certain relationship with each other . For general search engines , Not all the resources 、 Ecology is satisfied one by one , External resources will certainly be introduced .
Compared with Jingdong 、 Ctrip 、 Meituan and others have a large number of searches every day , But unlike general search , They search for their own ecological output , Or structured content . It doesn't have to be like a general search engine at this point , Bear this kind of " Pain ".

The main way to introduce and retrieve external resources is to provide services by exposing interfaces and cards . There are also apps that jump to provide services .

( So now every big factory is building its own ecological content , Standard formatted data , It's also easy to manage . Like the headline 、 There was no. 、 Penguin 、 Even Zhihu column .)
But when resources need to be integrated into the search engine integrated results display page , It will bring A lot of questions to think about :
1 External ways of providing , It's database building , Or request api The way . The magnitude of the database ? The magnitude of the diversion ? Can you resist . Each has its own advantages and disadvantages , Think about it first .
2 How to build a database ? It's built with its own big library ? Or build a separate library ? Both ways have their own advantages and disadvantages .
3 The fields that create the library 、 Recall 、 How to align sorted fields ? How to deal with missing fields ?
4 The way of sorting side fusion , And ecological considerations .
5 Scalability considerations , How to put the standard 、 Put in storage 、 Sorting and other levels of work can be reused as much as possible , Unify management as much as possible .
6 api How to introduce resources , In terms of its content understanding , It's almost hard to do .
6 Audit operational controls . There is no way to audit , Content is not controlled , If there is sensitivity 、 Vulgar content can have a big impact . If the way of warehousing is better ,api The way is a problem .
版权声明
本文为[I don't know.]所创,转载请带上原文链接,感谢
边栏推荐
- Golang anonymous structure member, named structure member, inheritance, composition
- Astra: Apache Cassandra的未来是云原生
- PerconaXtraDBCluster8.0 最详尽用法指南
- nvm
- Unparseable date: 'Mon Aug 15 11:24:39 CST 2016',时间格式转换异常
- GET,POST,PUT,DELETE,OPTIONS用法与说明
- Python image recognition OCR
- Assembly function MCALL systemstack asmcgocal system call
- Web Security (4) -- XSS attack
- 哔哩哔哩常用api
猜你喜欢
面部识别:攻击类型和反欺骗技术
Ladongo open source full platform penetration scanner framework
Littlest JupyterHub| 02 使用nbgitpuller分发共享文件
Mouse small hand
搜索引擎的日常挑战_4_外部异构资源 - 知乎
Macquarie Bank drives digital transformation with datastex enterprise (DSE)
PCR and PTS calculation and inverse operation in TS stream
Astra: Apache Cassandra的未来是云原生
Simple use of future in Scala
Problems of Android 9.0/p WebView multi process usage
随机推荐
On the concurrency of update operation
ts流中的pcr与pts计算与逆运算
C语言I博客作业03
1.深入Istio:Sidecar自动注入如何实现的?
Oschina plays on Sunday - before that, I always thought I was a
GET,POST,PUT,DELETE,OPTIONS用法与说明
Judging whether paths intersect or not by leetcode
Astra: the future of Apache Cassandra is cloud native
[original] about the abnormal situation of high version poi autosizecolumn method
FORTRAN 77 reads some data from the file and uses the heron iteration formula to solve the problem
Get tree menu list
Adobe Lightroom / LR 2021 software installation package (with installation tutorial)
5G+AR出圈,中国移动咪咕成第33届中国电影金鸡奖全程战略合作伙伴
leetcode之判断路径是否相交
Do you really understand the high concurrency?
ROS learning: remote start ROS node
【原创】关于高版本poi autoSizeColumn方法异常的情况
Web Security (3) -- CSRF attack
1. In depth istio: how is sidecar auto injection realized?
Web Security (1) -- browser homology strategy