Daily challenges of search engines (4): External heterogeneous resources - Zhihu
2020-11-08 07:14:00 【I don't know.】
Preface:
A search engine is an extremely complex piece of systems engineering. There are no miracles in search; it has to be polished bit by bit. This series records everyday problems and, like glimpsing a leopard through a tube, shows the charm of search engines one small piece at a time.
Main text:
The island effect in the mobile ecosystem is becoming more and more pronounced, yet the islands still depend on one another. A general search engine cannot satisfy every resource and every ecosystem on its own, so it will inevitably bring in external resources.
By comparison, JD.com (Jingdong), Ctrip, Meituan, and similar platforms also handle a large number of searches every day. Unlike general search, however, they search content produced by their own ecosystem, or structured content. On this point they do not have to bear the same "pain" that a general search engine does.
External resources are mainly introduced and retrieved by exposing interfaces and serving them as cards. Some are also served by jumping to the provider's app.
(This is why every major tech company is now building its own content ecosystem: the data comes in a standard format and is easy to manage. Examples include Toutiao, "There was no.", Penguin, and even Zhihu columns.)
But when these resources have to be merged into the search engine's integrated results page, a lot of questions come up:
1 How the external party provides the resource: by building an index from its data, or by requesting its API? How large is the index? How much traffic will be diverted to the provider, and can the provider withstand it? Each approach has its own advantages and disadvantages, so think this through first.
2 How to build the index? Build it together with the search engine's own main index, or build a separate one? Both approaches have their own advantages and disadvantages.
3 How to align the fields used for index building, recall, and ranking? How to deal with missing fields?
4 How to fuse the results on the ranking side, along with ecosystem considerations.
5 Scalability considerations: how to make the work on standards, ingestion, ranking, and other levels as reusable as possible, and manage it in as unified a way as possible.
6 For resources introduced through an API, content understanding is almost impossible to do.
7 Audit and operational control. If there is no way to audit, the content is not under control, and sensitive or vulgar content can have a big impact. The ingestion approach handles this better; the API approach is a problem.
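As a rough illustration of questions 3 and 4, the sketch below maps records from two external providers onto one unified schema, handles missing fields, and then merges them with native results. Everything here is an assumption made for illustration: the provider names, the field names, the `Doc` schema, the cap on external results, and the idea that scores from different sources are already comparable. It is not the method used by any particular search engine.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional


@dataclass
class Doc:
    """Hypothetical unified schema shared by native and external results."""
    doc_id: str
    title: str
    source: str
    score: float       # assumed to be normalized so sources are comparable
    summary: str = ""  # optional field; empty when a provider omits it


# Hypothetical mappings: unified field name -> the provider's own field name.
FIELD_MAPS: Dict[str, Dict[str, str]] = {
    "provider_a": {"doc_id": "id", "title": "name",
                   "score": "rank_score", "summary": "desc"},
    "provider_b": {"doc_id": "item_id", "title": "headline",
                   "score": "relevance"},
}

REQUIRED = ("doc_id", "title")


def align(record: Dict[str, Any], source: str) -> Optional[Doc]:
    """Map one provider record onto the unified schema.

    Returns None if a required field is missing or empty;
    optional fields fall back to neutral defaults.
    """
    mapping = FIELD_MAPS[source]
    unified = {name: record.get(key) for name, key in mapping.items()}
    if any(unified.get(name) in (None, "") for name in REQUIRED):
        return None
    return Doc(
        doc_id=str(unified["doc_id"]),
        title=str(unified["title"]),
        source=source,
        score=float(unified.get("score") or 0.0),
        summary=str(unified.get("summary") or ""),
    )


def fuse(native: List[Doc], external: List[Doc], max_external: int = 2) -> List[Doc]:
    """Merge by score, keeping at most `max_external` external results."""
    kept = sorted(external, key=lambda d: d.score, reverse=True)[:max_external]
    return sorted(native + kept, key=lambda d: d.score, reverse=True)


if __name__ == "__main__":
    raw = [
        ({"id": 1, "name": "Hotel X", "rank_score": 0.9, "desc": "nice"}, "provider_a"),
        ({"item_id": "b1", "headline": "News"}, "provider_b"),  # no score, no summary
        ({"item_id": "b2"}, "provider_b"),                      # no title: dropped
    ]
    external = [d for d in (align(r, s) for r, s in raw) if d is not None]
    native = [Doc("n1", "Native result", "native", 0.8)]
    for doc in fuse(native, external):
        print(doc.source, doc.doc_id, doc.score)
```

Dropping records that lack required fields and defaulting optional ones is only one possible policy; capping external results is likewise just one simple way to express the ecosystem considerations mentioned in question 4.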
This article was written by [I don't know.]. Please include a link to the original when reposting. Thank you.