Forenose tutorial: extracting data
2022-06-30 10:40:00 【Full stack programmer webmaster】
This tutorial explains how to extract data in the Forenose ForeSpider crawler. It covers two topics: how to select a form, and how to collect list/table data. The details are as follows:
I. How to select a form
In the ForeSpider crawler, a form is a reusable table structure: once created, it can be used repeatedly across multiple tasks.
Data table selection page
1. Select the form
Method 1: Select an existing form from the drop-down menu, or by entering the form ID.
Method 2: Quick table creation. Click “Create Form” to open the quick table creation page and create a new form. (>> For details, see Quick table creation)
Method 3: Free table creation. Click “Collection configuration” - “Data table”, then click the button next to “Collect forms”. (>> For details, see Free table creation)
Data table creation page
2. Data storage mode
This setting controls how collected data is written to the database.
① Insert: the default. If a collected record duplicates one that already exists in the database, it is not inserted again.
② Update only: if a collected record duplicates one that already exists in the database, the newly collected data overwrites the existing record.
③ Append: applies to operation fields. For example, if a field's attribute is set to operation field, the field operation is performed on it.
④ Insert and update: if there is no duplicate record, insert; if there is a duplicate record, update it.
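The difference between the modes can be sketched with Python's built-in sqlite3 module. This is only an illustration of the semantics described above, not ForeSpider's own implementation: the table `records`, the helpers `make_db`, `store` and `rows`, and the mode names are all hypothetical. It also assumes that “Update only” skips records that are not yet in the database, which the text does not state. “Append” is left out because its behavior depends on how the operation field is configured.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical 'records' table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE records (id INTEGER PRIMARY KEY, value TEXT)")
    return conn


def store(conn, mode, key, value):
    """Write one collected record using the given storage mode."""
    found = conn.execute("SELECT 1 FROM records WHERE id = ?", (key,)).fetchone()
    exists = found is not None
    if mode == "insert":
        # Duplicates are skipped; only new keys are written.
        write_new, overwrite = True, False
    elif mode == "update_only":
        # Assumption: records not already stored are ignored.
        write_new, overwrite = False, True
    elif mode == "insert_and_update":
        write_new, overwrite = True, True
    else:
        raise ValueError(f"unknown mode: {mode}")

    if exists and overwrite:
        conn.execute("UPDATE records SET value = ? WHERE id = ?", (value, key))
    elif not exists and write_new:
        conn.execute("INSERT INTO records (id, value) VALUES (?, ?)", (key, value))


def rows(conn):
    """Return all stored records ordered by key."""
    return conn.execute("SELECT id, value FROM records ORDER BY id").fetchall()
```

Calling `store(conn, "insert", 1, "a")` twice with different values leaves the first value in place, while the same calls with `"insert_and_update"` leave the second.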
II. How to collect list/table data
List recognition is used to store table/list data: the different columns of the table/list are stored in different fields, and its different rows are stored as separate records in the data table. This section uses the Web server page of the Forenose official website (http://www.forenose.com/panne…) as an example.
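The mapping from table rows to records and from columns to fields can be sketched outside ForeSpider with Python's standard html.parser module. The class `TableRows`, the function `table_to_records`, the sample markup and the field names are hypothetical, and the sketch assumes the first table row is a header that should be skipped.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of every cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_records(html, fields):
    """Turn each data row into one record; each column fills one field."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    # Assumption: the first row is a header row and is skipped.
    return [dict(zip(fields, row)) for row in parser.rows[1:]]


# Hypothetical sample markup, not taken from the Forenose site.
SAMPLE = (
    "<table>"
    "<tr><th>Name</th><th>Price</th></tr>"
    "<tr><td>Widget</td><td>10</td></tr>"
    "<tr><td>Gadget</td><td>25</td></tr>"
    "</table>"
)
records = table_to_records(SAMPLE, ["name", "price"])
```

Here `records` holds two dictionaries, one per data row, each with a `name` and a `price` entry.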
1. Create a form
Based on the table to be collected, create a form to store its data. On the “Data table” tab, create a form. (>> Free table creation)
Table structure for list recognition
(1) Primary key
When collecting a table, each row of the table is stored as one record. Because the whole table belongs to a single web document, and a document has only one primary key, you cannot select the value type “Page primary key” as you would when collecting other content. For the variable type of the table's primary key, choose “Integer” or “Long” according to the number of rows and the length of the table. Set the value type to “empty”. For the field attributes, select “Primary key field” and “Automatic field” (once the primary key field is selected, the software automatically selects “Key value unique” and “Index field”).
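The reasoning behind this choice can be illustrated with a small sqlite3 sketch. It is not ForeSpider's storage code: the URL, the row data and both function names are hypothetical. If every row of a table carries the page's key, the rows collide and only one survives under insert-style storage; an automatically generated key gives each row its own identity.

```python
import sqlite3

# Hypothetical page address and table rows, used only for illustration.
PAGE_URL = "http://example.com/page"
ROWS = [("Widget", "10"), ("Gadget", "25"), ("Gizmo", "40")]


def count_with_page_key(rows):
    """Use the page address as primary key: every row shares the same key."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE t (id TEXT PRIMARY KEY, name TEXT, price TEXT)")
    for name, price in rows:
        # Rows after the first collide with the existing key and are skipped.
        conn.execute("INSERT OR IGNORE INTO t VALUES (?, ?, ?)", (PAGE_URL, name, price))
    return conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]


def ids_with_auto_key(rows):
    """Let the database generate the key: every row becomes its own record."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE t (id INTEGER PRIMARY KEY AUTOINCREMENT, name TEXT, price TEXT)"
    )
    for name, price in rows:
        conn.execute("INSERT INTO t (name, price) VALUES (?, ?)", (name, price))
    return [r[0] for r in conn.execute("SELECT id FROM t ORDER BY id")]
```

With the three sample rows, `count_with_page_key(ROWS)` stores a single record, while `ids_with_auto_key(ROWS)` yields three distinct keys.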
Configuration of primary key fields
(2) Other fields
For the other fields, set the variable type to “string” and the value type to “All text in the selection”. (>> Field parameters)
Configuration of other fields
2. Create data extraction
Select the form to be used for the data extraction.
Select the form
3. Identify multiple values
Click the “Default data extraction” node, hold Ctrl and click any cell, then hold Shift and click again to expand the selection.
Locating the table
Click “Identify multiple values”; the selection expands to the whole table. Then click “Confirm selection”.
Confirming the multi-value selection
4. Field values
The primary key field does not need to be configured. The fields that store the table's contents must be assigned values one by one (Method 1: standard positioning / Method 2: feature positioning). Click each field under the data extraction and configure it for the corresponding column of the table: click the field, hold Ctrl and click any cell in that column (for the first field, any cell in the first column), then click “Save”.
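The idea that one clicked cell stands for its whole column can be sketched as follows. This is a simplified model, not how ForeSpider locates cells internally: the table is a plain list of rows, a click is a hypothetical `(row, column)` pair, and `extract_by_picks` is an illustrative name.

```python
def extract_by_picks(table, picks):
    """Build one record per row from one clicked cell per field.

    table: list of rows, each row a list of cell texts.
    picks: field name -> (row_index, col_index) of any cell in that column.
    Only the column index of each pick matters; the row index is ignored.
    """
    columns = {field: cell[1] for field, cell in picks.items()}
    return [
        {field: row[col] for field, col in columns.items()}
        for row in table
    ]


# Hypothetical table contents and clicks.
TABLE = [["Widget", "10"], ["Gadget", "25"]]
PICKS = {"name": (1, 0), "price": (0, 1)}
extracted = extract_by_picks(TABLE, PICKS)
```

Clicking a different row in the same column produces the same result, which is why any cell of the column will do.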
Assigning values to multi-value fields
Publisher: Full stack programmer stack length. When reprinting, please cite the source: https://javaforall.cn/101092.html Original link: https://javaforall.cn