当前位置:网站首页>This is probably the most comprehensive project about Twitter information crawler search on the Chinese Internet
This is probably the most comprehensive project about Twitter information crawler search on the Chinese Internet
2022-06-11 05:50:00 【Lao Liu selling vegetables at the corner of the street】
Recently, I crawled twitter for a while because of my work .
Twitter Website is used AJAX Asynchronously loaded , use request Ask to json File data url It is also rejected
So I can only slowly simulate the browser to slide and load it slowly json It's a document ( Of course I didn't use something like selenium Class I Library , Efficiency is too low )
for instance :
We need to climb trump Of 1000 Twitter , Just enter the following command
GetOldTweets3 --username "realDonaldTrump" --toptweets --maxtweets 100
The crawler results are as follows ( Because with windows System needs fanqiang, So directly in linux Under the use of )

use python Read the code csv file ( Poor people can only buy servers without graphical interfaces ):
import csv
import pandas as pd
df=pd.read_csv("/opt/test/output_got.csv")
print(df)

Another example , We want to search on twitter “changsha” News of

GetOldTweets3 --querysearch " changsha " --maxtweets 10
No problem , If in windows You can't use it under the system. You can send me a private message or it should be blocked by a wall .
Good programmers, I think, need one click and compatibility , See what I wrote about how to use it readme That's it
Specifically github The address is as follows :
I hope you can fork Let me have a moment , thank you !!!!
Of course , Sentiment analysis , I can't do things like event extraction , Just do a part of the original data source crawler ,
It should be updated sometime to see if there is a way to bypass twitter Developer accounts, such as those who crawl for attention , Multi level attention, etc
边栏推荐
- NDK learning notes (14) create an avi video player using avilib+window
- Configure the rust compilation environment
- DISM命令使用小结
- Experimental report on information management and information system [information security and confidentiality] of Huazhong Agricultural University
- Activity start process record
- Informatica: six steps of data quality management
- Es IK installation error
- 深度学习分布式训练
- 创建酷炫的 CollectionViewCell 转换动画
- [daily exercises] merge two ordered arrays
猜你喜欢
![[metadata]linkedin datahub](/img/6a/247ee77dfc219d26651e5d93b04c98.png)
[metadata]linkedin datahub

All questions and answers of database SQL practice niuke.com

NDK learning notes (13) create an avi video player using avilib+opengl es 2.0

Stone game -- leetcode practice

安装Oracle数据库

NDK learning notes (14) create an avi video player using avilib+window

Maximum number of points on the line ----- hash table solution

Altiumdesigner2020 import 3D body SolidWorks 3D model

微信小程序text内置组件换行符不换行的原因-wxs处理换行符,正则加段首空格

Yonghong Bi product experience (I) data source module
随机推荐
Multi threading tutorial (XXIV) cas+volatile
NDK learning notes (VII) system configuration, users and groups
Quartz2d drawing technology
修复Yum依赖冲突
Getbackgroundaudiomanager controls music playback (dynamic binding of class name)
Informatica:数据质量管理六步法
袋鼠雲數棧基於CBO在Spark SQL優化上的探索
Wechat applet text built-in component newline character does not newline reason
Elk log system practice (V): install vector and output data to es and Clickhouse cases
Use of vlayout
使用Batch枚舉文件
NDK R21 compiles ffmpeg 4.2.2+x264 and converts video files using ffmpeg
Multi threading tutorial (XXIV) cas+volatile
Get the full link address of the current project request URL
初步了解多任务学习
Aurora im live chat
安装Oracle数据库
ReferenceError: server is not defined
Get the value of program exit
Cocoatouch framework and building application interface