当前位置:网站首页>This is probably the most comprehensive project about Twitter information crawler search on the Chinese Internet
This is probably the most comprehensive project about Twitter information crawler search on the Chinese Internet
2022-06-11 05:50:00 【Lao Liu selling vegetables at the corner of the street】
Recently, I crawled twitter for a while because of my work .
Twitter Website is used AJAX Asynchronously loaded , use request Ask to json File data url It is also rejected
So I can only slowly simulate the browser to slide and load it slowly json It's a document ( Of course I didn't use something like selenium Class I Library , Efficiency is too low )
for instance :
We need to climb trump Of 1000 Twitter , Just enter the following command
GetOldTweets3 --username "realDonaldTrump" --toptweets --maxtweets 100
The crawler results are as follows ( Because with windows System needs fanqiang, So directly in linux Under the use of )

use python Read the code csv file ( Poor people can only buy servers without graphical interfaces ):
import csv
import pandas as pd
df=pd.read_csv("/opt/test/output_got.csv")
print(df)

Another example , We want to search on twitter “changsha” News of

GetOldTweets3 --querysearch " changsha " --maxtweets 10
No problem , If in windows You can't use it under the system. You can send me a private message or it should be blocked by a wall .
Good programmers, I think, need one click and compatibility , See what I wrote about how to use it readme That's it
Specifically github The address is as follows :
I hope you can fork Let me have a moment , thank you !!!!
Of course , Sentiment analysis , I can't do things like event extraction , Just do a part of the original data source crawler ,
It should be updated sometime to see if there is a way to bypass twitter Developer accounts, such as those who crawl for attention , Multi level attention, etc
边栏推荐
- Dichotomy find template
- Vscode plug-in development
- qmake 实现QT工程pro脚本转vs解决方案
- NDK learning notes (VIII) thread related
- Recherche sur l'optimisation de Spark SQL basée sur CBO pour kangourou Cloud Stack
- "All in one" is a platform to solve all needs, and the era of operation and maintenance monitoring 3.0 has come
- Getbackgroundaudiomanager controls music playback (dynamic binding of class name)
- NDK learning notes (IX) POSIX sockect connection oriented communication
- Get the full link address of the current project request URL
- Experimental report on information management and information system [information security and confidentiality] of Huazhong Agricultural University
猜你喜欢

Multithreading tutorial (XXVII) CPU cache and pseudo sharing

What happened to the young man who loved to write code -- approaching the "Yao Guang young man" of Huawei cloud

If the MAC fails to connect with MySQL, it will start and report an error

NDK R21 compiles ffmpeg 4.2.2+x264 and converts video files using ffmpeg

深度学习分布式训练

Do we really need conference headphones?

Pycharm usage experience

NDK R21 compiles ffmpeg 4.2.2 (x86, x86_64, armv7, armv8)

Cocoapods installation error

Start the project using the locally configured gradle
随机推荐
SwiftUI: Navigation all know
NFC Development -- the method of using NFC mobile phones as access control cards (II)
DISM命令使用小结
數組部分方法
Xposed bypasses 360 reinforcement to get a real classloader
YOLOv5的Tricks | 【Trick8】图片采样策略——按数据集各类别权重采样
How to deal with message blackout and message sending failure of Aurora im
Install Oracle Database
【深入kotlin】 - Flow 进阶
Recherche sur l'optimisation de Spark SQL basée sur CBO pour kangourou Cloud Stack
Quartz2d drawing technology
Combing route - Compaction Technology
Multi thread tutorial (30) meta sharing mode
The meaning in the status column displayed by PS aux command
If the MAC fails to connect with MySQL, it will start and report an error
[usual practice] explore the insertion position
JS -- reference type
那个酷爱写代码的少年后来怎么样了——走近华为云“瑶光少年”
Clear function of ArrayList
qmake 实现QT工程pro脚本转vs解决方案