This is a script that scrapes the longitude and latitude on food.grab.com

Related tags

Web Crawlinggrab
Overview

grab

This is a script that scrapes the longitude and latitude for any restaurant in Manila on food.grab.com, location can be adjusted.

Search Result page

Preview

Prerequisites

  • Python
  • Selenium
  • Chrome Browser and Chrome Driver
  • VPN and the Browser extension for the VPN should be installed in your Chrome browser.
  • Beautiful Soup.
  • Change the driver path in the script to match yours.

Notes

  • Avoid making too many requests. There is a limit and a blocker for webscraping implemented by grab.com that's why you need to increase the timeout.
  • Vpn location can be any country in Asia.
  • Selenium is highly unstable, so at times you might get some errors. Do not fret :) just restart the script.
  • Running the driver as Headless strangely doesn't work properly because of the proxy problem and since i am using a vpn extension you need to use actual proxy addresses to get it working. So the GUI has to pop up. 👀
  • You can adjust the sliced_links depending on how many restaurants longitude and latitude you want to scrape.

Output

Number of pages loaded is  1
Number of pages loaded is  2
Number of pages loaded is  3
['https://food.grab.com/ph/en/restaurant/mcdonald-s-sta-cruz-church-delivery/AWiD5JwQfYWaYaQC4nY4', 'https://food.grab.com/ph/en/restaurant/greenwich-morayta-delivery/2-CYKTRUDUNNVAFE', 'https://food.grab.com/ph/en/restaurant/mang-inasal-morayta-delivery/2-CZDEV76BTREHKE', 'https://food.grab.com/ph/en/restaurant/wendy-s-dapitan-available-for-long-distance-delivery-delivery/PHGFSTI0000017w', 'https://food.grab.com/ph/en/restaurant/angel-s-pizza-legarda-available-for-long-distance-delivery-delivery/PHGFSTI000000zw', 'https://food.grab.com/ph/en/restaurant/jollibee-raon-delivery/2-CZC1EZEYACBTCX', 'https://food.grab.com/ph/en/restaurant/kfc-sta-cruz-delivery/2-CYUZC8BTGJ51GJ', 'https://food.grab.com/ph/en/restaurant/chowking-sta-cruz-manila-delivery/2-CYMHN7DWAGL2WE', 'https://food.grab.com/ph/en/restaurant/coco-fresh-tea-juice-grabkitchen-sampaloc-delivery/2-C2DDLZNZVTV2TA', 'https://food.grab.com/ph/en/restaurant/subway-medical-center-manila-available-for-long-distance-delivery-delivery/PHGFSTI000003e8', 'https://food.grab.com/ph/en/restaurant/yellow-cab-pizza-espana-available-for-long-distance-delivery-delivery/2-CYKCVZNZJTDFLE', 'https://food.grab.com/ph/en/restaurant/three-j-lugawan-earnshaw-street-available-for-long-distance-delivery-delivery/2-C2JTJFEKWCKZLE', 'https://food.grab.com/ph/en/restaurant/army-navy-burger-burrito-ust-dapitan-available-for-long-distance-delivery-delivery/PHGFSTI000000ym', 'https://food.grab.com/ph/en/restaurant/kenny-rogers-roasters-manila-delco-available-for-long-distance-delivery-delivery/2-CYMFVAK2CU5KN6', 'https://food.grab.com/ph/en/restaurant/zark-s-burgers-ust-available-for-long-distance-delivery-delivery/2-C2VZMEUEPGADDA', 'https://food.grab.com/ph/en/restaurant/happilee-korean-kitchen-grabkitchen-sampaloc-available-for-long-distance-delivery-delivery/2-C2JECYXXKF2ATA', 'https://food.grab.com/ph/en/restaurant/boodle-inasal-x-happy-thirstday-sampaloc-available-for-long-distance-delivery-delivery/2-C2J3BF3DSA2YRX', 'https://food.grab.com/ph/en/restaurant/pares-kimchi-intramuros-available-for-long-distance-delivery-delivery/2-CZNKLEUUMBTHCA', 'https://food.grab.com/ph/en/restaurant/something-healthy-ust-available-for-long-distance-delivery-delivery/2-CYVGEUCHNF3ZJN', 'https://food.grab.com/ph/en/restaurant/shakey-s-pizza-espana-delivery/2-CYK2GKKXEKTULJ', 'https://food.grab.com/ph/en/restaurant/papa-john-s-pizza-tri-loyola-building-available-for-long-distance-delivery-delivery/2-CYTZE7BTVVEVDA', 'https://food.grab.com/ph/en/restaurant/pizza-hut-cm-recto-available-for-long-distance-delivery-delivery/2-CYLCMEJHCNTCGJ', 'https://food.grab.com/ph/en/restaurant/macao-imperial-tea-pacific-suites-delivery/PHGFSTI000003fv', 'https://food.grab.com/ph/en/restaurant/turks-recto-delivery/2-CYUJJNU2N6KFTA', 'https://food.grab.com/ph/en/restaurant/serenitea-ust-lacson-available-for-long-distance-delivery-delivery/2-CZJTAAXDGEJABA', 'https://food.grab.com/ph/en/restaurant/food-house-by-madla-ust-delivery/2-CZKJC241LXA1TN', 'https://food.grab.com/ph/en/restaurant/mister-kabab-grabkitchen-sampaloc-available-for-long-distance-delivery-delivery/2-C2JEE7UARFJYEN', 'https://food.grab.com/ph/en/restaurant/grabkitchen-mix-and-match-sampaloc-available-for-long-distance-delivery-delivery/2-C2JTEVCUFFAYA6', 'https://food.grab.com/ph/en/restaurant/dosirakuya-korean-street-food-tondo-available-for-long-distance-delivery-delivery/2-C2MEPBJBLYU2DA', 'https://food.grab.com/ph/en/restaurant/tutong-s-laksa-sampaloc-available-for-long-distance-delivery-delivery/2-C23URRLCNJUTVT', 'https://food.grab.com/ph/en/restaurant/i-love-milktea-tayuman-delivery/2-C2TKT3DVJYUELX', 'https://food.grab.com/ph/en/restaurant/erlinda-s-foodhouse-loyola-street-delivery/2-C2WCFGMANYXTAA', 'https://food.grab.com/ph/en/restaurant/big-scoop-p-guevarra-st-delivery/2-C2UWRYJBTX4JSE', 'https://food.grab.com/ph/en/restaurant/mipanda-milk-tea-hub-lacson-available-for-long-distance-delivery-delivery/2-C2MUNFEKGYAHET', 'https://food.grab.com/ph/en/restaurant/caution-bistro-manila-available-for-long-distance-delivery-delivery/2-C2WFT2UYEUBDCX', 'https://food.grab.com/ph/en/restaurant/24-7-wings-ust-available-for-long-distance-delivery-delivery/2-CYUYBF42AU5ELJ', 'https://food.grab.com/ph/en/restaurant/baliwag-lechon-manok-laong-laan-manila-available-for-long-distance-delivery-delivery/2-C2CGNKTYGA4ZL6', 'https://food.grab.com/ph/en/restaurant/hong-kong-noodles-dimsum-house-quiapo-available-for-long-distance-delivery-delivery/PHGFSTI000001d8', 'https://food.grab.com/ph/en/restaurant/selecta-ice-cream-shop-sta-cruz-delivery/2-CYWFLCNBKFCZJJ', 'https://food.grab.com/ph/en/restaurant/starbucks-puerta-de-isabel-ii-delivery/2-CY42TKKBTKM3LE']
You have scraped 40 links
{'latitude': 14.599870428571428, 'longitude': 120.9797005}
{'latitude': 14.605052185873603, 'longitude': 120.98803399294457}
{'latitude': 14.605344959550562, 'longitude': 120.9881068}
{'latitude': 14.611880266666667, 'longitude': 120.9884545}
{'latitude': 14.599262918622557, 'longitude': 120.9900343411054}
{'latitude': 14.6007231, 'longitude': 120.98439648}
{'latitude': 14.599968222336393, 'longitude': 120.98002192}
{'latitude': 14.599834675, 'longitude': 120.980555425}
原神爬虫 抓取原神界面圣遗物信息

原神圣遗物半自动爬虫 说明 直接抓取原神界面中的圣遗物数据 目前只适配了背包页面的抓取 准确率:97.5%(普通通用接口,对 40 件随机圣遗物识别,统计完全正确的数量为 39) 准确率:100%(4k 屏幕,普通通用接口,对 110 件圣遗物识别,统计完全正确的数量为 110) 不排除还有小错误的

hwa 28 Oct 10, 2022
Using Selenium with Python to Web Scrap Popular Youtube Tech Channels.

Web Scrapping Popular Youtube Tech Channels with Selenium Data Mining, Data Wrangling, and Exploratory Data Analysis About the Data Web scrapi

David Rusho 0 Aug 18, 2021
Poolbooru gelscraper - a simple python script for scraping images off gelbooru pools.

poolbooru_gelscraper a simple python script for scraping images off gelbooru pools. modules required:requests_html, and os by default saves files with

savantshuia 1 Jan 02, 2022
Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Instagram_scrapper This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or exce

Lakhdar Belkharroubi 5 Oct 17, 2022
Pyrics is a tool to scrape lyrics, get rhymes, generate relevant lyrics with rhymes.

Pyrics Pyrics is a tool to scrape lyrics, get rhymes, generate relevant lyrics with rhymes. ./test/run.py provides the full function in terminal cmd

MisterDK 1 Feb 12, 2022
A scalable frontier for web crawlers

Frontera Overview Frontera is a web crawling framework consisting of crawl frontier, and distribution/scaling primitives, allowing to build a large sc

Scrapinghub 1.2k Jan 02, 2023
A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

memey A simple reddit scraper to get memes (only images) from r/ProgrammerHumor. Note Only works if you have firefox installed (yet). Instructions foo

2 Nov 16, 2021
此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

N0el4kLs 5 Nov 19, 2021
Automated Linkedin bot that will improve your visibility and increase your network.

LinkedinSpider LinkedinSpider is a small project using browser automating to increase your visibility and network of connections on Linkedin. DISCLAIM

Frederik 2 Nov 26, 2021
A command-line program to download media, like and unlike posts, and more from creators on OnlyFans.

onlyfans-scraper A command-line program to download media, like and unlike posts, and more from creators on OnlyFans. Installation You can install thi

185 Jul 23, 2022
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Parsel Parsel is a BSD-licensed Python library to extract and remove data from HTML and XML using XPath and CSS selectors, optionally combined with re

Scrapy project 859 Dec 29, 2022
A Python web scraper to scrape latest posts from official Coinbase's Blog.

Coinbase Blog Scraper A Python web scraper to scrape latest posts from official Coinbase's Blog. IDEA It scrapes up latest blog posts from https://blo

Lucas Villela 3 Feb 18, 2022
Libextract: extract data from websites

Libextract is a statistics-enabled data extraction library that works on HTML and XML documents and written in Python

499 Dec 09, 2022
A Python module to bypass Cloudflare's anti-bot page.

cloudflare-scrape A simple Python module to bypass Cloudflare's anti-bot page (also known as "I'm Under Attack Mode", or IUAM), implemented with Reque

3k Jan 04, 2023
对于有验证码的站点爆破,用于安全合法测试

使用方法 python3 main.py + 配置好的文件 python3 main.py Verify.json python3 main.py NoVerify.json 以上分别对应有验证码的demo和无验证码的demo Tips: 你可以以域名作为配置文件名字加载:python3 main

47 Nov 09, 2022
Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

Gerapy Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Scrapyd-Client, Scrapyd-API, Django and Vue.js. Documentation Documentation

Gerapy 2.9k Jan 03, 2023
A dead simple crawler to get books information from Douban.

Introduction A dead simple crawler to get books information from Douban. Pre-requesites Python 3 Install dependencies from requirements.txt (Optional)

Yun Wang 1 Jan 10, 2022
This is my CS 20 final assesment.

eeeeeSpider This is my CS 20 final assesment. How to use: Open program Run to your hearts content! There are no external dependancies that you will ha

1 Jan 17, 2022
🥫 The simple, fast, and modern web scraping library

About gazpacho is a simple, fast, and modern web scraping library. The library is stable, actively maintained, and installed with zero dependencies. I

Max Humber 692 Dec 22, 2022
Rottentomatoes, Goodreads and IMDB sites crawler. Semantic Web final project.

Crawler Rottentomatoes, Goodreads and IMDB sites crawler. Crawler written by beautifulsoup, selenium and lxml to gather books and films information an

Faeze Ghorbanpour 1 Dec 30, 2021