Every web site provides APIs.

Last update: Jan 05, 2023

Toapi

Overview

Toapi gives you the ability to make every web site provide APIs.

Version v2.0.0 is a complete rewrite.

More elegant. More Pythonic.

v1.0.0 Documentation: http://www.toapi.org
Awesome: https://github.com/toapi/awesome-toapi
Organization: https://github.com/toapi

Features

Automatically converts an HTML web site into an API service.
Automatically caches every page of the source site.
Automatically caches every request.
Supports merging multiple web sites into one API service.
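
To make the caching features more concrete, here is a minimal sketch of a time-based page cache keyed by URL. It only illustrates the general idea and is not Toapi's actual implementation; the `TTLCache` class, its `fetch` callback, and the `ttl` parameter are all hypothetical names introduced for this example.

```python
import time
from typing import Callable, Dict, Tuple


class TTLCache:
    """Hypothetical illustration: cache fetched pages by URL for `ttl` seconds."""

    def __init__(
        self,
        fetch: Callable[[str], str],
        ttl: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch    # function that downloads a page body
        self._ttl = ttl        # how long a cached entry stays fresh
        self._clock = clock    # injectable clock, which makes testing easy
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, url: str) -> str:
        now = self._clock()
        entry = self._entries.get(url)
        # Serve from the cache while the entry is still fresh.
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]
        # Otherwise fetch the page again and remember when we did.
        body = self._fetch(url)
        self._entries[url] = (now, body)
        return body
```

With a cache like this in front of the source site, repeated API requests for the same page within the `ttl` window trigger only one download.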

Get Started

Installation

$ pip install toapi
$ toapi -v
toapi, version 2.0.0

Usage

Create app.py and copy in the following code:

from flask import request
from htmlparsing import Attr, Text
from toapi import Api, Item

api = Api()


# Post: one entry per element matching '.athing' on the source page.
# Each route maps an API path (first argument) to a source path (second).
@api.site('https://news.ycombinator.com')
@api.list('.athing')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Post(Item):
    url = Attr('.storylink', 'href')
    title = Text('.storylink')


# Page: a single item per page that exposes the link to the next page.
@api.site('https://news.ycombinator.com')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Page(Item):
    next_page = Attr('.morelink', 'href')

    def clean_next_page(self, value):
        # Rewrite the source link (/news?p=N) into this API's own URL.
        return api.convert_string('/' + value, '/news?p={page}', request.host_url.strip('/') + '/posts?page={page}')


api.run(debug=True, host='0.0.0.0', port=5000)
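
The route decorators in app.py pair an API path such as /posts?page={page} with a source path such as /news?p={page}, and clean_next_page translates a link in the opposite direction. The sketch below shows one way such a template-to-template conversion could work. It is a standalone illustration, not Toapi's implementation: `template_to_regex` and `convert_path` are hypothetical helpers, and the assumption that a placeholder matches any run of characters other than `/`, `&`, `?`, and `#` is mine.

```python
import re


def template_to_regex(template: str) -> re.Pattern:
    """Turn '/news?p={page}' into a regex with one named group per placeholder."""
    parts = re.split(r"\{(\w+)\}", template)
    # Even indexes hold literal text, odd indexes hold placeholder names.
    pattern = "".join(
        re.escape(part) if index % 2 == 0 else f"(?P<{part}>[^/&?#]+)"
        for index, part in enumerate(parts)
    )
    return re.compile(f"^{pattern}$")


def convert_path(path: str, source: str, target: str) -> str:
    """Rewrite `path` from the `source` template into the `target` template."""
    match = template_to_regex(source).match(path)
    if match is None:
        raise ValueError(f"{path!r} does not match template {source!r}")
    # Fill the target template with the values captured from the path.
    return target.format(**match.groupdict())


# Source link to API link, as clean_next_page does for the "More" link.
api_link = convert_path("/news?p=2", "/news?p={page}", "/posts?page={page}")

# API request to source request, as a route mapping would need.
source_link = convert_path("/posts?page=3", "/posts?page={page}", "/news?p={page}")
```

Here `api_link` is "/posts?page=2" and `source_link` is "/news?p=3".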

Run python app.py.

Then open your browser and visit http://127.0.0.1:5000/posts?page=1

You will get a result like this:

{
  "Page": {
    "next_page": "http://127.0.0.1:5000/posts?page=2"
  }, 
  "Post": [
    {
      "title": "Mathematicians Crack the Cursed Curve", 
      "url": "https://www.quantamagazine.org/mathematicians-crack-the-cursed-curve-20171207/"
    }, 
    {
      "title": "Stuffing a Tesla Drivetrain into a 1981 Honda Accord", 
      "url": "https://jalopnik.com/this-glorious-madman-stuffed-a-p85-tesla-drivetrain-int-1823461909"
    }
  ]
}
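
The same result can be consumed from a script. The following is a minimal client sketch using only the standard library; it assumes app.py is running locally on port 5000 and that the response has the "Page" and "Post" keys shown above. The helper names `fetch_posts`, `post_titles`, and `next_page_url` are hypothetical and not part of Toapi.

```python
import json
from typing import List, Optional
from urllib.request import urlopen


def fetch_posts(base_url: str, page: int = 1) -> dict:
    """Request one page of posts from a running Toapi app and decode the JSON."""
    with urlopen(f"{base_url}/posts?page={page}") as response:
        return json.load(response)


def post_titles(payload: dict) -> List[str]:
    """Collect the title of every entry under the "Post" key."""
    return [post["title"] for post in payload.get("Post", [])]


def next_page_url(payload: dict) -> Optional[str]:
    """Return the "next_page" link if the payload has one, otherwise None."""
    return payload.get("Page", {}).get("next_page")
```

With the server running, `post_titles(fetch_posts("http://127.0.0.1:5000"))` returns the list of titles for the first page, and `next_page_url` gives the URL to request next.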

Todo

Visualization: create a Toapi project in a web page by drag and drop.

Contributing

Write code, add tests, and open a pull request.

Owner

Jiuli Gao
