Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Last update: Dec 01, 2021

Related tags

Overview

opendata

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

import asyncio
from opendata.sources.bikeshare.bay_wheels import trips as bay_wheels

trips_df, _ = asyncio.run(bay_wheels.async_load(trip_sample_rate=1000))

len(trips_df.index)
# 8731

trips_df.columns
# Index(['started_at', 'ended_at', 'start_station_id', 'end_station_id',
#        'start_station_name', 'end_station_name', 'rideable_type', 'ride_id',
#        'start_lat', 'start_lng', 'end_lat', 'end_lng', 'gender', 'user_type',
#        'bike_id', 'birth_year'],
#       dtype='object')

An example analysis can be found here: https://observablehq.com/@brady/bikeshare

Supports sampling and local file caching to improve performance.

Markets supported

import opendata.sources.bikeshare.bay_wheels
import opendata.sources.bikeshare.bixi
import opendata.sources.bikeshare.divvy
import opendata.sources.bikeshare.capital_bikeshare
import opendata.sources.bikeshare.citi_bike
import opendata.sources.bikeshare.cogo
import opendata.sources.bikeshare.niceride
import opendata.sources.bikeshare.bluebikes
import opendata.sources.bikeshare.metro_bike_share
import opendata.sources.bikeshare.indego

Bootstrap

Set up your environment

brew install chromedriver
brew install python3
python3 -m pip install pre-commit

pre-commit install --install-hooks
python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Entering virtualenv

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Usage

Try the test export to CSV:

python3 test.py

Updating pip requirements

pip-compile

Pre-commit setup

pre-commit install --install-hooks

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Related tags

Overview

opendata

Markets supported

Bootstrap

Entering virtualenv

Usage

Updating pip requirements

Pre-commit setup

Bikeshare markets to add

USA

World

Owner

Brady Law

track your GitHub statistics

Single machine, multiple cards training; mix-precision training; DALI data loader.

Spectacular AI SDK fuses data from cameras and IMU sensors and outputs an accurate 6-degree-of-freedom pose of a device.

Analyzing Covid-19 Outbreaks in Ontario

Cleaning and analysing aggregated UK political polling data.

Flexible HDF5 saving/loading and other data science tools from the University of Chicago

Randomisation-based inference in Python based on data resampling and permutation.

BasstatPL is a package for performing different tabulations and calculations for descriptive statistics.

Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

Scraping and analysis of leetcode-compensations page.

Fancy data functions that will make your life as a data scientist easier.

Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames

Udacity-api-reporting-pipeline - Udacity api reporting pipeline

A data parser for the internal syncing data format used by Fog of World.

Template for a Dataflow Flex Template in Python

Single-Cell Analysis in Python. Scales to >1M cells.

Unsub is a collection analysis tool that assists libraries in analyzing their journal subscriptions.

.npy, .npz, .mtx converter.

Display the behaviour of a realtime program with a scope or logic analyser.

Vectorizers for a range of different data types