Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Last update: Dec 01, 2021

opendata

Each supported market has its own module. For example, to load a sample of Bay Wheels trips:

import asyncio
from opendata.sources.bikeshare.bay_wheels import trips as bay_wheels

trips_df, _ = asyncio.run(bay_wheels.async_load(trip_sample_rate=1000))

len(trips_df.index)
# 8731

trips_df.columns
# Index(['started_at', 'ended_at', 'start_station_id', 'end_station_id',
#        'start_station_name', 'end_station_name', 'rideable_type', 'ride_id',
#        'start_lat', 'start_lng', 'end_lat', 'end_lng', 'gender', 'user_type',
#        'bike_id', 'birth_year'],
#       dtype='object')

An example analysis can be found here: https://observablehq.com/@brady/bikeshare
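
Because every market is standardized to the same column names, analysis code can be written once and reused. As a minimal sketch, assuming started_at and ended_at hold parseable timestamps, trip durations could be derived as follows. The rows and the user_type values below are made-up illustrative data, not real trips or output from this package:

```python
import pandas as pd

# Made-up rows that mimic three of the standardized columns listed above.
trips_df = pd.DataFrame(
    {
        "started_at": ["2021-06-01 08:00:00", "2021-06-01 09:15:00"],
        "ended_at": ["2021-06-01 08:12:30", "2021-06-01 09:45:00"],
        "user_type": ["member", "casual"],
    }
)

# Parse the timestamps (harmless if the columns are already datetimes).
started = pd.to_datetime(trips_df["started_at"])
ended = pd.to_datetime(trips_df["ended_at"])

# Trip duration in minutes, stored as a new column.
trips_df["duration_min"] = (ended - started).dt.total_seconds() / 60

# Average duration per rider category.
mean_by_user_type = trips_df.groupby("user_type")["duration_min"].mean()
```

The same lines should apply to a frame returned by any of the market modules, provided the columns are populated for that market.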

Loaders support sampling (for example, the trip_sample_rate argument above) and local file caching to improve performance.
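
To illustrate the general idea behind those two features, here is a standard-library sketch. It is not this package's implementation: cached_fetch, sample_every_nth, and fake_download are hypothetical names, and keeping every Nth row is only one possible sampling strategy (how trip_sample_rate actually selects rows is not stated here).

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Callable, Iterable, List


def cached_fetch(url: str, cache_dir: Path, download: Callable[[str], bytes]) -> bytes:
    """Return the bytes for `url`, calling `download` only on a cache miss."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Hash the URL so it maps to a safe, fixed-length file name.
    cache_path = cache_dir / hashlib.sha256(url.encode("utf-8")).hexdigest()
    if cache_path.exists():
        return cache_path.read_bytes()
    data = download(url)
    cache_path.write_bytes(data)
    return data


def sample_every_nth(rows: Iterable[str], rate: int) -> List[str]:
    """Keep one row out of every `rate` rows (a systematic sample)."""
    if rate < 1:
        raise ValueError("rate must be >= 1")
    return [row for i, row in enumerate(rows) if i % rate == 0]


# Fake downloader so the sketch runs offline; it records each call.
calls: List[str] = []


def fake_download(url: str) -> bytes:
    calls.append(url)
    return "\n".join(f"ride-{i}" for i in range(10)).encode("utf-8")


with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "cache"
    first = cached_fetch("https://example.com/trips.csv", cache, fake_download)
    # The second call is served from disk, so fake_download is not called again.
    second = cached_fetch("https://example.com/trips.csv", cache, fake_download)

sampled = sample_every_nth(first.decode("utf-8").splitlines(), rate=5)
```

Caching avoids downloading large trip archives again on later runs, and sampling keeps the resulting frame small enough for quick exploration.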

Markets supported

import opendata.sources.bikeshare.bay_wheels
import opendata.sources.bikeshare.bixi
import opendata.sources.bikeshare.divvy
import opendata.sources.bikeshare.capital_bikeshare
import opendata.sources.bikeshare.citi_bike
import opendata.sources.bikeshare.cogo
import opendata.sources.bikeshare.niceride
import opendata.sources.bikeshare.bluebikes
import opendata.sources.bikeshare.metro_bike_share
import opendata.sources.bikeshare.indego

Bootstrap

Set up your environment (these commands assume Homebrew is installed):

brew install chromedriver
brew install python3
python3 -m pip install pre-commit

pre-commit install --install-hooks
python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Entering virtualenv

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Usage

Try the test export to CSV:

python3 test.py

Updating pip requirements

pip-compile

Pre-commit setup

pre-commit install --install-hooks

Owner

Brady Law
