Upcoming House Floor Schedule
This task scrapes the upcoming floor schedule for the U.S. House of Representatives from docs.house.gov/floor/.
This data provides information on bills and draft bills that are expected to be considered on the House floor in the near future.
Usage
To fetch the schedule for the current week and the next two weeks:
usc-run upcoming_house_floor
The script automatically determines the most recent available week and also scans two weeks ahead.
Options
- --week_of=<yyyymmdd>: Fetches the schedule for a specific week. The date provided should be a Monday. Example, fetching the schedule for the week of October 21, 2013:
usc-run upcoming_house_floor --week_of=20131021
- --download: By default, the scraper only fetches the schedule metadata. If this flag is present, it will also download all associated documents (e.g., bill text PDFs, reports) linked in the schedule and convert any PDFs to plain text.
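Because --week_of expects a Monday, it can help to normalize an arbitrary date before invoking the task. The following is a minimal sketch only; monday_of_week is a hypothetical helper and is not part of the scraper.

```python
from datetime import date, timedelta


def monday_of_week(d: date) -> str:
    """Return the Monday of the week containing d, formatted as yyyymmdd.

    Hypothetical helper for building a --week_of value; not part of the scraper.
    """
    # date.weekday() is 0 for Monday and 6 for Sunday, so subtracting it
    # moves any date back to the Monday of the same week.
    monday = d - timedelta(days=d.weekday())
    return monday.strftime("%Y%m%d")


# Wednesday, October 23, 2013 falls in the week of Monday, October 21, 2013.
print(monday_of_week(date(2013, 10, 23)))
```

The resulting string can be passed directly as the --week_of value.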
Output
The script generates a JSON file for each week scraped, located at:
data/upcoming_house_floor/{yyyymmdd}.json
If --download is enabled, associated documents are saved to:
data/upcoming_house_floor/{yyyymmdd}/{filename}
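As a rough illustration of the layout above, the sketch below builds both paths for a given week and loads the JSON file if it exists. The function names are hypothetical and not part of the scraper, the paths are assumed to be relative to the working directory the task was run from, and nothing is assumed about the structure of the JSON itself.

```python
import json
from pathlib import Path

# Assumed to be relative to the directory the task was run from.
DATA_DIR = Path("data/upcoming_house_floor")


def schedule_path(week_of: str, data_dir: Path = DATA_DIR) -> Path:
    """Path of the JSON file for the week identified by a yyyymmdd string."""
    return data_dir / f"{week_of}.json"


def documents_dir(week_of: str, data_dir: Path = DATA_DIR) -> Path:
    """Directory where documents fetched with --download are saved."""
    return data_dir / week_of


def load_schedule(week_of: str, data_dir: Path = DATA_DIR):
    """Return the parsed JSON for a week, or None if it has not been scraped."""
    path = schedule_path(week_of, data_dir)
    if not path.exists():
        return None
    with path.open(encoding="utf-8") as f:
        return json.load(f)
```

For example, load_schedule("20131021") would return the parsed contents of data/upcoming_house_floor/20131021.json once that week has been scraped.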