Upcoming House Floor Schedule

This task scrapes the upcoming floor schedule for the U.S. House of Representatives from docs.house.gov/floor/.

This data provides information on bills and draft bills that are expected to be considered on the House floor in the near future.

Usage

To fetch the schedule for the current week and the next two weeks:

usc-run upcoming_house_floor

The script automatically determines the most recent available week and also scans two weeks ahead.

Options

  • --week_of=<yyyymmdd>: Fetches the schedule for a specific week. The date provided should be a Monday.

    Example:

    // Fetch the schedule for the week of October 21, 2013
    usc-run upcoming_house_floor --week_of=20131021
    
  • --download: By default, the scraper only fetches the schedule metadata. If this flag is present, it will also download all associated documents (e.g., bill text PDFs, reports) linked in the schedule and convert any PDFs to plain text.

Output

The script generates a JSON file for each week scraped, located at:

data/upcoming_house_floor/{yyyymmdd}.json

If --download is enabled, associated documents are saved to:

data/upcoming_house_floor/{yyyymmdd}/{filename}