Handle streets where the name overlaps the street-type map #33

Open: wants to merge 1 commit into master

hannahwhy

Some addresses like

14168 W RIVER RD
COLUMBIA STATION, OH 44028-9430

are parsed with the street as "W", the street type as "River"
(which abbreviates to "riv"), and the city as "RD \nCOLUMBIA STATION".
The example addresses are all in Ohio because that's my current data set,
but the problem is not Ohio-specific: for example, in Illinois,
there's a River Road that follows the Des Plaines River.

The erroneous parse appears to come from the "100 South Street"
special case in the street regexp, which treats a directional as the
street name when a street type follows it. This commit adds a second
special case with higher precedence, matching [prefix, non-numeric
street, street type] sequences. The street match excludes numerics to
preserve the existing parse behavior for the "6641 N 2200 W Apt D304
Park City, UT 84098" case.
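
To illustrate the precedence idea outside the library, here is a minimal Python sketch. It is not the library's regexp: the direction and street-type tables, the group names, and `parse_street` are all hypothetical and heavily simplified, and it handles only the street line. It shows the old special case alone producing the bad parse, and the new case winning when it is tried first.

```python
import re

# Hypothetical, tiny stand-ins for the real direction and street-type maps.
DIRECTION = r"(?:N|S|E|W|NORTH|SOUTH|EAST|WEST)"
STREET_TYPE = r"(?:RD|ROAD|ST|STREET|RIV|RIVER|RDG|RIDGE)"

# New special case: [prefix, non-numeric street, street type].
# [^\d\s]+ keeps numeric streets such as "2200" out of this branch.
PREFIXED = (
    rf"(?P<prefix>{DIRECTION})\s+(?P<street>[^\d\s]+)\s+(?P<type>{STREET_TYPE})\b"
)

# Existing special case: a directional used as the street name
# ("100 South Street").
DIRECTION_AS_STREET = rf"(?P<street2>{DIRECTION})\s+(?P<type2>{STREET_TYPE})\b"

# Old behavior (assumed): only the directional-as-street case is available.
OLD_STREET_LINE = re.compile(
    rf"^(?P<number>\d+)\s+{DIRECTION_AS_STREET}", re.IGNORECASE
)

# New behavior: the prefixed case comes first in the alternation, so it is
# tried before the directional-as-street case.
STREET_LINE = re.compile(
    rf"^(?P<number>\d+)\s+(?:{PREFIXED}|{DIRECTION_AS_STREET})", re.IGNORECASE
)


def parse_street(line):
    """Return number/prefix/street/type for a street line, or None."""
    m = STREET_LINE.match(line)
    if m is None:
        return None
    return {
        "number": m.group("number"),
        "prefix": m.group("prefix"),  # None when the old case matched
        "street": m.group("street") or m.group("street2"),
        "type": m.group("type") or m.group("type2"),
    }
```

With these assumed tables, `OLD_STREET_LINE` matches "14168 W RIVER RD" with street "W" and type "RIVER", which is the bad parse described above. `parse_street("14168 W RIVER RD")` instead yields prefix "W", street "RIVER", type "RD". "100 South Street" still falls through to the older case, and "6641 N 2200 W" is not claimed by either special case in this sketch, so the numeric-street handling elsewhere in the real regexp would be left to deal with it.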

@SalvatoreT

@derrek, this seems pretty neat.

@jsmestad

jsmestad commented May 7, 2017

Really would like to see this. "Ridge Road" suffers from this as well.
