Processing Government Data: ZIP Codes, Python, and OpenRefine
Processing Government Data: ZIP Codes, Python, and OpenRefine
Blog Article
While there is a vast amount of useful US government data on the web, some of it is in a raw state that is not readily accessible to the average user.Data librarians can improve accessibility and usability for their patrons by processing data to create subsets jordan 12 nubuck of local interest and by appending geographic identifiers to help users select and aggregate data.This case study illustrates vector gp68hx 12vh-012ca how census geography crosswalks, Python, and OpenRefine were used to create spreadsheets of non-profit organizations in New York City from the IRS Tax-Exempt Organization Masterfile.
This paper illustrates the utility of Python for data librarians and should be particularly insightful for those who work with address-based data.