Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Data processing

City University of New York (CUNY)

Articles 1 - 1 of 1

Full-Text Articles in Social and Behavioral Sciences

Processing Government Data: Zip Codes, Python, And Openrefine, Frank Donnelly Jul 2014

Processing Government Data: Zip Codes, Python, And Openrefine, Frank Donnelly

Publications and Research

While there is a vast amount of useful US government data on the web, some of it is in a raw state that is not readily accessible to the average user. Data librarians can improve accessibility and usability for their patrons by processing data to create subsets of local interest and by appending geographic identifiers to help users select and aggregate data. This case study illustrates how census geography crosswalks, Python, and OpenRefine were used to create spreadsheets of non-profit organizations in New York City from the IRS Tax-Exempt Organization Masterfile. This paper illustrates the utility of Python for data …