Open Access. Powered by Scholars. Published by Universities.®

Science and Technology Studies Commons

Open Access. Powered by Scholars. Published by Universities.®

2012

alan l porter

Data cleaning

Articles 1 - 1 of 1

Full-Text Articles in Science and Technology Studies

Text Clumping For Technical Intelligence, Alan L. Porter, Yi Zhang Jan 2012

Text Clumping For Technical Intelligence, Alan L. Porter, Yi Zhang

alan l porter

This chapter presents a stepwise process to clean and consolidate sizable phrase compilations. We focus on Science, Technology and Innovation (ST&I) information sets, typically in the form of abstract records retrieved from topical database searches (e.g., Web of Science, Derwent World Patent Index, Factiva). Our aim is to devise a semi-automated desktop process that can rapidly concentrate lists of informative terms and phrases. Those might then be reviewed by topic experts or otherwise processed to fuel further analyses to gain topic-intensive technical intelligence. We are expressly interested, as well, in further processing of such clumped phrases to generate interpretable topic …