Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Articles 1 - 22 of 22

Full-Text Articles in Computer Sciences

Mining A Digital Library For Influential Authors, David Mimno, Andrew McCallum Jan 2007


When browsing a digital library of research papers, it is natural to ask which authors are most influential in a particular topic. We present a probabilistic model that ranks authors based on their influence in particular areas of scientific research. This model combines several sources of information: citation information between documents as represented by PageRank scores, authorship data gathered through automatic information extraction, and the words in paper abstracts. We propose a topic model over the words, and compare its performance against a smoothed language model by assessing the number of major award winners in the resulting ranked list of researchers.
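
As a concrete illustration of how such evidence might be combined, here is a minimal sketch assuming papers already carry PageRank scores and per-topic proportions; the dictionary keys and the additive scoring rule are illustrative, not the paper's actual probabilistic model:

```python
from collections import defaultdict

def rank_authors(papers, topic):
    """Rank authors by summed (PageRank x topic proportion) over their papers.
    `papers` is a list of dicts with hypothetical keys 'authors', 'pagerank',
    and 'topics' (a topic -> proportion mapping)."""
    influence = defaultdict(float)
    for p in papers:
        weight = p["pagerank"] * p["topics"].get(topic, 0.0)
        for author in p["authors"]:
            influence[author] += weight
    return sorted(influence.items(), key=lambda kv: -kv[1])

papers = [
    {"authors": ["A", "B"], "pagerank": 0.9, "topics": {"ir": 0.7, "ml": 0.3}},
    {"authors": ["B"],      "pagerank": 0.4, "topics": {"ir": 0.2, "ml": 0.8}},
]
print(rank_authors(papers, "ir"))
```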


Resource-Bounded Information Gathering For Correlation Clustering, Pallika Kanani, Andrew McCallum Jan 2007


We present a new class of problems, called Resource-bounded Information Gathering for Correlation Clustering. Our goal is to perform correlation clustering on a graph with incomplete information. The missing information can be obtained by querying an external source under constrained resources. The problem is to develop the most effective strategy for querying to achieve optimal clustering. We describe the problem using entity resolution as an example task.
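
A hedged sketch of the setting: the querying strategy shown (spending the budget on the least-confident edges) is illustrative, not the authors' method, and `query_source` stands in for an abstract external oracle:

```python
def cluster(nodes, edges):
    """Greedy stand-in for correlation clustering: union nodes joined by any
    positive-score edge (real solvers minimize disagreements globally)."""
    parent = {n: n for n in nodes}
    def find(x):
        while parent[x] != x:
            x = parent[x]
        return x
    for (i, j), score in edges.items():
        if score > 0:
            parent[find(i)] = find(j)
    return {n: find(n) for n in nodes}

def gather_then_cluster(nodes, edges, query_source, budget):
    # Spend the budget on the edges whose scores are closest to zero,
    # i.e. where the clustering decision is least certain.
    for e in sorted(edges, key=lambda e: abs(edges[e]))[:budget]:
        edges[e] = query_source(*e)  # hypothetical oracle returning +/- score
    return cluster(nodes, edges)
```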


Expertise Modeling For Matching Papers With Reviewers, David Mimno, Andrew McCallum Jan 2007


An essential part of an expert-finding task, such as matching reviewers to submitted papers, is the ability to model the expertise of a person based on documents. We evaluate several measures of the association between an author in an existing collection of research papers and a previously unseen document. We compare two language model based approaches with a novel topic model, Author-Persona-Topic (APT). In this model, each author can write under one or more ``personas,'' which are represented as independent distributions over hidden topics. Examples of previous papers written by prospective reviewers are gathered from the Rexa database, which extracts …
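
One plausible way to score an author against an unseen document under a persona-style model, assuming trained distributions are given; `personas` and `topic_word` are hypothetical containers, not the paper's actual inference:

```python
import math

def apt_score(author, doc_words, personas, topic_word, floor=1e-6):
    """Log-likelihood of the document under the author's best persona.
    `personas[author]` is a list of topic mixtures (topic -> weight);
    `topic_word[t]` is a word distribution for topic t."""
    best = float("-inf")
    for mixture in personas[author]:
        ll = 0.0
        for w in doc_words:
            p = sum(weight * topic_word[t].get(w, floor)
                    for t, weight in mixture.items())
            ll += math.log(p)
        best = max(best, ll)  # the author matches through their best persona
    return best
```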


Sparse Message Passing Algorithms For Weighted Maximum Satisfiability, Aron Culotta, Andrew McCallum, Bart Selman, Ashish Sabharwal Jan 2007


Weighted maximum satisfiability is a well-studied problem that has important applicability to artificial intelligence (for instance, MPE inference in Bayesian networks). General-purpose stochastic search algorithms have proven to be accurate and efficient for large problem instances; however, these algorithms largely ignore structural properties of the input. For example, many problems are highly clustered, in that they contain a collection of loosely coupled subproblems (e.g. pipelines of NLP tasks). In this paper, we propose a message passing algorithm to solve weighted maximum satisfiability problems that exhibit this clustering property. Our algorithm fuses local solutions to each subproblem into a global solution …
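
For reference, the underlying objective in standard notation (generic weighted MAX-SAT, not specific to this paper): over truth assignments x of the n variables, maximize the total weight of satisfied clauses c in C.

```latex
% Generic weighted MAX-SAT: choose the assignment x maximizing the
% total weight of satisfied clauses.
\max_{x \in \{0,1\}^n} \; \sum_{c \in C} w_c \, \mathbf{1}\!\left[\, x \models c \,\right]
```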


Semi-Supervised Classification With Hybrid Generative/Discriminative Methods, Gregory Druck, Chris Pal, Xiaojin Zhu, Andrew McCallum Jan 2007


In this paper, we study semi-supervised learning using hybrid generative/discriminative methods. Specifically, we compare two recently proposed frameworks for combining generative and discriminative classifiers and apply them to semi-supervised classification. In both cases we explore the tradeoff between maximizing a discriminative likelihood of labeled data and a generative likelihood of unlabeled data. While prominent semi-supervised learning methods assume low density regions between classes or are subject to generative modeling assumptions, hybrid generative/discriminative methods allow semi-supervised learning in the presence of strongly overlapping classes and reduce the risk of modeling structure in the unlabeled data that is irrelevant for the specific …
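
In generic form, the tradeoff can be written as a single weighted objective (illustrative; the two frameworks compared in the paper instantiate this differently):

```latex
% Illustrative combined objective: a discriminative term on labeled pairs
% and a generative term on unlabeled inputs, traded off by \lambda.
\max_{\theta} \;
  \sum_{i \in \mathcal{L}} \log p_{\theta}(y_i \mid x_i)
  \;+\; \lambda \sum_{j \in \mathcal{U}} \log p_{\theta}(x_j)
```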


Improving Author Coreference By Resource-Bounded Information Gathering From The Web, Pallika Kanani, Andrew McCallum, Chris Pal Jan 2007


Accurate entity resolution is sometimes impossible simply due to insufficient information. For example, in research paper author name resolution, even clever use of venue, title, and co-authorship relations is often not enough to make a confident coreference decision. This paper presents several methods for increasing accuracy by gathering and integrating additional evidence from the web. We formulate the coreference problem as one of graph partitioning with discriminatively-trained edge weights, and then incorporate web information either as additional features or as additional nodes in the graph. Since the web is too large to incorporate all its data, we need an efficient …
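
A minimal sketch of how web evidence can enter as an extra pairwise feature; the feature names, record fields, and linear scoring form are illustrative (the paper trains the edge weights discriminatively):

```python
def edge_weight(rec1, rec2, weights, web_cooccurrence):
    """Score a candidate coreference edge; all feature names and the
    `web_cooccurrence` callable (e.g. 'do both papers appear on one
    author homepage?') are hypothetical."""
    feats = {
        "same_venue": float(rec1["venue"] == rec2["venue"]),
        "shared_coauthor": float(bool(set(rec1["coauthors"])
                                      & set(rec2["coauthors"]))),
        "web_cooccur": web_cooccurrence(rec1, rec2),
    }
    return sum(weights[k] * v for k, v in feats.items())
```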


Topical N-Grams: Phrase And Topic Discovery, With An Application To Information Retrieval, Xuerui Wang, Andrew McCallum, Xing Wei Jan 2007


Most topic models, such as latent Dirichlet allocation, rely on the bag of words assumption. However, word order and phrases are often critical to capturing the meaning of text. This paper presents Topical N-grams, a topic model that discovers topics as well as the individual words and phrases that define their meaning. The probabilistic model generates words in their textual order by, for each word, first sampling a topic, then sampling its status as a unigram or bigram, then sampling the word from a topic-specific unigram or bigram distribution. Thus our model can represent that the phrase ``white house'' has …
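
The generative story in the abstract translates almost line-for-line into code; the sketch below assumes trained parameter containers (`theta_d`, `unigram`, `bigram`, and `bi_prob` are illustrative names, not the paper's notation):

```python
import random

def draw(dist):
    """Sample a key from a dict mapping items to probabilities."""
    r, acc = random.random(), 0.0
    for item, p in dist.items():
        acc += p
        if r <= acc:
            return item
    return item  # guard against floating-point shortfall

def generate(n_words, theta_d, unigram, bigram, bi_prob, start="<s>"):
    words, prev = [], start
    for _ in range(n_words):
        t = draw(theta_d)                                        # 1. sample a topic
        as_bigram = random.random() < bi_prob[t].get(prev, 0.0)  # 2. unigram/bigram status
        if as_bigram and prev in bigram[t]:
            w = draw(bigram[t][prev])    # 3a. topic- and context-specific distribution
        else:
            w = draw(unigram[t])         # 3b. topic-specific unigram distribution
        words.append(w)
        prev = w
    return words
```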


Lightly-Supervised Attribute Extraction, Kedar Bellare, Partha Pratim Talukdar, Giridhar Kumaran, Fernando Pereira, Mark Liberman, Andrew McCallum, Mark Dredze Jan 2007


Web search engines can greatly benefit from knowledge about attributes of entities present in search queries. In this paper, we introduce lightly-supervised methods for extracting entity attributes from natural language text. Using these methods, we are able to extract large numbers of attributes of different entities at fairly high precision from a large natural language corpus. We compare our methods against a previously proposed pattern-based relation extractor, showing that the new methods give considerable improvements over that baseline. We also demonstrate that query expansion using extracted attributes improves retrieval performance on underspecified information-seeking queries.
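
For intuition, a toy version of the kind of pattern-based extractor used as the baseline; the specific "the X of Y" pattern is illustrative, not one of the paper's patterns:

```python
import re

# "the X of Y" often signals that X is an attribute of entity Y.
PATTERN = re.compile(r"\bthe (\w+) of (?:the )?([A-Z]\w+)")

def extract_attributes(text):
    return [(m.group(1), m.group(2)) for m in PATTERN.finditer(text)]

print(extract_attributes("We checked the population of France "
                         "and the altitude of Everest."))
# -> [('population', 'France'), ('altitude', 'Everest')]
```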


Generalized Component Analysis For Text With Heterogeneous Attributes, Xuerui Wang, Chris Pal, Andrew McCallum Jan 2007


We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. Our model generalizes techniques such as Principal Component Analysis to heterogeneous data types. In contrast to other approaches, this framework allows modalities such as words, authors and timestamps to be captured in their natural, probabilistic encodings. We demonstrate the effectiveness of our framework on the task of author prediction from 13 years of the NIPS conference proceedings and for a recipient prediction task using a 10-month academic email archive of a researcher. Our approach should be …


Penn/UMass/CHOP BioCreative II Systems, Kuzman Ganchev, Koby Crammer, Fernando Pereira, Gideon Mann, Kedar Bellare, Andrew McCallum, Steve Carroll, Yang Jin, Peter White Jan 2007


Our team participated in the entity tagging and normalization tasks of Biocreative II. For the entity tagging task, we used a k-best MIRA learning algorithm with lexicons and automatically derived word clusters. MIRA accommodates different training loss functions, which allowed us to exploit gene alternatives in training. We also performed a greedy search over feature templates and the development data, achieving a final F-measure of 86.28%. For the normalization task, we proposed a new specialized on-line learning algorithm and applied it for filtering out false positives from a high recall list of candidates. For normalization we received an F-measure of …
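
The core of a MIRA-style update, shown 1-best for brevity (the system uses k-best MIRA; the names and hinge form below are the standard formulation, not the team's exact code):

```python
def mira_update(w, feats_gold, feats_pred, loss, C=0.1):
    """Scale the update so the gold tagging outscores the prediction by at
    least `loss`, moving the weights as little as possible (capped by C)."""
    diff = {k: feats_gold.get(k, 0.0) - feats_pred.get(k, 0.0)
            for k in set(feats_gold) | set(feats_pred)}
    margin = sum(w.get(k, 0.0) * v for k, v in diff.items())
    norm2 = sum(v * v for v in diff.values())
    if norm2 > 0.0:
        tau = min(C, max(0.0, loss - margin) / norm2)
        for k, v in diff.items():
            w[k] = w.get(k, 0.0) + tau * v
    return w
```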


Efficient Strategies For Improving Partitioning-Based Author Coreference By Incorporating Web Pages As Graph Nodes, Pallika Kanani, Andrew McCallum Jan 2007


Entity resolution in the research paper domain is an important but difficult problem. It suffers from insufficient contextual information, so using information from the web significantly improves performance. We formulate the author coreference problem as one of graph partitioning with discriminatively-trained edge weights. Building on our previous work, we present improved and more comprehensive results for the method in which we incorporate web documents as additional nodes in the graph. We also propose efficient strategies to select a subset of nodes to add to the graph and to select a subset of queries to gather additional nodes, without significant loss …


Mixtures Of Hierarchical Topics With Pachinko Allocation, David Mimno, Wei Li, Andrew McCallum Jan 2007


The four-level Pachinko Allocation model (PAM) represents correlations among topics using a DAG structure. It does not, however, represent a nested hierarchy of topics, with some topical word distributions representing the vocabulary that is shared among several more specific topics. This paper presents Hierarchical PAM---an enhancement that explicitly represents a topic hierarchy. This model can be seen as combining the advantages of hLDA's topical hierarchy representation with PAM's ability to mix multiple leaves of the topic hierarchy. Experimental results show improvements in likelihood of held-out documents, as well as mutual information between automatically-discovered topics and human-generated categories such as journals …


Cryptogram Decoding For OCR Using Numerization Strings, Gary Huang, Erik Learned-Miller, Andrew McCallum Jan 2007


OCR systems for printed documents typically require large numbers of font styles and character models to work well. When given an unseen font, performance degrades even in the absence of noise. In this paper, we perform OCR in an unsupervised fashion without using any character models by using a cryptogram decoding algorithm. We present results on real and artificial OCR data.
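
The simplest cryptogram-style decoder, far cruder than the paper's algorithm, assuming glyphs have already been clustered into symbol IDs upstream; this uses frequency-rank matching only, and `ENGLISH_BY_FREQ` is a stock frequency ordering:

```python
from collections import Counter

ENGLISH_BY_FREQ = "etaoinshrdlcumwfgypbvkjxqz"

def decode(symbol_stream):
    """Map each glyph-cluster ID to a letter by matching frequency ranks."""
    ranked = [s for s, _ in Counter(symbol_stream).most_common()]
    mapping = dict(zip(ranked, ENGLISH_BY_FREQ))
    return "".join(mapping.get(s, "?") for s in symbol_stream)
```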


Organizing The OCA: Learning Faceted Subjects From A Library Of Digital Books, David Mimno, Andrew McCallum Jan 2007


Large scale library digitization projects such as the Open Content Alliance are producing vast quantities of text, but little has been done to organize this data. Subject headings inherited from card catalogs are useful but limited, while full-text indexing is most appropriate for readers who already know exactly what they want. Statistical topic models provide a complementary function. These models can identify semantically coherent ``topics'' that are easily recognizable and meaningful to humans, but they have been too computationally intensive to run on library-scale corpora. This paper presents DCM-LDA, a topic model based on Dirichlet Compound Multinomial distributions. This model …
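
For reference, the Dirichlet Compound Multinomial (Polya) likelihood that gives the model its name, in its standard form for a word sequence of length N with per-word counts n_w (notation may differ from the paper's):

```latex
% Probability of an exchangeable word sequence of length N with counts n_w
% under DCM parameters alpha_w:
p(w_1, \ldots, w_N \mid \boldsymbol{\alpha}) =
  \frac{\Gamma\!\left(\sum_{w} \alpha_w\right)}
       {\Gamma\!\left(\sum_{w} \alpha_w + N\right)}
  \prod_{w} \frac{\Gamma(\alpha_w + n_w)}{\Gamma(\alpha_w)}
```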


Efficient Computation Of Entropy Gradient For Semi-Supervised Conditional Random Fields, Gideon S. Mann, Andrew McCallum Jan 2007


Entropy regularization is a straightforward and successful method of semi-supervised learning that augments the traditional conditional likelihood objective function with an additional term that aims to minimize the predicted label entropy on unlabeled data. It has previously been demonstrated to provide positive results in linear-chain CRFs, but the published method for calculating the entropy gradient requires significantly more computation than supervised CRF training. This paper presents a new derivation and dynamic program for calculating the entropy gradient that is significantly more efficient---having the same asymptotic time complexity as supervised CRF training. We also present efficient generalizations of this method for …
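
The objective being optimized, in the standard form of entropy regularization (L is the labeled set, U the unlabeled set, and lambda the tradeoff weight):

```latex
% Conditional likelihood on labeled data minus predicted label entropy
% on unlabeled data:
\max_{\theta} \;
  \sum_{(x, y) \in \mathcal{L}} \log p_{\theta}(y \mid x)
  \;-\; \lambda \sum_{x' \in \mathcal{U}} H\!\bigl(p_{\theta}(\cdot \mid x')\bigr)
```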


Canonicalization Of Database Records Using Adaptive Similarity Measures, Aron Culotta, Michael Wick, Robert Hall, Matthew Marzilli, Andrew McCallum Jan 2007


It is becoming increasingly common to construct databases from information automatically culled from many heterogeneous sources. For example, a research publication database can be constructed by automatically extracting titles, authors, and conference information from papers and their references. A common difficulty in consolidating data from multiple sources is that records are referenced in a variety of ways (e.g. abbreviations, aliases, and misspellings). Therefore, it can be difficult to construct a single, standard representation to present to the user. We refer to the task of constructing this representation as canonicalization. Despite its importance, there is very little existing work on canonicalization. …
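
A simple stand-in for the task (the paper learns the similarity measure adaptively; the fixed `difflib` ratio below is only a placeholder): pick the observed string most similar, on average, to all other mentions.

```python
import difflib

def canonicalize(mentions):
    """Return the mention most similar, on average, to all the others."""
    def avg_sim(s):
        return sum(difflib.SequenceMatcher(None, s, o).ratio()
                   for o in mentions) / len(mentions)
    return max(mentions, key=avg_sim)

print(canonicalize(["Proc. of ICML", "Proceedings of ICML", "ICML proc."]))
```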


Improved Dynamic Schedules For Belief Propagation, Charles Sutton, Andrew McCallum Jan 2007


Belief propagation and its variants are popular methods for approximate inference, but their running time and even their convergence depend greatly on the schedule used to send the messages. Recently, dynamic update schedules, notably the residual BP (RBP) schedule of Elidan et al. [2006], have been shown to converge much faster on hard networks than static schedules. But RBP wastes message updates: many messages are computed solely to determine their priority, and are never actually performed. In this paper, we show that estimating the residual, rather than calculating it directly, leads to significant decreases in the number of messages …
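
The scheduling loop at issue, in sketch form; this shows the generic residual-driven dynamics, while the paper's contribution is making `residual` a cheap estimate rather than a full message computation. All callables here are assumed inputs:

```python
import heapq

def residual_schedule(messages, residual, apply_update, dependents, tol=1e-6):
    """messages: iterable of message ids; residual(m): current (possibly
    estimated) residual; apply_update(m): compute and send message m;
    dependents(m): messages whose residuals change once m is sent."""
    heap = [(-residual(m), m) for m in messages]
    heapq.heapify(heap)
    while heap:
        _, m = heapq.heappop(heap)
        if residual(m) <= tol:
            continue                      # stale entry or converged message
        apply_update(m)                   # send the highest-residual message
        for d in dependents(m):           # downstream residuals just changed
            heapq.heappush(heap, (-residual(d), d))
```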


Learning Extractors From Unlabeled Text Using Relevant Databases, Kedar Bellare, Andrew McCallum Jan 2007


Supervised machine learning algorithms for information extraction generally require large amounts of training data. In many cases where labeling training data is burdensome, there may, however, already exist an incomplete database relevant to the task at hand. Records from this database can be used to label text strings that express the same information. For tasks where text strings do not follow the same format or layout, and additionally may contain extra information, labeling the strings completely may be problematic. This paper presents a method for training extractors which fill in missing labels of a text sequence that is partially labeled …


Nonparametric Bayes Pachinko Allocation, Wei Li, David Blei, Andrew McCallum Jan 2007


Recent advances in topic models have explored complicated structured distributions to represent topic correlation. For example, the pachinko allocation model (PAM) captures arbitrary, nested, and possibly sparse correlations between topics using a directed acyclic graph (DAG). While PAM provides more flexibility and greater expressive power than previous models like latent Dirichlet allocation (LDA), it is also more difficult to determine the appropriate topic structure for a specific dataset. In this paper, we propose a …


Leveraging Existing Resources Using Generalized Expectation Criteria, Gregory Druck, Gideon Mann, Andrew McCallum Jan 2007


It is difficult to apply machine learning to many real-world tasks because there are no existing labeled instances. In one solution to this problem, a human expert provides instance labels that are used in traditional supervised or semi-supervised training. Instead, we want a solution that allows us to leverage existing resources other than complete labeled instances. We propose the use of generalized expectation (GE) criteria to achieve this goal. A GE criterion is a term in a training objective function that assigns a score to values of a model expectation. In this paper, the expectations are model predicted class distributions …
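
A concrete example of a GE criterion of the kind described, written as a penalty on the divergence between a target class distribution and the model's expected class distribution on unlabeled data U (a generic form; the notation is illustrative, not the paper's):

```latex
% Reward model expectations that match a target class distribution
% \tilde{p}(y) on unlabeled data U:
G(\theta) = -\,\mathrm{KL}\!\left(
  \tilde{p}(y) \;\Big\|\;
  \tfrac{1}{|U|} \textstyle\sum_{x \in U} p_{\theta}(y \mid x)
\right)
```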


Undirected And Interpretable Continuous Topic Models Of Documents, X. Wang, K. Crammer, Andrew McCallum Jan 2007


We propose a new type of undirected graphical model suitable for topic modeling and dimensionality reduction for large text collections. Unlike previous Boltzmann machine and harmonium based methods, this new model represents words using discrete distributions akin to traditional `bag-of-words' methods. However, in contrast to directed topic models such as latent Dirichlet allocation, each word is drawn from a distribution that takes into account all possible topics, as opposed to a topic-specific distribution. Furthermore, our models use positive continuous valued latent variables and learn more interpretable latent topic spaces than previous undirected techniques. As with other undirected models, once such models …


Author Disambiguation Using Error-Driven Machine Learning With A Ranking Loss Function, Aron Culotta, Pallika Kanani, Robert Hall, Michael Wick, Andrew McCallum Jan 2007


Author disambiguation is the problem of determining whether records in a publications database that contain similar author names refer to the same person. This task can be especially difficult when the database is constructed from automatically extracted data, which can contain noisy and incomplete records. A common supervised machine learning approach to author disambiguation is to build a classifier that predicts whether a pair of records is coreferent, often followed by a collective inference step to enforce transitivity of the predictions. By restricting the classifier to pairwise predictions, standard training algorithms for binary classification can be used. However, this approach …
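
A sketch of an error-driven, ranking-style weight update of the flavor described (perceptron-like and illustrative, not the paper's exact training algorithm): when a lower-scoring candidate is actually better under the evaluation measure, move the weights toward its features.

```python
def rank_update(w, feats_better, feats_worse, lr=1.0):
    """Move weights toward the features of the better candidate and away
    from those of the worse one (feature dicts map names to values)."""
    for k in set(feats_better) | set(feats_worse):
        w[k] = w.get(k, 0.0) + lr * (feats_better.get(k, 0.0)
                                     - feats_worse.get(k, 0.0))
    return w
```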