Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Physical Sciences and Mathematics (25)
- Engineering (23)
- Computer Sciences (22)
- Artificial Intelligence and Robotics (17)
- Computer Engineering (11)
-
- Data Science (8)
- Social and Behavioral Sciences (7)
- Electrical and Computer Engineering (5)
- Other Computer Sciences (5)
- Computational Engineering (4)
- Other Computer Engineering (4)
- Databases and Information Systems (3)
- Industrial Engineering (3)
- Operations Research, Systems Engineering and Industrial Engineering (3)
- Signal Processing (3)
- Software Engineering (3)
- Theory and Algorithms (3)
- American Politics (2)
- Bioinformatics (2)
- Environmental Sciences (2)
- Life Sciences (2)
- Linguistics (2)
- Other Electrical and Computer Engineering (2)
- Political Science (2)
- Statistics and Probability (2)
- Aerospace Engineering (1)
- Applied Behavior Analysis (1)
- Applied Statistics (1)
- Arts and Humanities (1)
- Automotive Engineering (1)
- Institution
Articles 61 - 61 of 61
Full-Text Articles in Entire DC Network
Classification Of Web Pages In Yioop With Active Learning, Shawn Cameron Tice
Classification Of Web Pages In Yioop With Active Learning, Shawn Cameron Tice
Master's Theses
This thesis project augments the Yioop search engine with a general facility for automatically assigning "class" meta words (e.g., "class:advertising") to web pages based on the output of a logistic regression text classifier. Users can create multiple classifers using Yioop's web-based interface, each trained first on a small set of labeled documents drawn from previous crawls then improved over repeated rounds of active learning using density-weighted pool-based sampling.
The classification system's accuracy when classifying new documents was found to be comparable to published results for a common dataset, approaching 82% for a corpus of advertisements to be filtered from content-providers' …