Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons



Full-Text Articles in Physical Sciences and Mathematics

Whisk: Web Hosted Information Into Summarized Knowledge, Jiewen Wu Jul 2016


Master's Theses

Today’s online content increases at an alarming rate that exceeds users’ ability to consume it. Modern search techniques allow users to enter keyword queries to find content they wish to see. However, such techniques break down when users freely browse the internet without knowing exactly what they want. Users may have to invest an unnecessarily long time reading content to see if they are interested in it. Automatic text summarization helps relieve this problem by creating synopses that significantly reduce the text while preserving the key points. Steffen Lyngbaek created the SPORK summarization pipeline to solve the content overload …


Skewer: Sentiment Knowledge Extraction With Entity Recognition, Christopher James Wu Jun 2016


Master's Theses

The California state legislature introduces approximately 5,000 new bills each legislative session. While the legislative hearings are recorded on video, the recordings are not easily accessible to the public. The lack of official transcripts or summaries also increases the effort required to gain meaningful insight from those recordings. Therefore, the news media and the general population are largely oblivious to what transpires during legislative sessions.

Digital Democracy, a project started by the Cal Poly Institute for Advanced Technology and Public Policy, is an online platform created to bring transparency to the California legislature. It features a searchable database of state …


Concatenative Synthesis For Novel Timbral Creation, James Eric Bilous Jun 2016


Master's Theses

Modern-day musicians rely on a variety of instruments for musical expression. Tones produced from electronic instruments have become almost as commonplace as those produced by traditional ones, as evidenced by the plethora of artists who can be found composing and performing with nothing more than a personal computer. This desire to embrace technical innovation as a means to augment performance art has created a budding field in computer science that explores the creation and manipulation of sound for artistic purposes. One facet of this new frontier concerns timbral creation, or the development of new sounds with unique characteristics that …


Automating Self Evaluations For Software Engineers, Jonathan Rodrigo A. Miranda Jun 2016


Master's Theses

Software engineers frequently compose self-evaluations as part of employee performance reviews. These evaluations can be a key artifact for assessing a software engineer’s contributions to a team and organization, and for generating useful feedback. Self-evaluations can be challenging because a) they can be time-consuming, b) individuals may forget about important contributions, especially when the review period is long, such as a full year, c) some individuals can consciously or unconsciously overstate their contributions, and d) some individuals can be reluctant to describe their contributions for fear of appearing too proud [24].

UNBIASED, Useful New Basic Interactive Automated …


Categorizing Blog Spam, Brandon Bevans Jun 2016


Master's Theses

The internet has matured into the focal point of our era. Its ecosystem is vast, complex, and in many regards unaccounted for. One of the most prevalent aspects of the internet is spam. Similar to the rest of the internet, spam has evolved from simply meaning ‘unwanted emails’ to a blanket term that encompasses any unsolicited or illegitimate content that appears in the wide range of media that exists on the internet.

Many forms of spam permeate the internet, and spam architects continue to develop tools and methods to avoid detection. On the other side, cyber security engineers continue to …


Predicting Changes To Source Code, Justin James Roll Apr 2016


Master's Theses

Organizations typically use issue tracking systems (ITS) such as Jira to plan software releases and assign requirements to developers. They also typically use source control management (SCM) repositories such as Git to track historical changes to a codebase. These ITS and SCM repositories contain valuable data that remains largely untapped. As developers churn through an organization, it becomes expensive for developers to spend time determining which software artifact must be modified to implement a requirement. In this work we created, developed, tested, and evaluated a tool called Class Change Predictor (CCP) for predicting which class will implement …


Reducing Costs In Human Assisted Speech Transcription, Justin Rovin Mar 2016


Master's Theses

The only official documentation of the lawmaking process at the California Legislature is unedited video recordings of committee hearings, bill texts, votes and analyses. While the bills resulting from these hearings are clear, using video recordings to understand how a bill was created is far too laborious for the average citizen. To increase public transparency, a service that provides easier access to the bill creation process was needed. In response to this need, the Digital Democracy initiative was established at Cal Poly by the Honorable Sam Blakeslee, former California State Senator and founder of the Institute for Advanced Technology and …


Metrics Dashboard Services: A Framework For Analyzing Free/Open Source Team Repositories, Fnu Shilpika Jan 2016


Master's Theses

Software engineering as practiced today (especially in industry) is no longer about the stereotypical monolithic life cycle processes (e.g., waterfall, spiral) found in most software engineering textbooks. These heavyweight methods have historically impeded progress for small and medium-sized development teams, owing to their inherent complexity and the rather limited data collection strategies that predominated from the 1980s until the mid-2000s. The discipline and practice of software engineering includes software quality, which has an established theoretical foundation for software metrics. Software metrics are a critical tool that provides continuous insight into products and processes and helps build reliable …


Activity Recognition For Incomplete Spinal Cord Injury Subjects Using Hidden Markov Models, Pichleap Sok Jan 2016


Master's Theses

Successful activity recognition in patients with motor disabilities can improve patient care by providing researchers and clinicians with valuable information on patient movements and quality of life in real-world settings. Understanding the everyday activities of patients is important for rehabilitation. For researchers, having convenient, objective, and continuous data can drastically improve outcome measures to better compare therapies, and ultimately make recommendations. For clinicians, individual assessment of compliance and outcomes outside the clinic can be more objective, permitting much more tailored recommendations to patients. Most importantly, for individual patients, activity recognition can make this improved health care possible by simply having …


Spest – A Tool For Specification-Based Testing, Corrigan Redford Johnson Jan 2016


Master's Theses

This thesis presents a tool for SPEcification based teSTing (SPEST). SPEST is designed to use well-known practices for automated black-box testing to reduce the burden of testing on developers. The tool uses a simple formal specification language to generate highly readable unit tests that embody best practices for thorough software testing. Because the specification language used to generate the assertions about the code can be compiled, it can also be used to ensure that documentation describing the code is maintained during development and refactoring.

The utility and effectiveness of SPEST were validated through several experiments conducted with students in …