Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 6 of 6
Full-Text Articles in Entire DC Network
Improving Automatic Transcription Using Natural Language Processing, Anna Kiefer
Improving Automatic Transcription Using Natural Language Processing, Anna Kiefer
Master's Theses
Digital Democracy is a CalMatters and California Polytechnic State University initia-
tive to promote transparency in state government by increasing access to the Califor-
nia legislature. While Digital Democracy is made up of many resources, one founda-
tional step of the project is obtaining accurate, timely transcripts of California Senate
and Assembly hearings. The information extracted from these transcripts provides
crucial data for subsequent steps in the pipeline. In the context of Digital Democracy,
upleveling is when humans verify, correct, and annotate the transcript results after
the legislative hearings have been automatically transcribed. The upleveling process
is done with the …
Legislative Language For Success, Sanjana Gundala
Legislative Language For Success, Sanjana Gundala
Master's Theses
Legislative committee meetings are an integral part of the lawmaking process for local and state bills. The testimony presented during these meetings is a large factor in the outcome of the proposed bill. This research uses Natural Language Processing and Machine Learning techniques to analyze testimonies from California Legislative committee meetings from 2015-2016 in order to identify what aspects of a testimony makes it successful. A testimony is considered successful if the alignment of the testimony matches the bill outcome (alignment is "For" and the bill passes or alignment is "Against" and the bill fails). The process of finding what …
Extractive Text Summarization With Deep Learning, Garrett G. Chan
Extractive Text Summarization With Deep Learning, Garrett G. Chan
Computer Engineering
This project explores extractive text summarization using the capabilities of Deep Learning. The goal of this project is to create an application with a neural network to take in text as its input, and create a summary that is a shorter, condensed version of the input text. This has been implemented in Python by configuring and training a neural network that takes in a vector of features that are extracted from the text using various Natural Language Processing libraries. The implementation demonstrates that we can train simple deep neural networks to successfully summarize text.
Predicting Music Genre Preferences Based On Online Comments, Andrew J. Sinclair
Predicting Music Genre Preferences Based On Online Comments, Andrew J. Sinclair
Master's Theses
Communication Accommodation Theory (CAT) states that individuals adapt to each other’s communicative behaviors. This adaptation is called “convergence.” In this work we explore the convergence of writing styles of users of the online music distribution plat- form SoundCloud.com. In order to evaluate our system we created a corpus of over 38,000 comments retrieved from SoundCloud in April 2014. The corpus represents comments from 8 distinct musical genres: Classical, Electronic, Hip Hop, Jazz, Country, Metal, Folk, and World. Our corpus contains: short comments, frequent misspellings, little sentence struc- ture, hashtags, emoticons, and URLs. We adapt techniques used by researchers analyzing other …
Csc Senior Project: Nlpstats, Michael Mease
Csc Senior Project: Nlpstats, Michael Mease
Computer Science and Software Engineering
Natural Language Processing has recently increased in popularity. The field of authorship analysis, specifically, uses various characteristics of text quantified by markers. NLPStats serves as a tool designed to streamline marker extraction based on user needs. A flexible query system allows for custom marker requests, adjustment of result formatting, and preprocessing options. Furthermore, an efficiently designed structure ensures that users retrieve information quickly. As a whole, NLPStats enables anyone, regardless of NLP experience, to extract important information about the text of a document.
Spoons: Netflix Outage Detection Using Microtext Classification, Eriq A. Augusitne
Spoons: Netflix Outage Detection Using Microtext Classification, Eriq A. Augusitne
Master's Theses
Every week there are over a billion new posts to Twitter services and many of those messages contain feedback to companies about their services. One company that recognizes this unused source of information is Netflix. That is why Netflix initiated the development of a system that lets them respond to the millions of Twitter and Netflix users that are acting as sensors and reporting all types of user visible outages. This system enhances the feedback loop between Netflix and its customers by increasing the amount of customer feedback that Netflix receives and reducing the time it takes for Netflix to …