Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Physical Sciences and Mathematics (184)
- Social and Behavioral Sciences (108)
- Engineering (102)
- Statistics and Probability (92)
- Computer Sciences (46)
-
- Business (45)
- Applied Statistics (42)
- Medicine and Health Sciences (38)
- Life Sciences (37)
- Education (33)
- Economics (32)
- Statistical Models (26)
- Electrical and Computer Engineering (22)
- Computer Engineering (21)
- Mathematics (21)
- Operations Research, Systems Engineering and Industrial Engineering (20)
- Statistical Methodology (19)
- Statistical Theory (19)
- Applied Mathematics (17)
- Psychology (15)
- Civil and Environmental Engineering (14)
- Data Science (13)
- Arts and Humanities (12)
- Econometrics (12)
- Industrial Engineering (12)
- Public Health (12)
- Business Analytics (11)
- Science and Technology Studies (11)
- Artificial Intelligence and Robotics (10)
- Earth Sciences (10)
- Institution
-
- University of Wollongong (21)
- Utah State University (21)
- University of Nebraska - Lincoln (18)
- Wayne State University (16)
- Selected Works (14)
-
- University of South Florida (14)
- University of Kentucky (12)
- TÜBİTAK (11)
- Western University (11)
- Technological University Dublin (10)
- COBRA (9)
- Old Dominion University (9)
- Western Kentucky University (9)
- Air Force Institute of Technology (8)
- City University of New York (CUNY) (8)
- Georgia Southern University (8)
- University of Louisville (8)
- University of Massachusetts Amherst (8)
- University of New Hampshire (8)
- Clemson University (7)
- SelectedWorks (7)
- University of Vermont (7)
- Association of Arab Universities (6)
- Southern Methodist University (6)
- University of Arkansas, Fayetteville (6)
- University of South Carolina (6)
- American University in Cairo (5)
- Brigham Young University (5)
- California Polytechnic State University, San Luis Obispo (5)
- Embry-Riddle Aeronautical University (5)
- Publication Year
- Publication
-
- Electronic Theses and Dissertations (22)
- Theses and Dissertations (20)
- All Graduate Theses and Dissertations, Spring 1920 to Summer 2023 (11)
- Dissertations (11)
- USF Tampa Graduate Theses and Dissertations (11)
-
- Journal of Modern Applied Statistical Methods (10)
- Faculty Publications (9)
- Honors Theses (8)
- Faculty of Engineering and Information Sciences - Papers: Part A (7)
- Honors Theses and Capstones (7)
- Electronic Thesis and Dissertation Repository (6)
- SMU Data Science Review (6)
- Turkish Journal of Electrical Engineering and Computer Sciences (6)
- Applied Mathematics & Information Sciences (5)
- FIU Electronic Theses and Dissertations (5)
- International Journal of Exercise Science: Conference Proceedings (5)
- Theses (5)
- U.C. Berkeley Division of Biostatistics Working Paper Series (5)
- All Theses (4)
- Graduate Theses and Dissertations (4)
- IGC Proceedings (1997-2023) (4)
- Jeff L Yates (4)
- Publications (4)
- Publications and Research (4)
- Williams Honors College, Honors Research Projects (4)
- All Graduate Plan B and other Reports, Spring 1920 to Spring 2023 (3)
- Antioch University Dissertations & Theses (3)
- Applications and Applied Mathematics: An International Journal (AAM) (3)
- Cowles Foundation Discussion Papers (3)
- Dissertations, Master's Theses and Master's Reports (3)
Articles 1 - 30 of 516
Full-Text Articles in Entire DC Network
Examining The Interaction Between Calcium Supplement Use, Demographics, And Lifestyle Factors On Bone Health In Women, Vix Talbot
University Honors Theses
Osteoporosis is a condition which poses a significant health threat, particularly among women during the menopause transition, where accelerated bone loss increases fracture risk. Calcium supplementation has been shown to be an important intervention to mitigate bone mineral density (BMD) decline during this and other periods of life. However, the efficacy of calcium supplementation is influenced by various individual factors, including demographics and lifestyle habits. This study investigates the interaction between calcium supplement use, and several interaction terms on bone health in women. Multiple linear regression analysis is employed to assess the impact of these factors on BMD. Data from …
Developing Guidelines For The Use Of Lightweight Materials In Culvert Preservation, Charlie Sun, Kean H. Ashurst Jr.
Developing Guidelines For The Use Of Lightweight Materials In Culvert Preservation, Charlie Sun, Kean H. Ashurst Jr.
Kentucky Transportation Center Research Report
This study addresses challenges that arise during highway embankment construction on road widening projects when additional fill is placed above existing culverts. Researchers conducted reduced-scale model laboratory tests to simulate culvert behavior under different loading conditions and to determine how well lightweight materials (LWMs) perform when subjected to changing loads. Testing showed that proximity of LWM to the culvert’s top surface strongly influences the magnitude of strain reductions or increases. Placing LWMs with relatively low elastic modulus closer to the culvert’s top surface led to reduced culvert ceiling strain and increased culvert wall strain. LWMs with relatively high elastic modulus …
The Effect Of Remittances On Educational Outcome In Uganda, Kaity Chen
The Effect Of Remittances On Educational Outcome In Uganda, Kaity Chen
Economics Student Theses and Capstone Projects
This study investigates the impact of remittances received on the highest level of education completed by the household members in Uganda. The results of our multinomial logistic regression analysis that uses survey data from Uganda in 2010 indicate that, after controlling for other variables, our independent variable—the total amount of remittances received—is only a significant predictor of the three highest level of education categories (Completed secondary education vs Didn’t complete primary education category, Post-secondary diploma vs Didn’t complete primary education category, and Degree and Above Education vs Didn’t complete primary education). This could be due to the fact that households …
Teaching Students To Read Regression Results: A Statistical Literacy Lesson Plan For Librarians, Giovanna Badia
Teaching Students To Read Regression Results: A Statistical Literacy Lesson Plan For Librarians, Giovanna Badia
Transforming Libraries for Graduate Students
Descriptive and inferential statistics are taught to students in many disciplines. More classroom time is often spent on the theory behind different statistical methods that investigate relationships between variables rather than on how to interpret the results obtained to answer the research question that started the process. While statistical software (such as R, Stata, and SPSS) has made it easier to undertake regression with any dataset, the output produced remains challenging to understand and explain to intended audiences. To address this issue, the author created a 90-minute workshop that teaches students how to read tables of descriptive statistics and linear …
Relationships Of Empathy And Color-Blind Attitudes On Counseling Students’ Critical Consciousness, Bagmi Das, Maggie M. Parker, Sarah Litt
Relationships Of Empathy And Color-Blind Attitudes On Counseling Students’ Critical Consciousness, Bagmi Das, Maggie M. Parker, Sarah Litt
Teaching and Supervision in Counseling
A critical piece of counselor education is enhancing counselors’ in training (CITs) multicultural competence. Concepts included in CIT cultural development include both developing empathy (Constantine, 2001) and dismantling color-blind racial attitudes` (Neville et al., 2013). Thus, this study presents multiple regression to explore the relationships between color blindness, empathy development, and critical consciousness of 166 counseling students. Results indicate that that empathy and color-blind attitudes have associations with some aspects of critical consciousness, but not sociopolitical participation. Implications for counselor education and directions for future research are discussed.
Because It’S Worth It: Why Schools Violate Ncaa Rules And The Impact Of Getting Caught In Division I Basketball, Daniel A. Rascher, Andrey Tselikov, Mark S. Nagel, Andrew D. Schwarz
Because It’S Worth It: Why Schools Violate Ncaa Rules And The Impact Of Getting Caught In Division I Basketball, Daniel A. Rascher, Andrey Tselikov, Mark S. Nagel, Andrew D. Schwarz
Journal of Issues in Intercollegiate Athletics
No abstract provided.
Functional Data Learning Using Convolutional Neural Networks, Jose Galarza, Tamer Oraby
Functional Data Learning Using Convolutional Neural Networks, Jose Galarza, Tamer Oraby
School of Mathematical and Statistical Sciences Faculty Publications and Presentations
In this paper, we show how convolutional neural networks (CNNs) can be used in regression and classification learning problems for noisy and non-noisy functional data (FD). The main idea is to transform the FD into a 28 by 28 image. We use a specific but typical architecture of a CNN to perform all the regression exercises of parameter estimation and functional form classification. First, we use some functional case studies of FD with and without random noise to showcase the strength of the new method. In particular, we use it to estimate exponential growth and decay rates, the bandwidths of …
The Conviction Of Miss Prediction, Dane C. Joseph
The Conviction Of Miss Prediction, Dane C. Joseph
Journal of Humanistic Mathematics
Miss Prediction is questioned in a court of law over her involvement in the mischaracterization of linear models when they were inappropriate.
An Examination Of The Determinants Of Ease Of Doing Business: Perspectives From World Economies, Omar Eldarawy
An Examination Of The Determinants Of Ease Of Doing Business: Perspectives From World Economies, Omar Eldarawy
Theses and Dissertations
This thesis’ purpose is to explore the correlation between Ease of Doing Business sub-scores and the overall Ease of Doing Business score. The study includes a detailed quantitative study into the previously mentioned relationship. The Ease of Doing Business sub-scores comprise of scores representing starting a business, dealing with construction permits, getting electricity, registering property, getting credit, protecting minority investors, paying taxes, trading across borders, enforcing contracts, and resolving insolvency. This is conducted via a series of diagnostic tests to identify an appropriate regression model. These tests include the Hausman Test, RESET Test, and Breusch-Pagan Test. Based on the tests, …
An Examination Of The Determinants Of Ease Of Doing Business: Perspectives From World Economies, Omar Eldarawy
An Examination Of The Determinants Of Ease Of Doing Business: Perspectives From World Economies, Omar Eldarawy
Theses and Dissertations
This thesis’ purpose is to explore the correlation between Ease of Doing Business sub-scores and the overall Ease of Doing Business score. The study includes a detailed quantitative study into the previously mentioned relationship. The Ease of Doing Business sub-scores comprise of scores representing starting a business, dealing with construction permits, getting electricity, registering property, getting credit, protecting minority investors, paying taxes, trading across borders, enforcing contracts, and resolving insolvency. This is conducted via a series of diagnostic tests to identify an appropriate regression model. These tests include the Hausman Test, RESET Test, and Breusch-Pagan Test. Based on the tests, …
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Rui Xu
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Rui Xu
Graduate College Dissertations and Theses
Why doesn’t a patient show up for their appointment? Is it too far? Are there too many appointments? Or something else? Urine drug testing clinics often observe patient scheduled visit absenteeism, and this can be used as a data source to answer our questions and explore other potential correlations between factors. With a well-developed electronic health data system, a retrospective study was performed on a large data set collected between January 2019 and December 2021 across the U.S. and more than half a million patient encounters; it contained nearly a year of quarantine, and pandemic status was also analyzed as …
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Rui Xu
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Rui Xu
Graduate College Dissertations and Theses
Why doesn’t a patient show up for their appointment? Is it too far? Are there too many appointments? Or something else? Urine drug testing clinics often observe patient scheduled visit absenteeism, and this can be used as a data source to answer our questions and explore other potential correlations between factors. With a well-developed electronic health data system, a retrospective study was performed on a large data set collected between January 2019 and December 2021 across the U.S. and more than half a million patient encounters; it contained nearly a year of quarantine, and pandemic status was also analyzed as …
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Sherry Xu
Evaluation And Modeling Of Factors Associated With Urine Drug Testing Appointment Absences, Sherry Xu
Graduate College Dissertations and Theses
Why doesn’t a patient show up for their appointment? Is it too far? Are there too many appointments? Or something else? Urine drug testing clinics often observe patient scheduled visit absenteeism, and this can be used as a data source to answer our questions and explore other potential correlations between factors. With a well-developed electronic health data system, a retrospective study was performed on a large data set collected between January 2019 and December 2021 across the U.S. and more than half a million patient encounters; it contained nearly a year of quarantine, and pandemic status was also analyzed as …
Predicting Future States With Spatial Point Processes In Single Molecule Resolution Spatial Transcriptomics, Biraaj Rout
Predicting Future States With Spatial Point Processes In Single Molecule Resolution Spatial Transcriptomics, Biraaj Rout
Computer Science and Engineering Theses
In this thesis, we present an innovative framework centered around the application of Random Forest Regression to forecast the prospective distribution of cells expressing the Sog-D gene (active cells) during the embryogenesis process in Drosophila. Our methodology specifically targets the Anterior-to-posterior (AP) and Dorsal-to-Ventral (DV) axes, unraveling the intricacies of gene expression control in living organisms at super-resolution, single-molecule resolution through whole embryo spatial transcriptomics imaging. The Random Forest Regression model serves as a pivotal tool in predicting the succeeding stage’s active cell distribution, capitalizing on the insights obtained from the preceding stage. We integrate temporally resolved, spatial point processes …
Forecasting The Outcome Of Nfl Playoff Games: A Regression Analysis, Jack Pierpont Morgan V
Forecasting The Outcome Of Nfl Playoff Games: A Regression Analysis, Jack Pierpont Morgan V
UVM Patrick Leahy Honors College Senior Theses
Professional sports are one of the most consumed forms of entertainment in the world today. Professional sporting events are some of the most watched broadcasts worldwide each year. The 2022 FIFA World Cup Final garnered about 1.5 billion views worldwide, almost 20% of our planet’s population (Jones, 2023). The National Football League is the most popular professional sport in the United States. Recent polling data shows that a clear majority of the country, 72% of Americans, self-identify as football fans (“St. Bonaventure”, 2023). The NFL runs from September to February and regularly draws 15-20 million viewers weekly during …
Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles
Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles
Honors Theses and Capstones
With the analytics revolution in sports in the past 20 years, it seems that everything that can be quantified is. In basketball though, trying to break the game down into a set of numbers comes with a unique problem. While we've come up with a good set of advanced numbers to measure offensive efficiency, defense is fundamentally harder to quantify. The game is played five on five, but it has often been popular or convenient to model defense as a set of five one on one games. As defenses became more complex into the 2010s, this methodology became more insignificant. …
The Regression Of The Flood In Virginia, James C. Rakestraw, Jim Melnick
The Regression Of The Flood In Virginia, James C. Rakestraw, Jim Melnick
Proceedings of the International Conference on Creationism
The geology, tectonics, and hydraulics of the regression of the Flood formed much of the geomorphology of Virginia. Opportunities to view and study geology and geomorphology are available through visiting parks, traveling on public roads, and viewing geographic information system (GIS) resources.
Virginia is part of the North American Plate. A series of “blocks” of basement rocks within the plate underlie the geomorphological provinces of Virginia. These “blocks” form a series of steps between the Atlantic Ocean Basin and the Blue Ridge. The “Fall Line” found in Virginia is a fault between two blocks of basement rock. The basement rocks …
Unlucky Child, Adam Sana
The Use Of Regularization To Detect Racial Inequities In Pay Equity Studies: An Empirical Study And Reflections On Regulation Methods, Christopher M. Peña
The Use Of Regularization To Detect Racial Inequities In Pay Equity Studies: An Empirical Study And Reflections On Regulation Methods, Christopher M. Peña
Electronic Theses and Dissertations
Since the late 1970s, multiple linear regression has been the preferred method for identifying discrimination in pay. An empirical study on this topic was conducted using quantitative critical methods. A literature review first examined conflicting views on using multiple linear regression in pay equity studies. The review found that multiple linear regression is used so prevalently in pay equity studies because the courts and practitioners have widely accepted it and because of its simplicity and ability to parse multiple sources of variance simultaneously. Commentaries in the literature cautioned about errors in model specification, the use of tainted variables, and the …
Corruption Perceptions During The Pandemic, Linh Phuong Thao Nguyen, Guangjun Qu
Corruption Perceptions During The Pandemic, Linh Phuong Thao Nguyen, Guangjun Qu
Annual Student Research Poster Session
This study delves into the response of corruption perception indices to the COVID-19 pandemic. We investigate whether a global shift in corruption indices occurred post-pandemic compared to pre-pandemic levels. Additionally, we assess changes in standard errors of these indices before and after the pandemic to gauge shifts in consensus among people regarding corruption levels of a country. Given the WGI-CC's lack of year-to-year comparability across countries, we recalculated WGI-CC standard errors using methods akin to TI-CPI score calculations. Subsequently, we employ regression analysis, incorporating independent variables such as population, GDP, education, and political regime to explore whether changes in standard …
Structure Property Performance Of Ti3c2 Mxene/Polyelectrolyte Hybrids For Electronic And Electromagnetic Applications, Farivash Gholamirad
Structure Property Performance Of Ti3c2 Mxene/Polyelectrolyte Hybrids For Electronic And Electromagnetic Applications, Farivash Gholamirad
Theses and Dissertations
Hybrid materials based on transition metal carbide MXene (Ti3C2Tx) nanosheets have great potential for electronic, electromagnetic interference (EMI) shielding, and environmental applications due to the unique combination of considerable electrical conductivity of 4000 to 15000 S.cm-1, abundant surface functional groups (OH, O, F, Cl), and suitable mechanical properties. However, the performance of final products depends not only on the properties of constituent components but also on the morphology of the assembly. The strong repulsive electrical double layer forces among MXene nanosheets make control over their assembly morphology and final properties challenging. To address this challenge, this dissertation focuses on applying …
Predicting Dynamic Fragmentation Characteristics From High-Impact Energy Events Utilizing Terrestrial Static Arena Test Data And Machine Learning, Katharine Larsen, Riccardo Bevilacqua, Omkar S. Mulekar, Elisabetta L. Jerome, Thomas J. Hatch-Aguilar
Predicting Dynamic Fragmentation Characteristics From High-Impact Energy Events Utilizing Terrestrial Static Arena Test Data And Machine Learning, Katharine Larsen, Riccardo Bevilacqua, Omkar S. Mulekar, Elisabetta L. Jerome, Thomas J. Hatch-Aguilar
Student Works
To continue space operations with the increasing space debris, accurate characterization of fragment fly-out properties from hypervelocity impacts is essential. However, with limited realistic experimentation and the need for data, available static arena test data, collected utilizing a novel stereoscopic imaging technique, is the primary dataset for this paper. This research leverages machine learning methodologies to predict fragmentation characteristics using combined data from this imaging technique and simulations, produced considering dynamic impact conditions. Gaussian mixture models (GMMs), fit via expectation maximization (EM), are used to model fragment track intersections on a defined surface of intersection. After modeling the fragment distributions, …
Assessing The Effect Of Fintech Adoption On Country's Productivity, Mai Metwally
Assessing The Effect Of Fintech Adoption On Country's Productivity, Mai Metwally
Theses and Dissertations
The relationship between total factor productivity of countries, for both low-income and high-income countries, and Fintech adoption will be examined in this paper. Also, a background on Fintech history will be discussed and explored briefly along with Fintech future risks and opportunities. Starting off with the importance of TFP, it is also known to be "Solow residual" (named after American Economist "Robert Solow"). TFP shows and examines the performance along with efficiency of the entity or country. It shows how well and efficient the firm or country in transforming its inputs to the desired outputs. Moreover, it is the ratio …
Workforce Management For Salik Call Center, Alaa Seder, Rawan Alnasser
Workforce Management For Salik Call Center, Alaa Seder, Rawan Alnasser
Theses
Salik is the Dubai Roads and Transport Authority’s (RTA) automated toll collection system. Salik uses tag technology that must be registered for each vehicle either online or through an authorized dealer. With this tag, you can drive freely in the Emirate without having to stop at any toll booth. Salik means "open" or "clear" in Arabic, meaning that there are no toll booths, barriers, or physical gates. As a customer- focused organization, RTA Salik can only thrive if we truly understand the needs of our customers and provide them with the service, they want using the highest quality standards.
Prediction …
The Effect Of Yearly Labor Earnings On Commute Time To Work In South Carolina, Peter Trela
The Effect Of Yearly Labor Earnings On Commute Time To Work In South Carolina, Peter Trela
All Theses
In this paper, I attempt to ascertain the effect of labor earnings on commute time to work for individuals in South Carolina by using ACS 1-year Public Use Microdata Sample Estimates. First, I use standard linear regression models with controls to determine the direction and magnitude of the association between yearly labor earnings and commute time to work. I later use standard linear regression models with limited controls to determine how the association between yearly labor earnings and commute time changes before and during the events of the COVID-19 pandemic. There exists a positive relationship between yearly labor earnings and …
Data-Driven Air Quality And Environmental Evaluation For Cattle Farms, Jennifer Hu, Rushikesh Jagtap, Rishikumar Ravichandran, Chitra Priyaa Sathya Moorthy, Nataliya Sobol, Jane Wu, Jerry Gao
Data-Driven Air Quality And Environmental Evaluation For Cattle Farms, Jennifer Hu, Rushikesh Jagtap, Rishikumar Ravichandran, Chitra Priyaa Sathya Moorthy, Nataliya Sobol, Jane Wu, Jerry Gao
Faculty Research, Scholarly, and Creative Activity
The expansion of agricultural practices and the raising of animals are key contributors to air pollution. Cattle farms contain hazardous gases, so we developed a cattle farm air pollution analyzer to count the number of cattle and provide comprehensive statistics on different air pollutant concentrations based on severity over various time periods. The modeling was performed in two parts: the first stage focused on object detection using satellite data of farm images to identify and count the number of cattle; the second stage predicted the next hour air pollutant concentration of the seven cattle farm air pollutants considered. The output …
Examining Model Complexity's Effects When Predicting Continuous Measures From Ordinal Labels, Mckade S. Thomas
Examining Model Complexity's Effects When Predicting Continuous Measures From Ordinal Labels, Mckade S. Thomas
All Graduate Theses and Dissertations, Spring 1920 to Summer 2023
Many real world problems require the prediction of ordinal variables where the values are a set of categories with an ordering to them. However, in many of these cases the categorical nature of the ordinal data is not a desirable outcome. As such, regression models treat ordinal variables as continuous and do not bind their predictions to discrete categories. Prior research has found that these models are capable of learning useful information between the discrete levels of the ordinal labels they are trained on, but complex models may learn ordinal labels too closely, missing the information between levels. In this …
The 2015 Ncaa Cost-Of-Attendance Stipend And Its Effects On Institutional Financial Aid Packages, Sara Greene
The 2015 Ncaa Cost-Of-Attendance Stipend And Its Effects On Institutional Financial Aid Packages, Sara Greene
Honors Theses
In 2015, the National Collegiate Athletic Association (NCAA) allowed “Cost of Attendance” (COA) stipends to be offered to athletic recruits for Division I schools. These stipends are intended to allow schools to grant aid to student-athletes beyond a full-ride scholarship to cover additional costs imposed on student-athletes. These stipends created an opportunity for the “Autonomy” Power 5 programs to utilize a competitive tactic to try to win over the top recruits. There is evidence that these COA stipends have caused an increase in the estimated cost of attendance reported by the university. This paper examines if the COA stipends have …
Development Of Models For The Velocity-Pressure Gradient Correlations In Incompressible Wall-Bounded Planar Turbulent Flows Using Multiple Linear Regression, Juampablo E. Heras Rivera
Development Of Models For The Velocity-Pressure Gradient Correlations In Incompressible Wall-Bounded Planar Turbulent Flows Using Multiple Linear Regression, Juampablo E. Heras Rivera
Mechanical Engineering ETDs
In this thesis, the multiple linear regression method is applied to clarify terms and their coefficients in data-driven models for velocity/pressure-gradient (VPG) correlations in the Reynolds stress transport equations. Additionally, a method for developing universal linear models for the VPG correlations with unchanging coefficients when complex conditions arise in a flow is introduced. The generated models were assessed using residual analysis to ensure an appropriate level of accuracy. Data from direct numerical simulation in an incompressible fully-developed turbulent channel flow at Re_tau = 392 with an adverse pressure gradient is used in the study as the data source.
Continuous Semi-Supervised Nonnegative Matrix Factorization, Michael R. Lindstrom, Xiaofu Ding, Feng Liu, Anand Somayajula, Deanna Needell
Continuous Semi-Supervised Nonnegative Matrix Factorization, Michael R. Lindstrom, Xiaofu Ding, Feng Liu, Anand Somayajula, Deanna Needell
School of Mathematical and Statistical Sciences Faculty Publications and Presentations
Nonnegative matrix factorization can be used to automatically detect topics within a corpus in an unsupervised fashion. The technique amounts to an approximation of a nonnegative matrix as the product of two nonnegative matrices of lower rank. In certain applications it is desirable to extract topics and use them to predict quantitative outcomes. In this paper, we show Nonnegative Matrix Factorization can be combined with regression on a continuous response variable by minimizing a penalty function that adds a weighted regression error to a matrix factorization error. We show theoretically that as the weighting increases, the regression error in training …