Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Social and Behavioral Sciences (1382)
- Statistical Theory (1180)
- Life Sciences (628)
- Agriculture (496)
- Statistical Models (393)
-
- Applied Mathematics (369)
- Mathematics (294)
- Statistical Methodology (283)
- Biostatistics (186)
- Multivariate Analysis (142)
- Law (133)
- Medicine and Health Sciences (131)
- Business (128)
- Engineering (117)
- Longitudinal Data Analysis and Time Series (114)
- Probability (107)
- Computer Sciences (106)
- Economics (96)
- Education (88)
- Categorical Data Analysis (87)
- Other Statistics and Probability (83)
- Econometrics (76)
- Data Science (66)
- Environmental Sciences (64)
- Design of Experiments and Sample Surveys (54)
- Sociology (49)
- Other Applied Mathematics (48)
- Institution
-
- Wayne State University (1096)
- Kansas State University Libraries (490)
- Selected Works (183)
- Wright State University (150)
- SelectedWorks (123)
-
- Utah State University (98)
- University of Pennsylvania (78)
- Cornell University Law School (75)
- Montclair State University (67)
- Western Kentucky University (43)
- Western University (41)
- California Polytechnic State University, San Luis Obispo (40)
- Old Dominion University (39)
- University of Kentucky (39)
- University of Arkansas, Fayetteville (36)
- University of Richmond (34)
- University of Nebraska - Lincoln (33)
- University of Massachusetts Amherst (32)
- Kennesaw State University (31)
- Southern Methodist University (31)
- Air Force Institute of Technology (29)
- City University of New York (CUNY) (29)
- University of Tennessee, Knoxville (27)
- Virginia Commonwealth University (27)
- Claremont Colleges (24)
- Illinois State University (23)
- University of Nevada, Las Vegas (19)
- Georgia Southern University (18)
- East Tennessee State University (17)
- Technological University Dublin (17)
- Keyword
-
- Statistics (97)
- Empirical legal studies (72)
- Simulation (46)
- Bias (40)
- Regression (38)
-
- Bootstrap (33)
- Power (33)
- Logistic regression (31)
- Monte Carlo simulation (31)
- Capital punishment (29)
- Mathematics and Statistics (29)
- Confidence interval (28)
- Death penalty (27)
- Reliability (27)
- Western Kentucky University (26)
- Bayesian (25)
- Mean squared error (24)
- Sample size (24)
- Capital Jury Project (23)
- Machine learning (23)
- Multicollinearity (23)
- Robustness (22)
- Type I error (22)
- Estimation (21)
- Maximum likelihood (21)
- Monte Carlo (21)
- Nonparametric (21)
- Variance (21)
- CJP (20)
- Confidence intervals (20)
- Publication Year
- Publication
-
- Journal of Modern Applied Statistical Methods (1092)
- Conference on Applied Statistics in Agriculture (489)
- Mathematics and Statistics Faculty Publications (121)
- Cornell Law Faculty Publications (75)
- Department of Applied Mathematics and Statistics Faculty Scholarship and Creative Works (64)
-
- Statistics Papers (59)
- Electronic Theses and Dissertations (50)
- All Graduate Theses and Dissertations (46)
- Doctoral Dissertations (45)
- Theses and Dissertations (44)
- All Graduate Plan B and other Reports (41)
- Electronic Thesis and Dissertation Repository (36)
- Joseph M Hilbe (35)
- Math and Computer Science Faculty Publications (34)
- Graduate Theses and Dissertations (28)
- Yi Li (28)
- WKU Archives Records (26)
- Douglas G. Steigerwald (24)
- SMU Data Science Review (24)
- Mathematics & Statistics Theses & Dissertations (22)
- Symposium of Student Scholars (22)
- Statistics (21)
- Michael Stanley Smith (20)
- Peter Austin (20)
- Theses and Dissertations--Statistics (20)
- Publications and Research (18)
- Annual Symposium on Biomathematics and Ecology Education and Research (17)
- FIU Electronic Theses and Dissertations (16)
- Masters Theses (15)
- Jeffrey S. Morris (13)
- Publication Type
Articles 1 - 30 of 3416
Full-Text Articles in Applied Statistics
A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen
A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen
Academic Chairpersons Conference Proceedings
In this interactive workshop we discuss time and stress management specifically from the perspective of a department chairperson responsible for leading an academic department through numerous internal and external challenges. The focus will be on practical strategies for effective use of time, not only at a personal level, but also at a department wide level.
Two-Stage Approach For Forensic Handwriting Analysis, Ashlan J. Simpson, Danica M. Ommen
Two-Stage Approach For Forensic Handwriting Analysis, Ashlan J. Simpson, Danica M. Ommen
SDSU Data Science Symposium
Trained experts currently perform the handwriting analysis required in the criminal justice field, but this can create biases, delays, and expenses, leaving room for improvement. Prior research has sought to address this by analyzing handwriting through feature-based and score-based likelihood ratios for assessing evidence within a probabilistic framework. However, error rates are not well defined within this framework, making it difficult to evaluate the method and can lead to making a greater-than-expected number of errors when applying the approach. This research explores a method for assessing handwriting within the Two-Stage framework, which allows for quantifying error rates as recommended by …
Session 8: Ensemble Of Score Likelihood Ratios For The Common Source Problem, Federico Veneri, Danica M. Ommen
Session 8: Ensemble Of Score Likelihood Ratios For The Common Source Problem, Federico Veneri, Danica M. Ommen
SDSU Data Science Symposium
Machine learning-based Score Likelihood Ratios have been proposed as an alternative to traditional Likelihood Ratios and Bayes Factor to quantify the value of evidence when contrasting two opposing propositions.
Under the common source problem, the opposing proposition relates to the inferential problem of assessing whether two items come from the same source. Machine learning techniques can be used to construct a (dis)similarity score for complex data when developing a traditional model is infeasible, and density estimation is used to estimate the likelihood of the scores under both propositions.
In practice, the metric and its distribution are developed using pairwise comparisons …
Biasing Estimator To Mitigate Multicollinearity In Linear Regression Model, Abdulrasheed Bello Badawaire, Issam Dawoud, Adewale Folaranmi Lukman, Victoria Laoye, Arowolo Olatunji
Biasing Estimator To Mitigate Multicollinearity In Linear Regression Model, Abdulrasheed Bello Badawaire, Issam Dawoud, Adewale Folaranmi Lukman, Victoria Laoye, Arowolo Olatunji
Al-Bahir Journal for Engineering and Pure Sciences
A new two-parameter estimator was developed to combat the threat of multicollinearity for the linear regression model. Some necessary and sufficient conditions for the dominance of the proposed estimator over ordinary least squares (OLS) estimator, ridge regression estimator, Liu estimator, KL estimator, and some two-parameter estimators are obtained in the matrix mean square error sense. Theory and simulation results show that, under some conditions, the proposed two-parameter estimator consistently dominates other estimators considered in this study. The real-life application result follows suit.
A Statistical Analysis Of The Change In Age Distribution Of Spawning Hatchery Salmon, Rachel Macaulay, Emily Barrett, Grace Penunuri, Eli E. Goldwyn
A Statistical Analysis Of The Change In Age Distribution Of Spawning Hatchery Salmon, Rachel Macaulay, Emily Barrett, Grace Penunuri, Eli E. Goldwyn
Spora: A Journal of Biomathematics
Declines in salmon sizes have been reported primarily as a result of younger maturation rates. This change in age distribution poses serious threats to salmon-dependent peoples and ecological systems. We perform a statistical analysis to examine the change in age structure of spawning Alaskan chum salmon Oncorhynchus keta and Chinook salmon O. tshawytscha using 30 years of hatchery data. To highlight the impacts of this change, we investigate the average number of fry/smolt that each age of spawning chum/Chinook salmon produce. Our findings demonstrate an increase in younger hatchery salmon populations returning to spawn, and fewer amounts of fry produced …
Beyond Statistical Significance: A Holistic View Of What Makes A Research Finding "Important", Jane E. Miller
Beyond Statistical Significance: A Holistic View Of What Makes A Research Finding "Important", Jane E. Miller
Numeracy
Students often believe that statistical significance is the only determinant of whether a quantitative result is “important.” In this paper, I review traditional null hypothesis statistical testing to identify what questions inferential statistics can and cannot answer, including statistical significance, effect size and direction, causality, generalizability, and changeability of the independent variable. I illustrate these issues with examples from an empirical study of the association between how much time teenagers spent playing video games and time spent reading. I describe how study design and context determine each of those aspects of “importance,” and close by summarizing how to provide a …
Nearby Galaxies: Modelling Star Formation Histories And Contamination By Unresolved Background Galaxies, Hadi Papei
Nearby Galaxies: Modelling Star Formation Histories And Contamination By Unresolved Background Galaxies, Hadi Papei
Electronic Thesis and Dissertation Repository
Galaxies are complex systems of stars, gas, dust, and dark matter which evolve over billions of years, and one of the main goals of astrophysics is to understand how these complex systems form and change. Measuring the star formation history of nearby galaxies, in which thousands of stars can be resolved individually, has provided us with a clear picture of their evolutionary history and the evolution of galaxies in general.
In this work, we have developed the first public Python package, SFHPy, to measure star formation histories of nearby galaxies using their colour-magnitude diagrams. In this algorithm, an observed colour-magnitude …
On Partially Observed Tensor Regression, Dinara Miftyakhetdinova
On Partially Observed Tensor Regression, Dinara Miftyakhetdinova
Major Papers
Tensor data is widely used in modern data science. The interest lies in identifying and characterizing the relationship between tensor datasets and external covariates. These datasets, though, are often incomplete. An efficient nonconvex alternating updating algorithm proposed by J. Zhou et al. in the paper "Partially Observed Dynamic Tensor Response Regression" provides a novel approach. The algorithm handles the problem of unobserved entries by solving an optimization problem of a loss function under the low-rankness, sparsity, and fusion constraints. This analysis aims to understand in detail the proposed algorithms and their theoretical proofs with, potentially, dropping some of the assumptions …
Informative Hypothesis For Group Means Comparison, Dr. Teck Kiang Tan
Informative Hypothesis For Group Means Comparison, Dr. Teck Kiang Tan
Practical Assessment, Research, and Evaluation
Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons if the null hypothesis is rejected. As this approach is not able to incorporate order, inequality, and direction into hypothesis testing, and neither does it able to specify multiple hypotheses, this paper introduces the informative hypothesis that allows more flexibility in stating hypothesis testing and is …
Optimal Speed Of A Machine In An Assembly Line Using The Continuous Time Markov Chain Rate Matrix, Chandi Darshani Rupasinghe
Optimal Speed Of A Machine In An Assembly Line Using The Continuous Time Markov Chain Rate Matrix, Chandi Darshani Rupasinghe
Major Papers
The optimal speed of a machine in an assembly line is determined using a Markov decision process type model. We develop the rate matrix that represents the inter-event time of a machine, either repair time or time to breakdown, as a function of speed. We consider the rate of time to breakdown with a variety of functions of speed. We find limiting probabilities and express profit in terms of these probabilities. We then find the optimal speed to maximize profit. Further, we assume an underlying function of speed and simulate data using R. From the simulated data, we estimate the …
Statistical Models For Decision-Making In Professional Soccer, Sean Hellingman
Statistical Models For Decision-Making In Professional Soccer, Sean Hellingman
Theses and Dissertations (Comprehensive)
As soccer is widely regarded as the most popular sport in the world there is high interest in methods of improving team performances. There are many ways teams and individual athletes can influence their own performances during competition. This thesis focuses on developing statistical methodologies for improving competition-based decision-making for soccer so as to allow professional soccer teams to make better informed decisions regarding player selection and in-game decision-making.
To properly capture the dynamic actions of professional soccer, Markov chains with increasing complexity are proposed. These models allow for the inclusion of potential changes in the process caused by goals …
(R1899) Asymptotic Normality Of The Conditional Hazard Function In The Local Linear Estimation Under Functional Mixing Data, Amina Goutal, Boubaker Mechab, Omar Fetitah, Torkia Merouan
(R1899) Asymptotic Normality Of The Conditional Hazard Function In The Local Linear Estimation Under Functional Mixing Data, Amina Goutal, Boubaker Mechab, Omar Fetitah, Torkia Merouan
Applications and Applied Mathematics: An International Journal (AAM)
In this study, we are interested in using the local linear technique to estimate the conditional hazard function for functional dependent data where the scalar response is conditioned by a functional random variable. The asymptotic normality of this constructed estimator is demonstrated under some extreme conditions. Our estimator’s performance is demonstrated through simulations.
Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu
Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu
All Dissertations
This dissertation investigates the functional graphical models that infer the functional connectivity based on neuroimaging data, which is noisy, high dimensional and has limited samples. The dissertation provides two recipes to infer the functional graphical model: 1) a fully Bayesian framework 2) an end-to-end deep model.
We first propose a fully Bayesian regularization scheme to estimate functional graphical models. We consider a direct Bayesian analog of the functional graphical lasso proposed by Qiao et al. (2019).. We then propose a regularization strategy via the graphical horseshoe. We compare both Bayesian approaches to the frequentist functional graphical lasso, and compare the …
Learning From Public Spaces In Historic Cities, Cody Josh Kucharski
Learning From Public Spaces In Historic Cities, Cody Josh Kucharski
Symposium of Student Scholars
Successful public spaces in cities are key for enhancing social cohesion and improving health and safety. Learning from historic cities involves the development of representational and analytical tools aimed at capturing their essence as places of human interaction. The research reports findings of the spatial analysis of twenty Adriatic and Ionian coastal cities, which addresses the question of how the network of public spaces calibrates different degrees of spatial enclosure necessary for creating successful social interactions. Cities in the littoral region include well-preserved historic centers that are renowned for the successful integration of urban squares into the urban fabric. For …
Evaluation Of Circular Logistic Regression Models With Asymmetrical Link Functions, Feridun Tasdan
Evaluation Of Circular Logistic Regression Models With Asymmetrical Link Functions, Feridun Tasdan
Annual Symposium on Biomathematics and Ecology Education and Research
No abstract provided.
Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth Amona
Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth Amona
Annual Symposium on Biomathematics and Ecology Education and Research
No abstract provided.
Improving The Accuracy Of Interactive Voice Response (Ivr) Technology For Pediatric Experience Scores, Elizabeth Spaargaren Ms, Mph, Cpxp, Abigail Kozak Mba, Cpxp, Cara Herbener Cpxp, Barbara Lawlor Burke Ma, Cpxp
Improving The Accuracy Of Interactive Voice Response (Ivr) Technology For Pediatric Experience Scores, Elizabeth Spaargaren Ms, Mph, Cpxp, Abigail Kozak Mba, Cpxp, Cara Herbener Cpxp, Barbara Lawlor Burke Ma, Cpxp
Patient Experience Journal
The increased use of interactive voice response (IVR) in assessing patient and family experience should be paired with evidence-based practices on how to obtain the most accurate information via this survey mode. We added a brief clarification sentence of the survey scale at the start of the IVR call to improve our experience data both qualitatively and quantitatively. Our setting was an urban pediatric hospital. We gathered lived experiences from our patients, families, and providers to understand and design a change to the IVR survey mode that would reduce survey inaccuracies. Outcome measures were assessed by baseline measurement and post-intervention …
Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan
Predicting Insulin Pump Therapy Settings, Riccardo L. Ferraro, David Grijalva, Alex Trahan
SMU Data Science Review
Millions of people live with diabetes worldwide [7]. To mitigate some of the many symptoms associated with diabetes, an estimated 350,000 people in the United States rely on insulin pumps [17]. For many of these people, how effectively their insulin pump performs is the difference between sleeping through the night and a life threatening emergency treatment at a hospital. Three programmed insulin pump therapy settings governing effective insulin pump function are: Basal Rate (BR), Insulin Sensitivity Factor (ISF), and Carbohydrate Ratio (ICR). For many people using insulin pumps, these therapy settings are often not correct, given their physiological needs. While …
Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler
Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler
SMU Data Science Review
Women’s beach volleyball is one of the fastest growing collegiate sports today. The increase in popularity has come with an increase in valuable scholarship opportunities across the country. With thousands of athletes to sort through, college scouts depend on websites that aggregate tournament results and rank players nationally. This project partnered with the company Volleyball Life, who is the current market leader in the ranking space of junior beach volleyball players. Utilizing the tournament information provided by Volleyball Life, this study explored replacements to the current ranking systems, which are designed to aggregate player points from recent tournament placements. Three …
Improving Data-Driven Infrastructure Degradation Forecast Skill With Stepwise Asset Condition Prediction Models, Kurt R. Lamm, Justin D. Delorit, Michael N. Grussing, Steven J. Schuldt
Improving Data-Driven Infrastructure Degradation Forecast Skill With Stepwise Asset Condition Prediction Models, Kurt R. Lamm, Justin D. Delorit, Michael N. Grussing, Steven J. Schuldt
Faculty Publications
Organizations with large facility and infrastructure portfolios have used asset management databases for over ten years to collect and standardize asset condition data. Decision makers use these data to predict asset degradation and expected service life, enabling prioritized maintenance, repair, and renovation actions that reduce asset life-cycle costs and achieve organizational objectives. However, these asset condition forecasts are calculated using standardized, self-correcting distribution models that rely on poorly-fit, continuous functions. This research presents four stepwise asset condition forecast models that utilize historical asset inspection data to improve prediction accuracy: (1) Slope, (2) Weighted Slope, (3) Condition-Intelligent Weighted Slope, and (4) …
Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu
Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu
Undergraduate Student Research Internships Conference
The variable annuity (VA) is a modern insurance product that offers certain guaranteed protection and tax-deferred treatment. Because of the inherent complexity of guarantees’ payoff, the closed-form solution of fair market values (FMVs) is often not available. Most insurance companies depend on Monte Carlo (MC) simulation to price the FMVs of these products, which is an extremely computational intensive and time-consuming approach. The metamodeling approach can be used to circumvent the heavy computation.
In the modeling stage, the bagged tree method has proved to outperform other parametric approaches. Also, a bias-corrected (BC) bagging model was tried and showed significant improvement …
The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen
The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen
Undergraduate Student Research Internships Conference
This project introduces a flexible univariate probability model referred to as the q-analogue of the Extended Generalized Gamma (or q-EGG) distribution, which encompasses the majority of the most frequently used continuous distributions, including the gamma, Weibull, logistic, type-1 and type-2 beta, Gaussian, Cauchy, Student-t and F. Closed form representations of its moments and cumulative distribution function are provided. Additionally, computational techniques are proposed for determining estimates of its parameters. Both the method of moments and the maximum likelihood approach are utilized. The effect of each parameter is also graphically illustrated. Certain data sets are modeled with q-EGG distributions; goodness of …
Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang
Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang
Undergraduate Student Research Internships Conference
Maintaining an adequate level of earthquake take-up rate could protect the insurance industry from systemic failure. Past research has shown that British Columbia and Quebec have significant differences in earthquake insurance take-up rate. This report investigates key factors from the structure (default options and various types) of the insurance plan and personal characteristics along with socioeconomic/demographic profiles that affect the demand for earthquake protection in the form of insurance. The report also provides a prediction model for earthquake insurance take-up rate. The results show an importance ranking of key factors of earthquake insurance take up, the most important three are …
Financial Literacy: Self-Evaluation And Reality, Yangsijia Wang
Financial Literacy: Self-Evaluation And Reality, Yangsijia Wang
Undergraduate Student Research Internships Conference
This study is on the topic of financial literacy, with the data source containing information on clients' demographic information and self-evaluation, change in account value, and trade record, three major problems were investigated: first, whether a client's demographic traits are related to his/her self-evaluation of financial knowledge level; second, does the trading behaviour differ for clients who self-identified as in different financial knowledge groups; and third, do people who self-identified as financially knowledgeable have better investment result. Data manipulation was done using SQL and R. Exploratory analysis including multiple types of plots and proportion tables was used to derive the …
Building Intrapersonal Competencies In The First-Year Experience: Utilizing Random Forest, Cluster Analysis, And Linear Regression To Identify Students’ Strengths And Opportunities For Institutional Improvement, Marilee Bresciani Ludvik, Shiming Zhang, Sandra Kahn, Nina Potter, Lisa Richardson-Gates, Stephen Schellenberg, Robyn Saiki, Nasima Subedi, Rebecca Harmata, Rey Monzon, Randy Timm, Jeanne Stronach, Anna Jost
Building Intrapersonal Competencies In The First-Year Experience: Utilizing Random Forest, Cluster Analysis, And Linear Regression To Identify Students’ Strengths And Opportunities For Institutional Improvement, Marilee Bresciani Ludvik, Shiming Zhang, Sandra Kahn, Nina Potter, Lisa Richardson-Gates, Stephen Schellenberg, Robyn Saiki, Nasima Subedi, Rebecca Harmata, Rey Monzon, Randy Timm, Jeanne Stronach, Anna Jost
Practical Assessment, Research, and Evaluation
Leveraging research that illustrates the importance of intrapersonal competency cultivation and its correlation with institutional performance indicators of student success such as end-of-term cumulative GPA, persistence, and academic probation, our team set out to conduct an analysis on the effectiveness of a 1-unit credit/no-credit first-semester, first-year student seminar course. The course was designed to cultivate specific intrapersonal competency gains using a pre- and post-assessment design. Using a supervised Random Forest method and cluster analysis, the team expected to find unique differences in intrapersonal competency pre-, matched pre- and post-, and post-assessment inventory scores in a way where course design improvements …
Practical T-Test Power Analysis With R, Teck Kiang Tan
Practical T-Test Power Analysis With R, Teck Kiang Tan
Practical Assessment, Research, and Evaluation
Power analysis based on the analytical t-test is an important aspect of a research study to determine the sample size required to detect the effect for the comparison of two means. The current paper presents a reader-friendly procedure for carrying out the t-test power analysis using the various R add-on packages. While there is a growing of R users in the academic that uses R as the base for carrying out research, there is a lack of reference that discusses both frequentist and Bayesian approaches and point out their distinct features for t-test power analysis. The practical aspects of the …
Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun
Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun
Statistical Science Theses and Dissertations
Alternating recurrent events data arise commonly in health research; examples include hospital admissions and discharges of diabetes patients; exacerbations and remissions of chronic bronchitis; and quitting and restarting smoking. Recent work has involved formulating and estimating joint models for the recurrent event times considering non-negligible event durations. However, prediction models for transition between recurrent events are lacking. We consider the development and evaluation of methods for predicting future events within these models. Specifically, we propose a tool for dynamically predicting transition between alternating recurrent events in real time. Under a flexible joint frailty model, we derive the predictive probability of …
To Logit Or Not To Logit Data In The Unit Interval: A Simulation Study, Kayode Idris Hamzat
To Logit Or Not To Logit Data In The Unit Interval: A Simulation Study, Kayode Idris Hamzat
Major Papers
In this paper, we recommend a mechanism for determining whether to logit or not to logit data in the unit interval which is based on quantile estimation of data between 0 and 1. By using a simulated dataset generated from a Beta regression model, the estimated quantile for this model perform better than those based on the linear quantile regression with logit transformation.
Further, we investigate the performance of the quantile regression estimators based on the LQR and we conclude that it is better than those based on the Beta regression when the distribution is contaminated with 10% uniform numbers …
Advanced High Dimensional Regression Techniques, Yuan Yang
Advanced High Dimensional Regression Techniques, Yuan Yang
All Dissertations
This dissertation focuses on developing high dimensional regression techniques to analyze large scale data using both Bayesian and frequentist approaches, motivated by data sets from various disciplines, such as public health and genetics. More specifically, Chapters 2 and Chapter 4 take a Bayesian approach to achieve modeling and parameter estimation simultaneously while Chapter 3 takes a frequentist approach. The main aspects of these techniques are that they perform variable selection and parameter estimation simultaneously, while also being easily adaptable to large-scale data. In particular, by embedding a logistic model into traditional spike and slab framework and selecting of proper prior …
Quantum Computing Simulation Of The Hydrogen Molecule System With Rigorous Quantum Circuit Derivations, Yili Zhang
Quantum Computing Simulation Of The Hydrogen Molecule System With Rigorous Quantum Circuit Derivations, Yili Zhang
All Graduate Plan B and other Reports
Quantum computing has been an emerging technology in the past few decades. It utilizes the power of programmable quantum devices to perform computation, which can solve complex problems in a feasible time that is impossible with classical computers. Simulating quantum chemical systems using quantum computers is one of the most active research fields in quantum computing. However, due to the novelty of the technology and concept, most materials in the literature are not accessible for newbies in the field and sometimes can cause ambiguity for practitioners due to missing details.
This report provides a rigorous derivation of simulating quantum chemistry …