Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 3 of 3

Full-Text Articles in Physical Sciences and Mathematics

A Nonlinear Filter For Markov Chains And Its Effect On Diffusion Maps, Stefan Steinerberger Sep 2015

A Nonlinear Filter For Markov Chains And Its Effect On Diffusion Maps, Stefan Steinerberger

Yale Day of Data

Diffusion maps are a modern mathematical tool that helps to find structure in large data sets - we present a new filtering technique that is based on the assumption that errors in the data are intrinsically random to isolate and filter errors and thus boost the efficiency of diffusion maps. Applications include data sets from medicine (the Cleveland Heart Disease Data set and the Wisconsin Breast Cancer Data set) and engineering (the Ionosphere data set).


Crowdsourcing Global Wastewater Data, Don Mosteller, Sam Cohen, Cory Nestor, Angel Hsu, Omar Malik Sep 2015

Crowdsourcing Global Wastewater Data, Don Mosteller, Sam Cohen, Cory Nestor, Angel Hsu, Omar Malik

Yale Day of Data

No time to waste: Crowdsourcing global wastewater treatment data

Worldwide, over 80 percent of wastewater is discharged into water bodies without undergoing treatment, severely impairing human well-being and ecosystem vitality along the way. National performance on wastewater treatment is difficult to quantify and is poorly understood due to a lack of common definitions, poor data collection standards, and limited historical data. To address this, the Yale Environmental Performance Index (EPI), a research group that produces a biennial ranking of country-level environmental performance, developed a first-of-its kind national wastewater treatment indicator.[1]

The indicator assesses wastewater treatment performance for 183 countries, …


K-Mer Analysis On Developmental And Housekeeping Enhancer Peaks, Yunsi Yang, Anurag Sethi, Mark Gerstein Sep 2015

K-Mer Analysis On Developmental And Housekeeping Enhancer Peaks, Yunsi Yang, Anurag Sethi, Mark Gerstein

Yale Day of Data

The regulation of gene expression involves interaction between transcriptional enhancers and core promoters. However, the separation between developmental and housekeeping gene regulation remains unknown. Here, we present a method to detect if different core promoters exhibit specificity to certain enhancers within massively parallel assays for enhancer detection. We use k-mers of various length (3-8bp) as sequence features and compare k-mer frequencies between developmental and housekeeping enhancers. This method shows promoter specificity of enhancers in D. melanogaster.