Full-Text Articles in Physical Sciences and Mathematics

Why Rectified Linear Neurons Are Efficient: Symmetry-Based, Complexity-Based, And Fuzzy-Based Explanations, Olac Fuentes, Justin Parra, Elizabeth Y. Anthony, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

Traditionally, neural networks used a sigmoid activation function. Recently, it turned out that piecewise linear activation functions are much more efficient -- especially in deep learning applications. However, so far, there has been no convincing theoretical explanation for this empirical efficiency. In this paper, we show that, by using different uncertainty techniques, we can come up with several explanations for the efficiency of piecewise linear neural networks. The existence of several different explanations makes us even more confident in our results -- and thus, in the efficiency of piecewise linear activation functions.
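
For reference, the standard forms of the two activation functions being compared are the sigmoid and the rectified linear unit (the simplest piecewise linear choice):

    s(x) = \frac{1}{1 + e^{-x}}, \qquad \mathrm{ReLU}(x) = \max(0, x).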


Z-Numbers: How They Describe Student Confidence And How They Can Explain (And Improve) Laplacian And Schroedinger Eigenmap Dimension Reduction In Data Analysis, Vladik Kreinovich, Olga Kosheleva, Michael Zakharevich Dec 2017

Departmental Technical Reports (CS)

Experts have different degrees of confidence in their statements. To describe these different degrees of confidence, Lotfi A. Zadeh proposed the notion of a Z-number: a fuzzy set (or other type of uncertainty) supplemented by a degree of confidence in the statement corresponding to the fuzzy set. In this chapter, we show that Z-numbers provide a natural formalization of the competence-vs-confidence dichotomy, which is especially important for educating low-income students. We also show that Z-numbers provide a natural theoretical explanation for several empirically successful heuristic techniques of dimension reduction in data analysis, such as Laplacian and Schroedinger eigenmaps, and, moreover, show how …


Why Deep Learning Methods Use Kl Divergence Instead Of Least Squares: A Possible Pedagogical Explanation, Olga Kosheleva, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

In most applications of data processing, we select the parameters that minimize the mean square approximation error. The same Least Squares approach has been used in traditional neural networks. However, for deep learning, it turns out that an alternative idea works better -- namely, minimizing the Kullback-Leibler (KL) divergence. The use of KL divergence is justified if we predict probabilities, but this divergence has been successful in other situations as well. In this paper, we provide a possible explanation for this empirical success. Namely, the Least Squares approach is optimal when the approximation error is normally …
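
For reference, the two criteria have the standard forms: Least Squares minimizes

    \sum_i \left( y_i - \widehat{y}_i \right)^2,

while, for target probabilities p_i and predicted probabilities q_i, the Kullback-Leibler divergence is

    D_{KL}(p \,\|\, q) = \sum_i p_i \ln \frac{p_i}{q_i}.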


Why Triangular Membership Functions Are Often Efficient In F-Transform Applications: Relation To Interval Uncertainty And Haar Wavelets, Olga Kosheleva, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

Fuzzy techniques describe expert opinions. At first glance, we would therefore expect that the more accurately the corresponding membership functions describe the expert's opinions, the better the corresponding results. In practice, however, contrary to these expectations, the simplest -- and not very accurate -- triangular membership functions often work the best. In this paper, on the example of the use of membership functions in F-transform techniques, we provide a possible theoretical explanation for this surprising empirical phenomenon.
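
For reference, a triangular membership function centered at a node x_k, with half-width h equal to the grid step, has the standard form

    A_k(x) = \max\left( 0,\; 1 - \frac{|x - x_k|}{h} \right);

a family of such functions forms the fuzzy partition typically used in F-transform applications.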


Beyond Integration: A Symmetry-Based Approach To Reaching Stationarity In Economic Time Series, Songsak Sriboonchitta, Olga Kosheleva, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

Many efficient data processing techniques assume that the corresponding process is stationary. However, in areas like economics, most processes are not stationary: with the exception of stagnation periods, economies usually grow. A known way to apply stationarity-based methods to such processes -- integration -- is based on the fact that often, while the process itself is not stationary, its first or second differences are stationary. This idea works when the trend polynomially depends on time. In practice, the trend is usually non-polynomial: it is often exponentially growing, with cycles added. In this paper, we show how integration techniques can be …
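
For reference, integration here means passing to differences: the first difference \Delta x_t = x_t - x_{t-1} turns a linear trend into a constant, the second difference \Delta^2 x_t = x_t - 2 x_{t-1} + x_{t-2} does the same for a quadratic trend, and, in general, taking differences d times removes a polynomial trend of degree d -- which is why the idea works for polynomial trends but not for exponential ones.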


Why Sparse?, Thongchai Dumrongpokaphan, Olga Kosheleva, Vladik Kreinovich, Aleksandra Belina Dec 2017

Departmental Technical Reports (CS)

In many situations, a solution to a practical problem is sparse, i.e., corresponds to the case when most of the parameters describing the solution are zeros, and only a few attain non-zero values. This surprising empirical phenomenon helps solve the corresponding problems -- but it remains unclear why this phenomenon happens. In this paper, we provide a possible theoretical explanation for this mysterious phenomenon.
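
For context, an illustrative (assumed) example of how such sparse solutions are typically obtained in practice: one adds an l1 penalty and minimizes

    \|A x - b\|_2^2 + \lambda \|x\|_1,

whose minimizers tend to have many components exactly equal to zero.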


How To Best Apply Neural Networks In Geosciences: Towards Optimal "Averaging" In Dropout Training, Afshin Gholamy, Justin Parra, Vladik Kreinovich, Olac Fuentes, Elizabeth Y. Anthony Dec 2017

Departmental Technical Reports (CS)

The main objectives of geosciences are to find the current state of the Earth -- i.e., to solve the corresponding inverse problems -- and to use this knowledge to predict future events, such as earthquakes and volcanic eruptions. In both inverse and prediction problems, machine learning techniques are often very efficient, and at present, the most efficient machine learning technique is deep neural network training. To speed up this training, the current learning algorithms use dropout techniques: they train several sub-networks on different portions of data, and then "average" the results. A natural idea is to use the arithmetic mean for this …
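
For reference, the "averaging" in question combines the outputs y_1, ..., y_n of the trained sub-networks; the natural arithmetic-mean combination is

    y = \frac{1}{n} \sum_{k=1}^{n} y_k,

and the question studied here is whether some other way of combining the y_k works better.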


Why Taylor Models And Modified Taylor Models Are Empirically Successful: A Symmetry-Based Explanation, Mioara Joldes, Christoph Lauter, Martine Ceberio, Olga Kosheleva, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

In this paper, we show that symmetry-based ideas can explain the empirical success of Taylor models and modified Taylor models in representing uncertainty.


How To Store Tensors In Computer Memory: An Observation, Martine Ceberio, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

In this paper, after explaining the need to use tensors in computing, we analyze the question of how to best store tensors in computer memory. Somewhat surprisingly, with respect to a natural optimality criterion, the standard way of storing tensors turns out to be one of the optimal ones.
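
For reference, the standard (row-major, lexicographic) layout places element (i_1, ..., i_d) of an n_1 x ... x n_d tensor, with indices starting at 0, at the offset

    \mathrm{addr}(i_1, \dots, i_d) = (\cdots((i_1 n_2 + i_2)\, n_3 + i_3)\cdots)\, n_d + i_d;

this is the usual layout whose optimality is analyzed.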


How To Make A Proof Of Halting Problem More Convincing: A Pedagogical Remark, Benjamin W. Robertson, Olga Kosheleva, Vladik Kreinovich Dec 2017

Departmental Technical Reports (CS)

As an example of an algorithmically undecidable problem, most textbooks list the impossibility to check whether a given program halts on given data. A usual proof of this result is based on the assumption that the hypothetical halt-checker works for all programs. To show that a halt-checker is impossible, we design an auxiliary program for which the existence of such a halt-checker leads to a contradiction. However, this auxiliary program is usually very artificial. So, a natural question arises: what if we only require that the halt-checker work for reasonable programs? In this paper, we show that even with such …
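
For readers who want to recall the construction being refined, here is a minimal sketch of the usual auxiliary program (a Java-style sketch; the names are illustrative, and halts() is the hypothetical checker that, of course, cannot actually be implemented):

    class Diagonal {
        // Hypothetical halt-checker: returns true iff 'program' halts on 'data'.
        static boolean halts(String program, String data) {
            throw new UnsupportedOperationException("assumed to exist for the sake of contradiction");
        }

        // The artificial auxiliary program from the standard proof:
        static void paradox(String p) {
            if (halts(p, p)) {      // if the checker claims p halts on itself ...
                while (true) { }    // ... loop forever;
            }                       // otherwise, halt immediately.
        }
        // Running paradox on its own source code contradicts whatever halts() answers.
    }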


Propagation Of Probabilistic Uncertainty: The Simplest Case (A Brief Pedagogical Introduction), Olga Kosheleva, Vladik Kreinovich Nov 2017

Departmental Technical Reports (CS)

The main objective of this text is to provide a brief introduction to formulas describing the simplest case of propagation of probabilistic uncertainty -- for students who have not yet taken a probability course.
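
For reference, the simplest such formula is the standard linearized propagation rule: if y = f(x_1, ..., x_n) and the inputs have small independent measurement errors with standard deviations \sigma_1, ..., \sigma_n, then

    \sigma_y^2 \approx \sum_{i=1}^{n} \left( \frac{\partial f}{\partial x_i} \right)^2 \sigma_i^2.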


Sudoku App: Model-Driven Development Of Android Apps Using Ocl?, Yoonsik Cheon, Aditi Barua Nov 2017

Departmental Technical Reports (CS)

Model-driven development (MDD) shifts the focus of software development from writing code to building models, by developing an application as a series of transformations on models, including eventual code generation. Can the key ideas of MDD be applied to the development of apps for Android, one of the most popular mobile platforms today? To answer this question, we perform a small case study of developing an Android app for playing Sudoku puzzles. We use the Object Constraint Language (OCL) as the notation for creating precise models and translate OCL constraints to Android Java code. Our findings are mixed in …
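
As an illustration of the kind of translation studied here (an assumed example; the class and constraint names are not taken from the report), an OCL invariant and a straightforward Java rendering might look as follows:

    // OCL:  context Board inv allCellsPresent: cells->size() = 81
    class Board {
        private final java.util.List<Integer> cells = new java.util.ArrayList<>();

        // Java check corresponding to the invariant cells->size() = 81:
        boolean invAllCellsPresent() {
            return cells.size() == 81;
        }
    }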


Impacts Of Java Language Features On The Memory Performances Of Android Apps, Yoonsik Cheon, Adriana Escobar De La Torre Sep 2017

Departmental Technical Reports (CS)

Android apps are written in Java, but unlike Java applications they are resource-constrained in storage capacity and battery lifetime. In this document, we perform an experiment to measure quantitatively the impact of Java language and standard API features on the memory efficiency of Android apps. We focus on garbage collection because it is a critical process for performance affecting user experience. We learned that even Java language constructs and standard application programming interfaces (APIs) may be a source of a performance problem causing a significant memory overhead for Android apps. Any critical section of code needs to be scrutinized on …
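
One typical example of the kind of language-level overhead in question (an illustrative sketch, not code from the report): autoboxing in a tight loop allocates a short-lived wrapper object on almost every iteration, creating extra garbage-collection work that a primitive-typed loop avoids.

    class BoxingDemo {
        public static void main(String[] args) {
            Long boxedSum = 0L;                     // boxed: each += allocates a new Long
            for (long i = 0; i < 1_000_000; i++) {
                boxedSum += i;
            }
            long primitiveSum = 0L;                 // primitive: no per-iteration allocation
            for (long i = 0; i < 1_000_000; i++) {
                primitiveSum += i;
            }
            System.out.println(boxedSum + " " + primitiveSum);
        }
    }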


Need For A Large-N Array (And Wavelets And Differences) To Determine The Assumption-Free 3-D Earth Model, Solymar Ayala Cortez, Aaron A. Velasco, Vladik Kreinovich Sep 2017

Departmental Technical Reports (CS)

One of the main objectives of geophysical seismic analysis is to determine the Earth's structure. Usually, to determine this structure, geophysicists supplement the measurement results with additional geophysical assumptions. An important question is: when is it possible to reconstruct the Earth's structure uniquely based on the measurement results only, without the need to use any additional assumptions? In this paper, we show that for this, one needs to use large-N arrays -- 2-D arrays of seismic sensors. To actually perform this reconstruction, we need to use differences between measurements by neighboring sensors and we need to apply wavelet analysis to …


Efficient Parameter-Estimating Algorithms For Symmetry-Motivated Models: Econometrics And Beyond, Vladik Kreinovich, Anh H. Ly, Olga Kosheleva, Songsak Sriboonchitta Aug 2017

Departmental Technical Reports (CS)

It is known that symmetry ideas can explain the empirical success of many non-linear models. This explanation makes these models theoretically justified and thus, more reliable. However, the models remain non-linear and thus, identification of the model's parameters based on the observations remains a computationally expensive nonlinear optimization problem. In this paper, we show that symmetry ideas can not only help to select and justify a nonlinear model, they can also help us design computationally efficient almost-linear algorithms for identifying the model's parameters.


Practical Need For Algebraic (Equality-Type) Solutions Of Interval Equations And For Extended-Zero Solutions, Ludmila Dymova, Pavel Sevastjanov, Andrzej Pownuk, Vladik Kreinovich Jul 2017

Departmental Technical Reports (CS)

One of the main problems in interval computations is solving systems of equations under interval uncertainty. Usually, interval computation packages consider united, tolerance, and control solutions. In this paper, we explain the practical need for algebraic (equality-type) solutions, when we look for solutions for which both sides are equal. In situations when such a solution is not possible, we provide a justification for extended-zero solutions, in which we ignore intervals of the type [−a, a].
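
A small worked illustration (an assumed example, not taken from the paper): for the interval equation [1,2] + [x] = [3,5], the algebraic (equality-type) solution is [x] = [2,3], since [1,2] + [2,3] = [3,5] exactly; for [1,3] + [x] = [4,5] no interval makes the two sides equal (the right-hand side is narrower than the left), and it is for such cases that extended-zero solutions are proposed.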


How To Use Absolute-Error-Minimizing Software To Minimize Relative Error: Practitioner's Guide, Afshin Gholamy, Vladik Kreinovich Jul 2017

Departmental Technical Reports (CS)

In many engineering and scientific problems, there is a need to find the parameters of a dependence from the experimental data. There exist several software packages that find the values for these parameters -- values for which the mean square value of the absolute approximation error is the smallest. In practice, however, we are often interested in minimizing the mean square value of the relative approximation error. In this paper, we show how we can use the absolute-error-minimizing software to minimize the relative error.
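
One standard reduction of this kind (stated here for orientation; the paper's recipe may differ in details): minimizing the mean square relative error

    \sum_i \left( \frac{y_i - f(x_i, p)}{y_i} \right)^2 = \sum_i \left( 1 - \frac{f(x_i, p)}{y_i} \right)^2

amounts to giving the absolute-error-minimizing software the rescaled model g(x_i, p) = f(x_i, p) / y_i with all target values equal to 1.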


Granular Approach To Data Processing Under Probabilistic Uncertainty, Andrzej Pownuk, Vladik Kreinovich Jul 2017

Departmental Technical Reports (CS)

In many real-life situations, uncertainty can be naturally described as a combination of several components, components which are described by probabilistic, fuzzy, interval, etc. granules. In such situations, to process this uncertainty, it is often beneficial to take this granularity into account by processing these granules separately and then combining the results.

In this paper, we show that granular computing can help even in situations when there is no such natural decomposition into granules: namely, we can often speed up processing of uncertainty if we first (artificially) decompose the original uncertainty into appropriate granules.


A Thought On Refactoring Java Loops Using Java 8 Streams, Khandoker Rahad, Zejing Cao, Yoonsik Cheon Jun 2017

Departmental Technical Reports (CS)

Java 8 has introduced a new abstraction called a stream to represent an immutable sequence of elements and to provide a variety of operations to be executed on the elements in series or in parallel. By processing a collection of data in a declarative way, it enables one to write more concise and clean code that can also leverage multi-core architectures without needing a single line of multithread code to be written. In this document, we describe our preliminary work on systematically refactoring loops with Java 8 streams to produce more concise and clean code. Our idea is to adapt …
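
As a small illustration of the kind of refactoring meant here (an assumed example; the names are not taken from the report):

    import java.util.ArrayList;
    import java.util.Arrays;
    import java.util.List;
    import java.util.stream.Collectors;

    class LoopVsStream {
        public static void main(String[] args) {
            List<String> names = Arrays.asList("Ada", "Grace", "Al", "Edsger");

            // Hand-written loop:
            List<String> fromLoop = new ArrayList<>();
            for (String s : names) {
                if (s.length() > 3) {
                    fromLoop.add(s.toUpperCase());
                }
            }

            // Equivalent Java 8 stream pipeline:
            List<String> fromStream = names.stream()
                    .filter(s -> s.length() > 3)
                    .map(String::toUpperCase)
                    .collect(Collectors.toList());

            System.out.println(fromLoop + " " + fromStream);
        }
    }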


How Better Are Predictive Models: Analysis On The Practically Important Example Of Robust Interval Uncertainty, Vladik Kreinovich, Hung T. Nguyen, Songsak Sriboonchitta, Olga Kosheleva Jun 2017

Departmental Technical Reports (CS)

One of the main applications of science and engineering is to predict future value of different quantities of interest. In the traditional statistical approach, we first use observations to estimate the parameters of an appropriate model, and then use the resulting estimates to make predictions. Recently, a relatively new predictive approach has been actively promoted, the approach where we make predictions directly from observations. It is known that in general, while the predictive approach requires more computations, it leads to more accurate predictions. In this paper, on the practically important example of robust interval uncertainty, we analyze how more accurate …


How To Gauge Accuracy Of Processing Big Data: Teaching Machine Learning Techniques To Gauge Their Own Accuracy, Vladik Kreinovich, Thongchai Dumrongpokaphan, Hung T. Nguyen, Olga Kosheleva Jun 2017

Departmental Technical Reports (CS)

When the amount of data is reasonably small, we can usually fit this data to a simple model and use the traditional statistical methods both to estimate the parameters of this model and to gauge this model's accuracy. For big data, it is often no longer possible to fit them by a simple model. Thus, we need to use generic machine learning techniques to find the corresponding model. The current machine learning techniques estimate the values of the corresponding parameters, but they usually do not gauge the accuracy of the corresponding general non-linear model. In this paper, we show how …


Kuznets Curve: A Simple Dynamical System-Based Explanation, Thongchai Dumrongpokaphan, Vladik Kreinovich Jun 2017

Departmental Technical Reports (CS)

In the 1950s, the future Nobel laureate Simon Kuznets discovered the following phenomenon: as a country's economy improves, inequality first grows but then decreases. In this paper, we provide a simple dynamical-system-based explanation for this empirical phenomenon.


Taking Into Account Interval (And Fuzzy) Uncertainty Can Lead To More Adequate Statistical Estimates, Ligang Sun, Hani Dbouk, Steffen Schön, Vladik Kreinovich Jun 2017

Departmental Technical Reports (CS)

Traditional statistical data processing techniques (such as Least Squares) assume that we know the probability distributions of measurement errors. Often, we do not have full information about these distributions. In some cases, all we know is the bound of the measurement error; in such cases, we can use known interval data processing techniques. Sometimes, this bound is fuzzy; in such cases, we can use known fuzzy data processing techniques.

However, in many practical situations, we know the probability distribution of the random component of the measurement error and we know the upper bound -- numerical or fuzzy -- on the …


Entropy As A Measure Of Average Loss Of Privacy, Luc Longpre, Vladik Kreinovich, Thongchai Dumrongpokaphan Jun 2017

Departmental Technical Reports (CS)

Privacy means that not everything about a person is known, that we need to ask additional questions to get the full information about the person. It therefore seems reasonable to gauge the degree of privacy in each situation by the average number of binary ("yes"-"no") questions that we need to ask to determine the full information -- which is exactly Shannon's entropy. The problem with this idea is that it is possible, by asking two binary questions -- and thus, strictly speaking, getting only two bits of information -- to sometimes learn a large amount of information. In this …
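
For reference, the quantity in question is Shannon's entropy of the corresponding probability distribution p_1, ..., p_n,

    H = -\sum_{i} p_i \log_2 p_i,

which, up to less than one question, equals the smallest possible average number of binary questions needed to determine which alternative actually holds.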


Maximum Entropy As A Feasible Way To Describe Joint Distributions In Expert Systems, Thongchai Dumrongpokaphan, Vladik Kreinovich, Hung T. Nguyen Jun 2017

Departmental Technical Reports (CS)

In expert systems, we elicit the probabilities of different statements from the experts. However, to adequately use the expert system, we also need to know the probabilities of different propositional combinations of the experts' statements -- i.e., we need to know the corresponding joint distribution. The problem is that there are exponentially many such combinations, and it is not practically possible to elicit all their probabilities from the experts. So, we need to estimate this joint distribution based on the available information. For this purpose, many practitioners use heuristic approaches -- e.g., the t-norm approach of fuzzy logic. However, this …
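
A simple illustrative case (assumed here for concreteness): if all we know are the marginals P(A) = a and P(B) = b, then among all joint distributions with these marginals, the entropy is maximized by the independent one, so the maximum entropy estimate is P(A \& B) = a \cdot b, which coincides with the product t-norm of fuzzy logic.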


Why Student Distributions? Why Matern's Covariance Model? A Symmetry-Based Explanation, Steffen Schön, Gaël Kermarrec, Boris Kargoll, Ingo Neumann, Olga Kosheleva, Vladik Kreinovich Jun 2017

Departmental Technical Reports (CS)

In this paper, we show that the empirical successes of the Student distribution and of Matern's covariance models can be indirectly explained by a natural requirement of scale invariance -- that fundamental laws should not depend on the choice of physical units. Namely, while neither the Student distributions nor Matern's covariance models are themselves scale-invariant, they are the only ones that can be obtained by applying a scale-invariant combination function to scale-invariant functions.
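
For reference, the standard forms of the two objects: the Student distribution with \nu degrees of freedom has probability density

    f(t) \propto \left( 1 + \frac{t^2}{\nu} \right)^{-(\nu + 1)/2},

and Matern's covariance model has the form

    C(d) = \sigma^2 \, \frac{2^{1-\nu}}{\Gamma(\nu)} \left( \sqrt{2\nu}\, \frac{d}{\rho} \right)^{\nu} K_\nu\!\left( \sqrt{2\nu}\, \frac{d}{\rho} \right),

where d is the distance, \rho is a range parameter, and K_\nu is the modified Bessel function of the second kind.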


Markowitz Portfolio Theory Helps Decrease Medicines' Side Effect And Speed Up Machine Learning, Thongchai Dumrongpokaphan, Vladik Kreinovich Jun 2017

Departmental Technical Reports (CS)

In this paper, we show that, similarly to the fact that distributing the investment between several independent financial instruments decreases the investment risk, using a combination of several medicines can decrease the medicines' side effects. Moreover, the formulas for optimal combinations of medicine are the same as the formulas for the optimal portfolio, formulas first derived by the Nobel-prize winning economist H. M. Markowitz. A similar application to machine learning explains a recent success of a modified neural network in which the input neurons are also directly connected to the output ones.
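
For reference, the basic variance-reduction effect behind all three applications: if n independent components (financial instruments, medicines, or inputs) each contribute a random effect with the same variance \sigma^2, then their equally-weighted combination has variance

    \mathrm{Var}\left( \frac{1}{n} \sum_{k=1}^{n} X_k \right) = \frac{\sigma^2}{n},

so spreading the total "dose" over several independent components reduces the overall risk; Markowitz's formulas generalize this to components with different variances and correlations.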


What If We Do Not Know Correlations?, Michael Beer, Zitong Gong, Ingo Neumann, Songsak Sriboonchitta, Vladik Kreinovich Jun 2017

Departmental Technical Reports (CS)

It is well known how to estimate the uncertainty of the result y of data processing if we know the correlations between all the inputs. Sometimes, however, we have no information about the correlations. In this case, instead of a single value σ of the standard deviation of the result, we get a range [σ] of possible values. In this paper, we show how to compute this range.
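
A small worked case showing why a range appears (an assumed illustration, not taken from the paper): for y = x_1 + x_2 with standard deviations \sigma_1, \sigma_2 and unknown correlation \rho in [-1, 1],

    \sigma_y^2 = \sigma_1^2 + \sigma_2^2 + 2\rho\,\sigma_1\sigma_2,

so \sigma_y can be anywhere in the interval [\,|\sigma_1 - \sigma_2|,\; \sigma_1 + \sigma_2\,].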


Fuzzy Sets As Strongly Consistent Random Sets, Kittawit Autchariyapanitkul, Hung T. Nguyen, Vladik Kreinovich May 2017

Departmental Technical Reports (CS)

It is known that from the purely mathematical viewpoint, fuzzy sets can be interpreted as equivalence classes of random sets. This interpretation helps to teach fuzzy techniques to statisticians and also enables us to apply results about random sets to fuzzy techniques. The problem with this interpretation is that it is too complicated: a random set is not an easy notion, and classes of random sets are even more complex. This complexity goes against the spirit of fuzzy sets, whose purpose was to be simple and intuitively clear. From this viewpoint, it is desirable to simplify this interpretation. In this …


From Fuzzy Universal Approximation To Fuzzy Universal Representation: It All Depends On The Continuum Hypothesis, Mahdokhat Michelle Afravi, Vladik Kreinovich May 2017

Departmental Technical Reports (CS)

It is known that fuzzy systems have a universal approximation property. A natural question is: can this property be extended to a universal representation property? Somewhat surprisingly, the answer to this question depends on whether the following Continuum Hypothesis holds: every infinite subset of the real line has either the same number of elements as the real line itself or as many elements as the natural numbers.