Neural networks, as powerful tools for data mining and knowledge engineering, can learn from data to build feature-based classifiers and nonlinear predictive models. Training neural networks involves the optimization of non-convex objective functions, and the learning process is usually costly and infeasible for applications associated with data streams. A possible, albeit counter-intuitive, alternative is to randomly assign a subset of the networks' weights, so that the resulting optimization task can be formulated as a linear least-squares problem. This methodology can be applied to both feedforward and recurrent networks, and similar techniques can be used to approximate kernel functions. Many experimental results indicate that such randomized models can reach performance comparable to fully adaptable ones, with a number of favourable benefits, including (i) simplicity of implementation, (ii) faster learning with less human intervention, and (iii) the possibility of leveraging all linear regression and classification algorithms (e.g., l1-norm minimization for obtaining sparse formulations). All these points make them attractive and valuable to the data mining community, particularly for handling large-scale data mining in real time. However, the literature in the field is extremely vast and fragmented, with many results being reintroduced multiple times under different names. This overview aims to provide a self-contained, uniform introduction to the different ways in which randomization can be applied to the design of neural networks and kernel functions. A clear exposition of the basic framework underlying all these approaches helps to clarify innovative lines of research and open problems and, most importantly, to foster the exchange of well-known results across different communities.
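To make the basic framework concrete, the following sketch (our own illustration, not taken from the overview) fixes the hidden-layer weights of a single-hidden-layer network at random values, so that only the output weights need to be learned via regularized linear least squares. All sizes, data, and names are illustrative assumptions.

```python
# Minimal sketch of the core idea: random, frozen hidden weights reduce
# training to a linear least-squares problem for the output weights.
import numpy as np

rng = np.random.default_rng(0)

# Toy regression data (sizes are illustrative assumptions).
X = rng.uniform(-1, 1, size=(500, 4))
y = np.sin(X.sum(axis=1)) + 0.05 * rng.standard_normal(500)

n_hidden = 100
W = rng.standard_normal((X.shape[1], n_hidden))   # random, never trained
b = rng.standard_normal(n_hidden)

H = np.tanh(X @ W + b)                            # random nonlinear features

# Output weights via regularized (ridge) least squares, solved in closed form.
lam = 1e-3
beta = np.linalg.solve(H.T @ H + lam * np.eye(n_hidden), H.T @ y)

y_hat = H @ beta
print("training RMSE:", np.sqrt(np.mean((y - y_hat) ** 2)))
```

The small ridge term is a common choice that keeps the linear solve well conditioned when the random features are nearly collinear.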
Information Sciences, 2017
Random Vector Functional-link (RVFL) networks, a class of learner models, can be regarded as feed-forward neural networks built with a specific randomized algorithm: the input weights and biases are randomly assigned and fixed during the training phase, and the output weights are evaluated analytically by the least-squares method. In this paper, we provide some insights into RVFL networks and highlight some practical issues and common pitfalls associated with RVFL-based modelling techniques. Inspired by the folklore that "all high-dimensional random vectors are almost always nearly orthogonal to each other", we establish a theoretical result on the infeasibility of RVFL networks for universal approximation if an RVFL network is built incrementally with random selection of the input weights and biases from a fixed scope and constructive evaluation of its output weights. This work also addresses the significance of the scope setting of random weights and biases with respect to modelling performance. Two numerical examples are employed to illustrate our findings, which theoretically and empirically reveal some facts and limits of this class of randomized learning algorithms.
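A minimal RVFL-style sketch of this setting, with the scope of the random weights exposed as a hyperparameter, might look as follows. The direct input-to-output links, the uniform sampling scheme, and all constants are illustrative assumptions rather than the paper's exact construction.

```python
# RVFL-style sketch: random input weights/biases drawn from a fixed scope
# [-s, s] and frozen; direct input links concatenated with hidden features;
# output weights from a regularized least-squares solve.
import numpy as np

rng = np.random.default_rng(1)
X = rng.uniform(0, 1, size=(400, 3))
y = np.cos(2 * np.pi * X[:, 0]) + X[:, 1] * X[:, 2]

def rvfl_fit_predict(X, y, n_hidden=80, scope=1.0, lam=1e-4):
    """Fit an RVFL-like model; `scope` is the half-width of the uniform
    distribution the random weights and biases are drawn from."""
    r = np.random.default_rng(42)
    W = r.uniform(-scope, scope, size=(X.shape[1], n_hidden))
    b = r.uniform(-scope, scope, size=n_hidden)
    hidden = 1 / (1 + np.exp(-(X @ W + b)))
    H = np.concatenate([X, hidden], axis=1)        # direct links + hidden nodes
    beta = np.linalg.solve(H.T @ H + lam * np.eye(H.shape[1]), H.T @ y)
    return H @ beta

for scope in (0.1, 1.0, 10.0):
    y_hat = rvfl_fit_predict(X, y, scope=scope)
    print(f"scope={scope:5.1f}  RMSE={np.sqrt(np.mean((y - y_hat) ** 2)):.4f}")
```

Running the loop over several scope values gives a quick empirical feel for the point made above: the range from which the random parameters are drawn can change the fit substantially.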
ArXiv, 2020
Single-layer feedforward networks with random weights are known for their non-iterative and fast training algorithms and are successful in a variety of classification and regression problems. A major drawback of these networks is that they require a large number of hidden units. In this paper, we propose a technique to reduce the number of hidden units substantially without significantly affecting the accuracy of the networks. We introduce the concept of primary and secondary hidden units. The weights for the primary hidden units are chosen randomly, while the secondary hidden units are derived using pairwise combinations of the primary hidden units. Using this technique, we show that the number of hidden units can be reduced by at least one order of magnitude. We experimentally show that this technique leads to a significant drop in computation at inference time and has only a minor impact on network accuracy. A huge reduction in computations is possible if slightly lower accuracy is...
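The abstract does not specify how the pairwise combinations are formed; the sketch below assumes, purely for illustration, that each secondary unit is the elementwise product of two primary hidden-unit activations, so that only the primary units consume random weights.

```python
# Rough sketch of the primary/secondary idea under an ASSUMED combination rule:
# secondary units are pairwise products of primary activations.
import numpy as np
from itertools import combinations

rng = np.random.default_rng(2)
X = rng.standard_normal((300, 5))
y = np.tanh(X[:, 0] * X[:, 1]) + X[:, 2]

n_primary = 12
W = rng.standard_normal((X.shape[1], n_primary))
b = rng.standard_normal(n_primary)
H_primary = np.tanh(X @ W + b)

# Secondary units: pairwise products of primary activations (assumed form).
pairs = list(combinations(range(n_primary), 2))
H_secondary = np.stack([H_primary[:, i] * H_primary[:, j] for i, j in pairs], axis=1)

H = np.concatenate([H_primary, H_secondary], axis=1)
lam = 1e-3
beta = np.linalg.solve(H.T @ H + lam * np.eye(H.shape[1]), H.T @ y)
print("effective hidden units:", H.shape[1],
      " RMSE:", np.sqrt(np.mean((y - H @ beta) ** 2)))
```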
Lecture Notes in Computer Science, 2019
In this work, a method of random-parameter generation for randomized learning of a single-hidden-layer feedforward neural network is proposed. The method first randomly selects the slope angles of the hidden neurons' activation functions from an interval adjusted to the target function, then randomly rotates the activation functions, and finally distributes them across the input space. For complex target functions the proposed method gives better results than the approach commonly used in practice, where the random parameters are selected from a fixed interval. This is because it introduces the steepest fragments of the activation functions into the input hypercube, avoiding their saturated fragments.
2007
To accelerate the training of kernel machines, we propose to map the input data to a randomized low-dimensional feature space and then apply existing fast linear methods. Our randomized features are designed so that the inner products of the transformed data are approximately equal to those in the feature space of a user-specified shift-invariant kernel. We explore two sets of random features, provide convergence bounds on their ability to approximate various radial basis kernels, and show that in large-scale classification and regression tasks, linear machine learning algorithms that use these features outperform state-of-the-art large-scale kernel machines.
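The construction can be summarized in a few lines. The sketch below uses random Fourier features, one of the standard constructions for shift-invariant kernels, to approximate the Gaussian (RBF) kernel; all sizes are chosen for illustration only.

```python
# Random Fourier features for the Gaussian kernel
# k(x, y) = exp(-||x - y||^2 / (2 * sigma^2)):
# the inner product of the transformed vectors approximates the kernel value.
import numpy as np

rng = np.random.default_rng(3)
d, D, sigma = 5, 2000, 1.0               # input dim, number of features, kernel width

W = rng.standard_normal((d, D)) / sigma  # frequencies ~ N(0, sigma^-2 I)
b = rng.uniform(0, 2 * np.pi, D)         # random phases

def z(X):
    """Map X of shape (n, d) to the randomized feature space of shape (n, D)."""
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

x, y_vec = rng.standard_normal(d), rng.standard_normal(d)
exact = np.exp(-np.sum((x - y_vec) ** 2) / (2 * sigma ** 2))
approx = z(x[None, :]) @ z(y_vec[None, :]).T
print(f"exact kernel {exact:.4f}  vs  random-feature estimate {approx.item():.4f}")
```

Once the data are mapped through z, any fast linear regression or classification routine can be applied to the transformed features in place of the kernel machine.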
2017
Neural networks with random hidden nodes have gained increasing interest from researchers and practical applications. This is due to their unique features, such as very fast training and the universal approximation property. In these networks the weights and biases of the hidden nodes, which determine the nonlinear feature mapping, are set randomly and are not learned. Appropriate selection of the intervals from which the weights and biases are drawn is extremely important; this topic has not yet been sufficiently explored in the literature. In this work a method of generating random weights and biases is proposed. This method generates the parameters of the hidden nodes in such a way that the nonlinear fragments of the activation functions are located in the input-space regions containing data and can be used to construct the surface approximating a nonlinear target function. The weights and biases depend on the input data range and the activation function type. The proposed method allows us to control ...
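One simple realisation consistent with this description, though not necessarily the paper's exact method, ties each node's bias to a randomly selected training point so that the steep, non-saturated fragment of the sigmoid falls inside the data region. All constants below are illustrative.

```python
# Hedged sketch: centre each sigmoid on a random training point by choosing
# the bias so that w_i^T x + b_i = 0 at that point, keeping the nonlinear
# fragment of the activation inside the data region.
import numpy as np

rng = np.random.default_rng(4)
X = rng.uniform(-2, 2, size=(400, 2))
y = np.sin(np.pi * X[:, 0]) * np.cos(np.pi * X[:, 1])

n_hidden = 60
W = rng.uniform(-5, 5, size=(X.shape[1], n_hidden))   # slopes; range is an assumption

# Bias of node i chosen so its sigmoid is centred on a random training point.
anchors = X[rng.integers(0, X.shape[0], size=n_hidden)]
b = -np.einsum('ij,ji->i', anchors, W)

H = 1 / (1 + np.exp(-(X @ W + b)))
lam = 1e-4
beta = np.linalg.solve(H.T @ H + lam * np.eye(n_hidden), H.T @ y)
print("RMSE:", np.sqrt(np.mean((y - H @ beta) ** 2)))
```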
2017 International Joint Conference on Neural Networks (IJCNN), 2017
Randomized neural networks (RNNs) are a highly feasible solution in the era of big data because they offer a simple and fast working principle for processing dynamic and evolving data streams. This paper proposes a novel RNN, namely the recurrent type-2 random vector functional link network (RT2McRVFLN), which provides a highly scalable solution for data streams in a strictly online and integrated framework. It is built upon the psychologically inspired concept of metacognitive learning, which covers three basic components of human learning: what-to-learn, how-to-learn, and when-to-learn. The what-to-learn component selects important samples on the fly using an online active learning scenario, which renders our algorithm an online semi-supervised algorithm. The how-to-learn process combines an open structure of evolving concepts with the randomized learning algorithm of the random vector functional link network (RVFLN). The efficacy of the RT2McRVFLN has been numerically validated through two real-world case studies and comparisons with its counterparts, which lead to the conclusive finding that our algorithm delivers a tradeoff between accuracy and simplicity.
This paper develops multi-layer classifiers and auto-encoders based on the Random Neural Network. Our motivation is to build robust classifiers that can be used in systems applications, such as Cloud management, for the accurate detection of states that can lead to failures. Using an idea concerning soma-to-soma interactions between natural neuronal cells, we discuss a basic building block constructed from clusters of densely packed cells whose mathematical properties are based on G-Networks and the Random Neural Network. These mathematical properties lead to a transfer function that can be exploited for large arrays of cells. Based on this mathematical structure we build multi-layer networks. In order to evaluate the level of classification accuracy that can be achieved, we test these auto-encoders and classifiers on a widely used standard database of handwritten characters.
This paper introduces techniques for Deep Learning in conjunction with spiked random neural networks that closely resemble the stochastic behaviour of biological neurons in mammalian brains. The paper introduces clusters of such random neural networks and obtains the characteristics of their collective behaviour. Combining this model with previous work on extreme learning machines, we develop multilayer architectures that structure Deep Learning architectures as a "front end" of one or two layers of random neural networks, followed by an extreme learning machine. The approach is evaluated on a standard, and large, visual character recognition database, showing that the proposed approach can attain and exceed the performance of techniques previously reported in the literature.
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017
This article proposes an original approach to understanding the performance of large-dimensional neural networks. In this preliminary study, we study a single-hidden-layer feed-forward network with random input connections (also called an extreme learning machine) which performs a simple regression task. By means of a new random matrix result, we prove that, as the size and cardinality of the input data and the number of neurons grow large, the network performance is asymptotically deterministic. This entails a better comprehension of the effects of the hyper-parameters (activation function, number of neurons, etc.) in this simple setting, thereby paving the way to the harnessing of more involved structures.
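A small Monte-Carlo experiment, separate from the paper's random matrix analysis, can illustrate what "asymptotically deterministic" means in practice: as the numbers of samples and random neurons grow proportionally, the test error fluctuates less and less across independent random draws. All quantities below are illustrative.

```python
# Illustration only: measure how much the test MSE of a random-feature
# regression varies across independent random draws as the problem size grows.
import numpy as np

def test_mse(n, N, d=10, lam=1e-2, seed=0):
    r = np.random.default_rng(seed)
    Xtr, Xte = r.standard_normal((n, d)), r.standard_normal((n, d))
    f = lambda X: np.sin(X[:, 0]) + 0.5 * X[:, 1]
    ytr, yte = f(Xtr), f(Xte)
    W, b = r.standard_normal((d, N)), r.standard_normal(N)   # random hidden layer
    Htr, Hte = np.tanh(Xtr @ W + b), np.tanh(Xte @ W + b)
    beta = np.linalg.solve(Htr.T @ Htr + lam * np.eye(N), Htr.T @ ytr)
    return np.mean((yte - Hte @ beta) ** 2)

for scale in (1, 4, 16):
    errs = [test_mse(100 * scale, 50 * scale, seed=s) for s in range(20)]
    print(f"n={100 * scale:5d}, N={50 * scale:4d}:  mean MSE={np.mean(errs):.4f}, "
          f"std across draws={np.std(errs):.5f}")
```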
2008
In the days when Sussman was a novice, Minsky once came to him as he sat hacking at the PDP-6.