Quantile Option Architecture (QUOTA)
On this paper, we suggest the Quantile Possibility Structure (QUOTA) for exploration primarily based on latest advances in distributional reinforcement studying (RL). In QUOTA, resolution making relies on quantiles of a worth distribution, not solely the imply. QUOTA supplies a brand new dimension for exploration through making use of each optimism and pessimism of a worth distribution. We show the efficiency benefit of QUOTA in each difficult video video games and bodily robotic simulators. …
Adversarial Multimedia Recommendation (AMR)
With the prevalence of multimedia content material on the Internet, growing recommender options that may successfully leverage the wealthy sign in multimedia knowledge is in pressing want. Owing to the success of deep neural networks in illustration studying, latest advance on multimedia advice has largely centered on exploring deep studying strategies to enhance the advice accuracy. Up to now, nonetheless, there was little effort to analyze the robustness of multimedia illustration and its influence on the efficiency of multimedia advice. On this paper, we make clear the robustness of multimedia recommender system. Utilizing the state-of-the-art advice framework and deep picture options, we show that the general system shouldn’t be sturdy, such {that a} small (however purposeful) perturbation on the enter picture will severely lower the advice accuracy. This means the potential weak point of multimedia recommender system in predicting person desire, and extra importantly, the potential of enchancment by enhancing its robustness. To this finish, we suggest a novel answer named Adversarial Multimedia Advice (AMR), which may result in a extra sturdy multimedia recommender mannequin through the use of adversarial studying. The thought is to coach the mannequin to defend an adversary, which provides perturbations to the goal picture with the aim of reducing the mannequin’s accuracy. We conduct experiments on two consultant multimedia advice duties, specifically, picture advice and visually-aware product advice. Intensive outcomes confirm the optimistic impact of adversarial studying and show the effectiveness of our AMR technique. Supply codes can be found in https://…/AMR. …
Algebraic Subspace Clustering (ASC)
Algebraic Subspace Clustering (ASC) is an easy and stylish technique primarily based on polynomial becoming and differentiation for clustering noiseless knowledge drawn from an arbitrary union of subspaces. In apply, nonetheless, ASC is restricted to equi-dimensional subspaces as a result of the estimation of the subspace dimension through algebraic strategies is delicate to noise. This paper proposes a brand new ASC algorithm that may deal with noisy knowledge drawn from subspaces of arbitrary dimensions. The important thing concepts are (1) to assemble, at every level, a reducing sequence of subspaces containing the subspace passing by means of that time; (2) to make use of the distances from another level to every subspace within the sequence to assemble a subspace clustering affinity, which is superior to various affinities each in principle and in apply. Experiments on the Hopkins 155 dataset show the prevalence of the proposed technique with respect to sparse and low rank subspace clustering strategies. …
RedNet
Indoor semantic segmentation has all the time been a tough job in laptop imaginative and prescient. On this paper, we suggest an RGB-D residual encoder-decoder structure, named RedNet, for indoor RGB-D semantic segmentation. In RedNet, the residual module is utilized to each the encoder and decoder as the essential constructing block, and the skip-connection is used to bypass the spatial function between the encoder and decoder. As a way to incorporate the depth data of the scene, a fusion construction is constructed, which makes inference on RGB picture and depth picture individually, and fuses their options over a number of layers. As a way to effectively optimize the community’s parameters, we suggest a `pyramid supervision’ coaching scheme, which applies supervised studying over totally different layers within the decoder, to deal with the issue of gradients vanishing. Experiment outcomes present that the proposed RedNet(ResNet-50) achieves a state-of-the-art mIoU accuracy of 47.8% on the SUN RGB-D benchmark dataset. …