Uncategorized ICLR 2019 most cited papers

ICLR 2019 most cited papers

ludovic.arnold September 26, 2019 September 27, 2019Uncategorized

Below are ICLR 2019 papers ranked by number of citations. The citation count was obtained by hand from Google Scholar on September 25, 2019 and may be outdated or subject to human error.

Oral presentations

Rank	Cited by	Paper name
0	265	Large Scale GAN Training for High Fidelity Natural Image Synthesis
1	85	ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness
2	84	The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
3	68	FFJORD: Free-Form Continuous Dynamics for Scalable Reversible Generative Models
4	66	Learning deep representations by mutual information estimation and maximization
5	59	How Powerful are Graph Neural Networks?
6	29	Pay Less Attention with Lightweight and Dynamic Convolutions
7	19	Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
8	19	The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
9	18	Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware
10	18	Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset
11	18	Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
12	13	BA-Net: Dense Bundle Adjustment Networks
13	12	Temporal Difference Variational Auto-Encoder
14	11	Deterministic Variational Inference for Robust Bayesian Neural Networks
15	6	Learning Protein Structure with a Differentiable Simulator
16	5	Smoothing the Geometry of Probabilistic Box Embeddings
17	5	KnockoffGAN: Generating Knockoffs for Feature Selection using Generative Adversarial Networks
18	5	Learning Robust Representations by Projecting Superficial Statistics Out
19	4	Meta-Learning Update Rules for Unsupervised Representation Learning
20	4	A Unified Theory of Early Visual Representations from Retina to Cortex through Anatomically Constrained Deep CNNs
21	2	Transferring Knowledge across Learning Processes
22	1	Learning to Remember More with Less Memorization
23	1	On Random Deep Weight-Tied Autoencoders: Exact Asymptotic Analysis, Phase Transitions, and Implications to Training

Posters

Rank	Cited by	Paper name
0	265	Large Scale GAN Training for High Fidelity Natural Image Synthesis
1	210	DARTS: Differentiable Architecture Search
2	135	GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
3	108	Gradient Descent Provably Optimizes Over-parameterized Neural Networks
4	90	Robustness May Be at Odds with Accuracy
5	86	ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness
6	85	The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
7	81	Diversity is All You Need: Learning Skills without a Reward Function
8	81	Large-Scale Study of Curiosity-Driven Learning
9	79	Evaluating Robustness of Neural Networks with Mixed Integer Programming
10	72	The relativistic discriminator: a key element missing from standard GAN
11	72	ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
12	68	FFJORD: Free-Form Continuous Dynamics for Scalable Reversible Generative Models
13	66	Learning deep representations by mutual information estimation and maximization
14	63	Universal Transformers
15	61	Meta-Learning with Latent Embedding Optimization
16	60	ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
17	59	How Powerful are Graph Neural Networks?
18	53	Learning a SAT Solver from Single-Bit Supervision
19	50	Rethinking the Value of Network Pruning
20	50	Exploration by random network distillation
21	44	Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
22	44	SNAS: stochastic neural architecture search
23	37	Do Deep Generative Models Know What They Don’t Know?
24	36	What do you learn from context? Probing for sentence structure in contextualized word representations
25	36	Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet
26	34	A Universal Music Translation Network
27	34	Are adversarial examples inevitable?
28	32	A Variational Inequality Perspective on Generative Adversarial Networks
29	30	On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
30	29	Pay Less Attention with Lightweight and Dynamic Convolutions
31	29	LEARNING TO PROPAGATE LABELS: TRANSDUCTIVE PROPAGATION NETWORK FOR FEW-SHOT LEARNING
32	29	Towards the first adversarially robust neural network model on MNIST
33	28	Deep Graph Infomax
34	27	Analyzing Inverse Problems with Invertible Neural Networks
35	27	A Convergence Analysis of Gradient Descent for Deep Linear Neural Networks
36	27	Meta-learning with differentiable closed-form solvers
37	26	Decoupled Weight Decay Regularization
38	26	Local SGD Converges Fast and Communicates Little
39	25	Adversarial Audio Synthesis
40	24	Wizard of Wikipedia: Knowledge-Powered Conversational Agents
41	24	Adaptive Gradient Methods with Dynamic Bound of Learning Rate
42	24	Deep Convolutional Networks as shallow Gaussian Processes
43	23	Sample Efficient Adaptive Text-to-Speech
44	23	Training for Faster Adversarial Robustness Verification via Inducing ReLU Stability
45	23	Efficient Multi-Objective Neural Architecture Search via Lamarckian Evolution
46	22	Adaptive Input Representations for Neural Language Modeling
47	22	Deep Anomaly Detection with Outlier Exposure
48	22	Episodic Curiosity through Reachability
49	22	Differentiable Learning-to-Normalize via Switchable Normalization
50	21	Query-Efficient Hard-label Black-box Attack: An Optimization-based Approach
51	20	Hierarchical Generative Modeling for Controllable Speech Synthesis
52	20	Hyperbolic Attention Networks
53	20	Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
54	20	Prior Convictions: Black-box Adversarial Attacks with Bandits and Priors
55	19	Lagging Inference Networks and Posterior Collapse in Variational Autoencoders
56	19	The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
57	19	Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset
58	18	Attentive Neural Processes
59	18	Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency
60	18	Slimmable Neural Networks
61	18	Gradient descent aligns the layers of deep linear networks
62	18	How to train your MAML
63	18	Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware
64	18	The Singular Values of Convolutional Layers
65	18	GENERATING HIGH FIDELITY IMAGES WITH SUBSCALE PIXEL NETWORKS AND MULTIDIMENSIONAL UPSCALING
66	17	Recurrent Experience Replay in Distributed Reinforcement Learning
67	17	SNIP: SINGLE-SHOT NETWORK PRUNING BASED ON CONNECTION SENSITIVITY
68	17	Unsupervised Learning via Meta-Learning
69	17	code2seq: Generating Sequences from Structured Representations of Code
70	17	GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
71	16	Efficient Lifelong Learning with A-GEM
72	16	Deep Decoder: Concise Image Representations from Untrained Non-convolutional Networks
73	15	FlowQA: Grasping Flow in History for Conversational Machine Comprehension
74	15	Three Mechanisms of Weight Decay Regularization
75	15	Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile
76	15	Universal Stagewise Learning for Non-Convex Problems with Convergence on Averaged Solutions
77	15	A Mean Field Theory of Batch Normalization
78	15	Deep Online Learning Via Meta-Learning: Continual Adaptation for Model-Based RL
79	15	RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space
80	15	Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
81	15	Time-Agnostic Prediction: Predicting Predictable Video Frames
82	14	Attention, Learn to Solve Routing Problems!
83	14	GamePad: A Learning Environment for Theorem Proving
84	14	Learning Factorized Multimodal Representations
85	14	Quaternion Recurrent Neural Networks
86	14	Fixup Initialization: Residual Learning Without Normalization
87	14	Meta-Learning Probabilistic Inference for Prediction
88	14	Graph HyperNetworks for Neural Architecture Search
89	14	Defensive Quantization: When Efficiency Meets Robustness
90	14	Adversarial Attacks on Graph Neural Networks via Meta Learning
91	14	Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer
92	14	Non-vacuous Generalization Bounds at the ImageNet Scale: a PAC-Bayesian Compression Approach
93	14	Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes
94	13	Janossy Pooling: Learning Deep Permutation-Invariant Functions for Variable-Size Inputs
95	13	No Training Required: Exploring Random Encoders for Sentence Classification
96	13	DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder
97	13	Critical Learning Periods in Deep Networks
98	13	Excessive Invariance Causes Adversarial Vulnerability
99	13	On Self Modulation for Generative Adversarial Networks
100	13	L2-Nonexpansive Neural Networks
101	13	Recall Traces: Backtracking Models for Efficient Reinforcement Learning
102	13	BA-Net: Dense Bundle Adjustment Networks
103	12	Temporal Difference Variational Auto-Encoder
104	12	Diagnosing and Enhancing VAE Models
105	12	TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
106	12	Dynamically Unfolding Recurrent Restorer: A Moving Endpoint Control Method for Image Restoration
107	12	Identifying and Controlling Important Neurons in Neural Machine Translation
108	12	Aggregated Momentum: Stability Through Passive Damping
109	12	The role of over-parametrization in generalization of neural networks
110	12	There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
111	12	Small nonlinearities in activation functions create bad local minima in neural networks
112	12	Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control
113	12	Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
114	12	Improving the Generalization of Adversarial Training with Domain Adaptation
115	12	Supervised Community Detection with Line Graph Neural Networks
116	12	Discriminator Rejection Sampling
117	12	GANSynth: Adversarial Neural Audio Synthesis
118	11	Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives
119	11	Sliced Wasserstein Auto-Encoders
120	11	Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning
121	11	Unsupervised Hyper-alignment for Multilingual Word Embeddings
122	11	Riemannian Adaptive Optimization Methods
123	11	Hindsight policy gradients
124	11	CEM-RL: Combining evolutionary and gradient-based methods for policy search
125	11	Emergent Coordination Through Competition
126	11	Deterministic Variational Inference for Robust Bayesian Neural Networks
127	11	Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
128	11	Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering
129	11	Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network
130	11	Approximability of Discriminators Implies Diversity in GANs
131	11	Generative Code Modeling with Graphs
132	11	PeerNets: Exploiting Peer Wisdom Against Adversarial Attacks
133	11	Visual Semantic Navigation using Scene Priors
134	11	Residual Non-local Attention Networks for Image Restoration
135	11	Diffusion Scattering Transforms on Graphs
136	10	Unsupervised Control Through Non-Parametric Discriminative Rewards
137	10	FUNCTIONAL VARIATIONAL BAYESIAN NEURAL NETWORKS
138	10	Trellis Networks for Sequence Modeling
139	10	Poincare Glove: Hyperbolic Word Embeddings
140	10	Towards Understanding Regularization in Batch Normalization
141	10	Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference
142	10	Reward Constrained Policy Optimization
143	10	L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data
144	10	Invariant and Equivariant Graph Networks
145	10	Improving Generalization and Stability of Generative Adversarial Networks
146	10	InstaGAN: Instance-aware Image-to-Image Translation
147	9	Stable Recurrent Models
148	9	From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following
149	9	Learning to Represent Edits
150	9	Hierarchical interpretations for neural network predictions
151	9	Adaptivity of deep ReLU network for learning in Besov and mixed smooth Besov spaces: optimal rate and curse of dimensionality
152	9	Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
153	9	Relaxed Quantization for Discretized Neural Networks
154	9	Optimal Completion Distillation for Sequence Learning
155	9	Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
156	9	Adversarial Imitation via Variational Inverse Reinforcement Learning
157	9	Predict then Propagate: Graph Neural Networks meet Personalized PageRank
158	9	Learning to Infer and Execute 3D Shape Programs
159	9	Context-adaptive Entropy Model for End-to-end Optimized Image Compression
160	9	Biologically-Plausible Learning Algorithms Can Scale to Large Datasets
161	9	Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
162	8	Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids
163	8	Deep learning generalizes because the parameter-function map is biased towards simple functions
164	8	On the loss landscape of a class of deep neural networks with no bad local valleys
165	8	Stable Opponent Shaping in Differentiable Games
166	8	Detecting Egregious Responses in Neural Sequence-to-sequence Models
167	8	Differentiable Perturb-and-Parse: Semi-Supervised Parsing with a Structured Variational Autoencoder
168	8	ALISTA: Analytic Weights Are As Good As Learned Weights in LISTA
169	8	An analytic theory of generalization dynamics and transfer learning in deep linear networks
170	8	Automatically Composing Representation Transformations as a Means for Generalization
171	8	An Empirical Study of Example Forgetting during Deep Neural Network Learning
172	8	Hierarchical Visuomotor Control of Humanoids
173	8	Generalizable Adversarial Training via Spectral Normalization
174	8	Deep reinforcement learning with relational inductive biases
175	8	Structured Adversarial Attack: Towards General Implementation and Better Interpretability
176	8	Capsule Graph Neural Network
177	8	Whitening and Coloring Batch Transform for GANs
178	8	Dynamic Channel Pruning: Feature Boosting and Suppression
179	8	On the Minimal Supervision for Training Any Binary Classifier from Only Unlabeled Data
180	8	LanczosNet: Multi-Scale Deep Graph Convolutional Networks
181	8	Explaining Image Classifiers by Counterfactual Generation
182	7	Amortized Bayesian Meta-Learning
183	7	Preventing Posterior Collapse with delta-VAEs
184	7	Music Transformer: Generating Music with Long-Term Structure
185	7	AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks
186	7	Multilingual Neural Machine Translation with Knowledge Distillation
187	7	ProxQuant: Quantized Neural Networks via Proximal Operators
188	7	Regularized Learning for Domain Adaptation under Label Shifts
189	7	Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning
190	7	Predicting the Generalization Gap in Deep Networks with Margin Distributions
191	7	Selfless Sequential Learning
192	7	Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
193	7	Neural Probabilistic Motor Primitives for Humanoid Control
194	7	Meta-Learning For Stochastic Gradient MCMC
195	7	Reasoning About Physical Interactions with Object-Oriented Prediction and Planning
196	7	Soft Q-Learning with Mutual-Information Regularization
197	7	DyRep: Learning Representations over Dynamic Graphs
198	7	Building Dynamic Knowledge Graphs from Text using Machine Reading Comprehension
199	7	The Unusual Effectiveness of Averaging in GAN Training
200	7	SOM-VAE: Interpretable Discrete Representation Learning on Time Series
201	7	Beyond Pixel Norm-Balls: Parametric Adversaries using an Analytically Differentiable Renderer
202	7	Spherical CNNs on Unstructured Grids
203	7	Disjoint Mapping Network for Cross-modal Matching of Voices and Faces
204	7	Generating Multiple Objects at Spatially Distinct Locations
205	6	Spreading vectors for similarity search
206	6	Learning Multimodal Graph-to-Graph Translation for Molecule Optimization
207	6	Structured Neural Summarization
208	6	Multiple-Attribute Text Rewriting
209	6	InfoBot: Transfer and Exploration via the Information Bottleneck
210	6	On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
211	6	Learning Self-Imitating Diverse Policies
212	6	Stochastic Gradient/Mirror Descent: Minimax Optimality and Implicit Regularization
213	6	Fluctuation-dissipation relations for stochastic gradient descent
214	6	Efficient Training on Very Large Corpora via Gramian Estimation
215	6	ProMP: Proximal Meta-Policy Search
216	6	Analysing Mathematical Reasoning Abilities of Neural Models
217	6	Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
218	6	Learning Multi-Level Hierarchies with Hindsight
219	6	A comprehensive, application-oriented study of catastrophic forgetting in DNNs
220	6	M^3RL: Mind-aware Multi-agent Management Reinforcement Learning
221	6	Neural Logic Machines
222	6	LayoutGAN: Generating Graphic Layouts with Wireframe Discriminators
223	6	SPIGAN: Privileged Adversarial Learning from Simulation
224	6	Verification of Non-Linear Specifications for Neural Networks
225	6	Learning Protein Structure with a Differentiable Simulator
226	6	ADef: an Iterative Algorithm to Construct Adversarial Deformations
227	5	Beyond Greedy Ranking: Slate Optimization via List-CVAE
228	5	Variance Networks: When Expectation Does Not Meet Your Expectations
229	5	Modeling Uncertainty with Hedged Instance Embeddings
230	5	How Important is a Neuron
231	5	Optimal Transport Maps For Distribution Preserving Operations on Latent Spaces of Generative Models
232	5	Spectral Inference Networks: Unifying Deep and Spectral Learning
233	5	Learning to Understand Goal Specifications by Modelling Reward
234	5	Imposing Category Trees Onto Word-Embeddings Using A Geometric Construction
235	5	Don’t Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors
236	5	Multilingual Neural Machine Translation With Soft Decoupled Encoding
237	5	Smoothing the Geometry of Probabilistic Box Embeddings
238	5	Tree-Structured Recurrent Switching Linear Dynamical Systems for Multi-Scale Modeling
239	5	Neural Speed Reading with Structural-Jump-LSTM
240	5	Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
241	5	Subgradient Descent Learns Orthogonal Dictionaries
242	5	Learning Two-layer Neural Networks with Symmetric Inputs
243	5	signSGD with Majority Vote is Communication Efficient and Fault Tolerant
244	5	SGD Converges to Global Minimum in Deep Learning via Star-convex Path
245	5	Deterministic PAC-Bayesian generalization bounds for deep networks via generalizing noise-resilience
246	5	Learning to Learn with Conditional Class Dependencies
247	5	Policy Transfer with Strategy Optimization
248	5	Learning to Schedule Communication in Multi-agent Reinforcement Learning
249	5	Measuring and regularizing networks in function space
250	5	Learning Exploration Policies for Navigation
251	5	Relational Forward Models for Multi-Agent Learning
252	5	Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
253	5	KnockoffGAN: Generating Knockoffs for Feature Selection using Generative Adversarial Networks
254	5	Combinatorial Attacks on Binarized Neural Networks
255	5	Stochastic Optimization of Sorting Networks via Continuous Relaxations
256	5	On the Sensitivity of Adversarial Robustness to Input Data Distributions
257	5	RelGAN: Relational Generative Adversarial Networks for Text Generation
258	5	Learning Robust Representations by Projecting Superficial Statistics Out
259	5	PATE-GAN: Generating Synthetic Data with Differential Privacy Guarantees
260	5	Bayesian Prediction of Future Street Scenes using Synthetic Likelihoods
261	5	Multi-class classification without multi-class labels
262	5	DPSNet: End-to-end Deep Plane Sweep Stereo
263	5	Emerging Disentanglement in Auto-Encoder Based Unsupervised Image Content Transfer
264	5	Learning To Simulate
265	5	Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks
266	5	Random mesh projectors for inverse problems
267	4	MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders
268	4	Meta-Learning Update Rules for Unsupervised Representation Learning
269	4	Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers
270	4	The Deep Weight Prior
271	4	LEARNING FACTORIZED REPRESENTATIONS FOR OPEN-SET DOMAIN ADAPTATION
272	4	Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology
273	4	Learning Neural PDE Solvers with Convergence Guarantees
274	4	Unsupervised Domain Adaptation for Distance Metric Learning
275	4	ARM: Augment-REINFORCE-Merge Gradient for Stochastic Binary Networks
276	4	Generative Question Answering: Learning to Answer the Whole Question
277	4	Stochastic Prediction of Multi-Agent Interactions from Partial Observations
278	4	Learning Programmatically Structured Representations with Perceptor Gradients
279	4	Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
280	4	Multi-Agent Dual Learning
281	4	Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs
282	4	Learning protein sequence embeddings using information from structure
283	4	Harmonic Unpaired Image-to-image Translation
284	4	Characterizing Audio Adversarial Examples Using Temporal Dependency
285	4	Systematic Generalization: What Is Required and Can It Be Learned?
286	4	On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length
287	4	Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets
288	4	Caveats for information bottleneck in deterministic scenarios
289	4	A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
290	4	AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods
291	4	Opportunistic Learning: Budgeted Cost-Sensitive Learning from Data Streams
292	4	Information-Directed Exploration for Deep Reinforcement Learning
293	4	Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning
294	4	Variance Reduction for Reinforcement Learning in Input-Driven Environments
295	4	Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
296	4	Probabilistic Planning with Sequential Monte Carlo methods
297	4	The Limitations of Adversarial Training and the Blind-Spot Attack
298	4	Large Scale Graph Learning From Smooth Signals
299	4	Feature Intertwiner for Object Detection
300	4	StrokeNet: A Neural Painting Environment
301	4	Eidetic 3D LSTM: A Model for Video Prediction and Beyond
302	4	Learning Localized Generative Models for 3D Point Clouds via Graph Convolution
303	4	Generating Multi-Agent Trajectories using Programmatic Weak Supervision
304	4	Diversity-Sensitive Conditional Generative Adversarial Networks
305	4	A Unified Theory of Early Visual Representations from Retina to Cortex through Anatomically Constrained Deep CNNs
306	3	Efficiently testing local optimality and escaping saddles for ReLU networks
307	3	Bayesian Policy Optimization for Model Uncertainty
308	3	Variational Autoencoder with Arbitrary Conditioning
309	3	DHER: Hindsight Experience Replay for Dynamic Goals
310	3	Dimensionality Reduction for Representing the Knowledge of Probabilistic Models
311	3	Learning-Based Frequency Estimation Algorithms
312	3	Practical lossless compression with latent variables using bits back coding
313	3	Label super-resolution networks
314	3	Measuring Compositionality in Representation Learning
315	3	Distribution-Interpolation Trade off in Generative Models
316	3	Improving Sequence-to-Sequence Learning via Optimal Transport
317	3	Guiding Policies with Language via Meta-Learning
318	3	Learning to Design RNA
319	3	Learning what and where to attend
320	3	Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
321	3	Learning Finite State Representations of Recurrent Policy Networks
322	3	Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
323	3	Theoretical Analysis of Auto Rate-Tuning by Batch Normalization
324	3	Towards Robust, Locally Linear Deep Networks
325	3	Quasi-hyperbolic momentum and Adam for deep learning
326	3	Double Viterbi: Weight Encoding for High Compression Ratio and Fast On-Chip Reconstruction for Deep Neural Network
327	3	Optimal Control Via Neural Networks: A Convex Approach
328	3	ANYTIME MINIBATCH: EXPLOITING STRAGGLERS IN ONLINE DISTRIBUTED OPTIMIZATION
329	3	A2BCD: Asynchronous Acceleration with Optimal Complexity
330	3	Learning to Make Analogies by Contrasting Abstract Relational Structure
331	3	Adaptive Posterior Learning: few-shot learning with a surprise-based memory module
332	3	AutoLoss: Learning Discrete Schedule for Alternate Optimization
333	3	Universal Successor Features Approximators
334	3	Information asymmetry in KL-regularized RL
335	3	Learning Actionable Representations with Goal Conditioned Policies
336	3	Two-Timescale Networks for Nonlinear Value Function Approximation
337	3	Sample Efficient Imitation Learning for Continuous Control
338	3	A Direct Approach to Robust Deep Learning Using Adversarial Networks
339	3	Scalable Unbalanced Optimal Transport using Generative Adversarial Networks
340	3	Graph Wavelet Neural Network
341	3	CAMOU: Learning Physical Vehicle Camouflages to Adversarially Attack Detectors in the Wild
342	3	Learnable Embedding Space for Efficient Neural Architecture Compression
343	3	Learning To Solve Circuit-SAT: An Unsupervised Differentiable Approach
344	3	Towards GAN Benchmarks Which Require Generalization
345	3	Learning Mixed-Curvature Representations in Product Spaces
346	3	Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation
347	3	Bounce and Learn: Modeling Scene Dynamics with Real-World Bounces
348	3	Learning to Describe Scenes with Programs
349	3	DELTA: DEEP LEARNING TRANSFER USING FEATURE MAP WITH ATTENTION FOR CONVOLUTIONAL NETWORKS
350	3	Mode Normalization
351	3	A rotation-equivariant convolutional neural network model of primary visual cortex
352	3	Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
353	3	Human-level Protein Localization with Convolutional Neural Networks
354	3	Value Propagation Networks
355	3	Diversity and Depth in Per-Example Routing Models
356	2	Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks
357	2	Kernel Change-point Detection with Auxiliary Deep Generative Models
358	2	Learning a Meta-Solver for Syntax-Guided Program Synthesis
359	2	On the Turing Completeness of Modern Neural Network Architectures
360	2	Active Learning with Partial Feedback
361	2	Toward Understanding the Impact of Staleness in Distributed Machine Learning
362	2	Feature-Wise Bias Amplification
363	2	Transferring Knowledge across Learning Processes
364	2	Deep, Skinny Neural Networks are not Universal Approximators
365	2	Interpolation-Prediction Networks for Irregularly Sampled Time Series
366	2	Learning Representations of Sets through Optimized Permutations
367	2	Variational Bayesian Phylogenetic Inference
368	2	BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
369	2	Learning to Screen for Fast Softmax Inference on Large Vocabulary Neural Networks
370	2	Posterior Attention Models for Sequence to Sequence Learning
371	2	Learning Implicitly Recurrent CNNs Through Parameter Sharing
372	2	A Generative Model For Electron Paths
373	2	RNNs implicitly implement tensor-product representations
374	2	h-detach: Modifying the LSTM Gradient Towards Better Optimization
375	2	Adaptive Estimators Show Information Compression in Deep Neural Networks
376	2	Learning sparse relational transition models
377	2	From Hard to Soft: Understanding Deep Network Nonlinearities via Vector Quantization and Statistical Inference
378	2	ACCELERATING NONCONVEX LEARNING VIA REPLICA EXCHANGE LANGEVIN DIFFUSION
379	2	Per-Tensor Fixed-Point Quantization of the Back-Propagation Algorithm
380	2	Deep Layers as Stochastic Solvers
381	2	Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
382	2	An Empirical study of Binary Neural Networks’ Optimisation
383	2	Deep Frank-Wolfe For Neural Network Optimization
384	2	G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
385	2	Contingency-Aware Exploration in Reinforcement Learning
386	2	Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization
387	2	The Laplacian in RL: Learning Representations with Efficient Approximations
388	2	Execution-Guided Neural Program Synthesis
389	2	Marginal Policy Gradients: A Unified Family of Estimators for Bounded Action Spaces with Applications
390	2	Modeling the Long Term Future in Model-Based Reinforcement Learning
391	2	Environment Probing Interaction Policies
392	2	Neural Program Repair by Jointly Learning to Localize and Repair
393	2	DISTRIBUTIONAL CONCAVITY REGULARIZATION FOR GANS
394	2	Dynamic Sparse Graph for Efficient Deep Learning
395	2	A Statistical Approach to Assessing Neural Network Robustness
396	2	Multi-Domain Adversarial Learning
397	2	Conditional Network Embeddings
398	2	signSGD via Zeroth-Order Oracle
399	2	Robust Conditional Generative Adversarial Networks
400	2	Deep Learning 3D Shapes Using Alt-az Anisotropic 2-Sphere Convolution
401	2	AD-VAT: An Asymmetric Dueling mechanism for learning Visual Active Tracking
402	2	Latent Convolutional Models
403	2	LeMoNADe: Learned Motif and Neuronal Assembly Detection in calcium imaging videos
404	2	STCN: Stochastic Temporal Convolutional Networks
405	2	Improving MMD-GAN Training with Repulsive Loss Function
406	1	Wasserstein Barycenter Model Ensembling
407	1	Learning Grid Cells as Vector Representation of Self-Position Coupled with Matrix Representation of Self-Motion
408	1	Generating Liquid Simulations with Deformation-aware Neural Networks
409	1	Efficient Augmentation via Data Subsampling
410	1	Generative predecessor models for sample-efficient imitation learning
411	1	Auxiliary Variational MCMC
412	1	Variational Autoencoders with Jointly Optimized Latent Dependency Structure
413	1	Function Space Particle Optimization for Bayesian Neural Networks
414	1	MARGINALIZED AVERAGE ATTENTIONAL NETWORK FOR WEAKLY-SUPERVISED LEARNING
415	1	Neural TTS Stylization with Adversarial and Collaborative Games
416	1	Representation Degeneration Problem in Training Natural Language Generation Models
417	1	Transfer Learning for Sequences via Learning to Collocate
418	1	Understanding Composition of Word Embeddings via Tensor Decomposition
419	1	Complement Objective Training
420	1	DOM-Q-NET: Grounded RL on Structured Language
421	1	Generalized Tensor Models for Recurrent Neural Networks
422	1	textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR
423	1	Kernel RNN Learning (KeRNL)
424	1	Minimum Divergence vs. Maximum Margin: an Empirical Comparison on Seq2Seq Models
425	1	Analysis of Quantized Models
426	1	A Kernel Random Matrix-Based Approach for Sparse PCA
427	1	DeepOBS: A Deep Learning Optimizer Benchmark Suite
428	1	Learning concise representations for regression by evolving networks of trees
429	1	Initialized Equilibrium Propagation for Backprop-Free Training
430	1	Minimal Random Code Learning: Getting Bits Back from Compressed Model Parameters
431	1	Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions
432	1	On Random Deep Weight-Tied Autoencoders: Exact Asymptotic Analysis, Phase Transitions, and Implications to Training
433	1	Overcoming Catastrophic Forgetting for Continual Learning via Model Adaptation
434	1	Preferences Implicit in the State of the World
435	1	Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
436	1	Solving the Rubik’s Cube with Approximate Policy Iteration
437	1	Towards Metamerism via Foveated Style Transfer
438	1	Learning to Navigate the Web
439	1	Knowledge Flow: Improve Upon Your Teachers
440	1	Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
441	1	Cost-Sensitive Robustness against Adversarial Examples
442	1	MisGAN: Learning from Incomplete Data with Generative Adversarial Networks
443	1	Don’t let your Discriminator be fooled
444	1	Learning to Remember More with Less Memorization
445	1	Boosting Robustness Certification of Neural Networks
446	1	Post Selection Inference with Incomplete Maximum Mean Discrepancy Estimator
447	1	RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks
448	1	ProbGAN: Towards Probabilistic GAN with Theoretical Guarantees
449	1	K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning
450	1	Neural network gradient-based learning of black-box function interfaces
451	1	Adversarial Domain Adaptation for Stable Brain-Machine Interfaces
452	1	Unsupervised Adversarial Image Reconstruction
453	0	Unsupervised Learning of the Set of Local Maxima
454	0	Feed-forward Propagation in Probabilistic Neural Networks with Categorical and Max Layers
455	0	Maximal Divergence Sequential Autoencoder for Binary Software Vulnerability Detection
456	0	Learning Procedural Abstractions and Evaluating Discrete Latent Temporal Structure
457	0	Information Theoretic lower bounds on negative log likelihood
458	0	Learning from Positive and Unlabeled Data with a Selection Bias
459	0	Integer Networks for Data Compression with Latent-Variable Models
460	0	Improving Differentiable Neural Computers Through Memory Masking, De-allocation, and Link Distribution Sharpness Control
461	0	A Max-Affine Spline Perspective of Recurrent Neural Networks
462	0	Discovery of Natural Language Concepts in Individual Units of CNNs
463	0	Learning Recurrent Binary/Ternary Weights
464	0	Large-Scale Answerer in Questioner’s Mind for Visual Dialog Question Generation
465	0	Variational Smoothing in Recurrent Neural Network Language Models
466	0	Top-Down Neural Model For Formulae
467	0	Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks
468	0	CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model
469	0	Sparse Dictionary Learning by Dynamical Neural Networks
470	0	Learning Embeddings into Entropic Wasserstein Spaces
471	0	Max-MIG: an Information Theoretic Approach for Joint Learning from Crowds
472	0	Preconditioner on Matrix Lie Group for SGD
473	0	NOODL: Provable Online Dictionary Learning and Sparse Coding
474	0	The Comparative Power of ReLU Networks and Polynomial Kernels in the Presence of Sparse Latent Structure
475	0	Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy
476	0	A Closer Look at Few-shot Classification
477	0	NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
478	0	Composing Complex Skills by Learning Transition Policies
479	0	Supervised Policy Update for Deep Reinforcement Learning
480	0	Synthetic Datasets for Neural Program Synthesis
481	0	A new dog learns old tricks: RL finds classic optimization algorithms
482	0	Neural Graph Evolution: Automatic Robot Design
483	0	Competitive experience replay
484	0	Visceral Machines: Risk-Aversion in Reinforcement Learning with Intrinsic Physiological Rewards
485	0	Learning Latent Superstructures in Variational Autoencoders for Deep Multidimensional Clustering
486	0	On Computation and Generalization of Generative Adversarial Networks under Spectrum Control
487	0	Adversarial Reprogramming of Neural Networks
488	0	INVASE: Instance-wise Variable Selection using Neural Networks
489	0	GO Gradient for Expectation-Based Objectives
490	0	Revealing interpretable object representations from human behavior
491	0	Learning what you can do before doing anything
492	0	A Data-Driven and Distributed Approach to Sparse Signal Representation and Recovery
493	0	Convolutional Neural Networks on Non-uniform Geometrical Signals Using Euclidean Spectral Transformation
494	0	Equi-normalization of Neural Networks
495	0	ROBUST ESTIMATION VIA GENERATIVE ADVERSARIAL NETWORKS
496	0	Unsupervised Discovery of Parts, Structure, and Dynamics
497	0	Harmonizing Maximum Likelihood with GANs for Multimodal Conditional Generation
498	0	Minimal Images in Deep Neural Networks: Fragile Object Recognition in Natural Images
499	0	Overcoming the Disentanglement vs Reconstruction Trade-off via Jacobian Supervision
500	0	Visual Reasoning by Progressive Module Networks

Available translations
- English
- Français