Overcoming catastrophic forgetting in neural networks J Kirkpatrick, R Pascanu, N Rabinowitz, J Veness, G Desjardins, AA Rusu, ... Proceedings of the national academy of sciences 114 (13), 3521-3526, 2017 | 2118 | 2017 |

Theano: a CPU and GPU math expression compiler J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, ... Proceedings of the Python for scientific computing conference (SciPy) 4 (3), 1-7, 2010 | 1884 | 2010 |

Progressive neural networks AA Rusu, NC Rabinowitz, G Desjardins, H Soyer, J Kirkpatrick, ... arXiv preprint arXiv:1606.04671, 2016 | 1103 | 2016 |

Theano: A CPU and GPU math compiler in Python J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, ... Proc. 9th Python in Science Conf 1, 3-10, 2010 | 741 | 2010 |

Theano: A Python framework for fast computation of mathematical expressions R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ... arXiv e-prints, arXiv: 1605.02688, 2016 | 645 | 2016 |

Understanding disentangling in β-VAE CP Burgess, I Higgins, A Pal, L Matthey, N Watters, G Desjardins, ... arXiv preprint arXiv:1804.03599, 2018 | 425 | 2018 |

Policy distillation AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ... arXiv preprint arXiv:1511.06295, 2015 | 362 | 2015 |

Combining modality specific deep neural networks for emotion recognition in video SE Kahou, C Pal, X Bouthillier, P Froumenty, Ç Gülçehre, R Memisevic, ... Proceedings of the 15th ACM on International conference on multimodal …, 2013 | 314 | 2013 |

Theano: Deep learning on GPUs with Python J Bergstra, F Bastien, O Breuleux, P Lamblin, R Pascanu, O Delalleau, ... NIPS 2011, BigLearning Workshop, Granada, Spain 3, 1-48, 2011 | 302 | 2011 |

Unsupervised and transfer learning challenge: a deep learning approach GMY Dauphin, X Glorot, S Rifai, Y Bengio, I Goodfellow, E Lavoie, ... Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 97-110, 2012 | 218 | 2012 |

Natural neural networks G Desjardins, K Simonyan, R Pascanu, K Kavukcuoglu arXiv preprint arXiv:1507.00210, 2015 | 167 | 2015 |

Theano: A Python framework for fast computation of mathematical expressions TTD Team, R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, ... arXiv preprint arXiv:1605.02688, 2016 | 166 | 2016 |

Tempered Markov chain Monte Carlo for training of restricted Boltzmann machines G Desjardins, A Courville, Y Bengio, P Vincent, O Delalleau Proceedings of the thirteenth international conference on artificial …, 2010 | 129 | 2010 |

Parallel tempering for training of restricted Boltzmann machines G Desjardins, A Courville, Y Bengio, P Vincent, O Delalleau Proceedings of the thirteenth international conference on artificial …, 2010 | 92 | 2010 |

Steerable Playlist Generation by Learning Song Similarity from Radio Station Playlists. F Maillet, D Eck, G Desjardins, P Lamere ISMIR, 345-350, 2009 | 86 | 2009 |

Disentangling factors of variation via generative entangling G Desjardins, A Courville, Y Bengio arXiv preprint arXiv:1210.5474, 2012 | 78 | 2012 |

Quadratic polynomials learn better image features J Bergstra, G Desjardins, P Lamblin, Y Bengio Technical report, 1337, 2009 | 72 | 2009 |

Empirical evaluation of convolutional RBMs for vision G Desjardins, Y Bengio Technical Report 1327, Département d’Informatique et de Recherche …, 2008 | 66 | 2008 |

Information asymmetry in KL-regularized RL A Galashov, SM Jayakumar, L Hasenclever, D Tirumala, J Schwarz, ... arXiv preprint arXiv:1905.01240, 2019 | 38 | 2019 |

Adaptive parallel tempering for stochastic maximum likelihood learning of RBMs G Desjardins, A Courville, Y Bengio arXiv preprint arXiv:1012.3476, 2010 | 36 | 2010 |