Welcome to the Deep Learning Efficiency Research (DLER) team at NVIDIA Research led by Dr. Pavlo Molchanov.
Founded in 2023, we are a vibrant and dynamic research lab. We operate under the umbrella of the Learning and Perception Research (LPR) group, led by Dr. Jan Kautz.
Our mission is to drive advancements in the efficiency of deep learning technologies. In the hardware layer, we focus on reducing memory footprint, minimizing inference latency, and lowering energy consumption of models. In the software layer, we prioritize the development of efficient small models and the design of reliable agentic systems.
Currently, we study:
We solicit applications from exceptional deep learning researchers. Explore the opportunities at the NVIDIA Careers website or reach out directly to our team members via email.
May 2025
Members of our team are attending ICLR’25. We are presenting 5 papers, 1 workshop, and joining a few panel discussions.
April 2025
5 of our papers we accepted to CVPR’25.
January 2025
Our team’s contributions to Llama Nemotron, RADIO, and VILA models were featured in the GTC keynote.
December 2024
Members of our team are attending NeurIPS’24. We are presenting 3 papers and 1 workshop.
August 2024
Members of our team are attending ACL’24 and presenting 99% Conditionally Sparse Language Modelling.
June 2024
Congratulations to Shizhe Diao for winning the Best Demo Paper Award and Outstanding Paper award at NAACL 2024.
June 2024
Shizhe Diao joins the group. Welcome!
June 2024
Members of our team are attending CVPR’24 and presenting RADIO and VILA.
May 2024
Flextron and DoRA are accepted to ICML’24 for oral presentations.