Max Robotics — Global Service Robot Directory + US Certification

About

Sergey Levine is a leading researcher at the intersection of deep reinforcement learning, robot learning, and decision-making, whose work has fundamentally shaped how autonomous systems acquire complex behaviors. He is perhaps best known for co-developing Trust Region Policy Optimization (TRPO), a landmark algorithm for reliable policy learning that has accumulated over 3,100 citations and remains a cornerstone of modern reinforcement learning. His contributions to foundational RL methodology extend further through Generalized Advantage Estimation and Soft Actor-Critic, together drawing nearly 3,700 citations, advancing the stability and sample efficiency of continuous control algorithms. Levine has been equally influential in bridging deep learning with physical robotics. His pioneering end-to-end visuomotor policy work demonstrated that robots could learn directly from raw pixel inputs without hand-engineered perception pipelines, earning over 3,100 combined citations. Projects like QT-Opt and Deep Visual Foresight pushed scalable, vision-based robotic manipulation into practical territory. More recently, his comprehensive treatment of offline reinforcement learning has helped define an emerging subfield critical for real-world deployment. Across more than 12,000 citations in his most-cited works alone, Levine's research continues to drive autonomous robots closer to genuine generalist intelligence.

Research Focus

Computer science310 · 33,307 citations

Artificial intelligence301 · 30,756 citations

Reinforcement learning178 · 22,716 citations

Robot192 · 20,955 citations

Machine learning156 · 14,031 citations

Human–computer interaction127 · 11,782 citations

Engineering125 · 11,718 citations

Mathematics64 · 10,155 citations

Artificial neural network32 · 9,799 citations

Deep learning31 · 7,959 citations

Control (management)38 · 7,218 citations

Mathematical optimization20 · 6,258 citations

Key Achievements

81

H-Index

314

Papers

33,469

Total Citations

107

Avg Citations/Paper

🏆 Most Cited Paper

Trust Region Policy Optimization

3,141 citations · 2015

📈 Most Prolific Year: 2019 (41 Papers)

🤝 Key Collaborators: 621

🏛 Institutions: University of California, Berkeley, Berkeley College, Intel (United States), University of California System, Google (United States), Machine Intelligence Research Institute

Top Papers

1
Trust Region Policy Optimization
3,141 citations · 2015
2
Soft Actor-Critic Algorithms and Applications
1,952 citations · 2018
3
High-Dimensional Continuous Control Using Generalized Advantage Estimation
1,750 citations · 2015
4
End-to-end training of deep visuomotor policies
1,715 citations · 2016
5
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates
1,452 citations · 2017
6
End-to-End Training of Deep Visuomotor Policies
1,399 citations · 2015
7
DeepMimic
802 citations · 2018
8
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
795 citations · 2020
9
Deep visual foresight for planning robot motion
627 citations · 2017
10
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
575 citations · 2018

Key Collaborators

Contact & Links

Available for collaboration