Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

Expertise Leaker Offers Us Greatest Look But on the iPhone 17e

February 13, 2026

Mazagan Seashore & Golf Resort Welcomes Ramadan with a Culinary Expertise Impressed by Moroccan Heritage

February 13, 2026

Methods to Align Giant Language Fashions with Human Preferences Utilizing Direct Desire Optimization, QLoRA, and Extremely-Suggestions

February 13, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • Expertise Leaker Offers Us Greatest Look But on the iPhone 17e
  • Mazagan Seashore & Golf Resort Welcomes Ramadan with a Culinary Expertise Impressed by Moroccan Heritage
  • Methods to Align Giant Language Fashions with Human Preferences Utilizing Direct Desire Optimization, QLoRA, and Extremely-Suggestions
  • Bell claims Telus blocked it from launching web in Western Canada
  • Intel Core Extremely Collection 3 chips carry “actual” gaming to skinny and lightweight laptops
  • Do you have to promote your Generac inventory?
  • AutoNavi Launches ABot-M0 and ABot-N0 Embodied Basis Fashions, Tops 10 International Benchmarks
  • Shares bounce for Chinese language AI start-up Zhipu after GLM-5 launch
Friday, February 13
NextTech NewsNextTech News
Home - Robotics & Automation - How can robots purchase expertise by interactions with the bodily world? An interview with Jiaheng Hu
Robotics & Automation

How can robots purchase expertise by interactions with the bodily world? An interview with Jiaheng Hu

NextTechBy NextTechFebruary 13, 2026No Comments6 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
How can robots purchase expertise by interactions with the bodily world? An interview with Jiaheng Hu
Share
Facebook Twitter LinkedIn Pinterest Email


One of many key challenges in constructing robots for family or industrial settings is the necessity to grasp the management of high-degree-of-freedom programs akin to cell manipulators. Reinforcement studying has been a promising avenue for buying robotic management insurance policies, nonetheless, scaling to advanced programs has proved tough. Of their work SLAC: Simulation-Pretrained Latent Motion Area for Complete-Physique Actual-World RL, Jiaheng Hu, Peter Stone and Roberto Martín-Martín introduce a way that renders real-world reinforcement studying possible for advanced embodiments. We caught up with Jiaheng to search out out extra.

What’s the matter of the analysis in your paper and why is it an fascinating space for examine?

This paper is about how robots (particularly, family robots like cell manipulators) can autonomously purchase expertise through interacting with the bodily world (i.e. real-world reinforcement studying). Reinforcement studying (RL) is a normal studying framework for studying from trial-and-error interplay with an setting, and has big potential in permitting robots to study duties with out people hand-engineering the answer. RL for robotics is a really thrilling discipline, as it could open potentialities for robots to self-improve in a scalable means, in direction of the creation of general-purpose family robots that may help folks in our on a regular basis lives.

What have been a number of the points with earlier strategies that your paper was making an attempt to handle?

Beforehand, many of the profitable functions of RL to robotics have been performed by coaching completely in simulation, then deploying the coverage within the real-world straight (i.e. zero-shot sim2real). Nonetheless, such a way has massive limitations: on one hand, it isn’t very scalable, as you want to create task-specific, high-fidelity simulation environments that extremely match the real-world setting that you simply need to deploy the robotic in, and this could typically take days or months for each job. Alternatively, some duties are literally very arduous to simulate, as they contain deformable objects and contact-rich interactions (for instance, pouring water, folding garments, wiping whiteboard). For these duties, the simulation is commonly fairly totally different from the true world. That is the place real-world RL comes into play: if we are able to permit a robotic to study by straight interacting with the bodily world, we don’t want a simulator anymore. Nonetheless, whereas a number of makes an attempt have been made in direction of realizing real-world RL, it’s really a really arduous drawback since: 1. Pattern-inefficiency: RL requires a variety of samples (i.e. interplay with the setting) to study good habits, which is commonly inconceivable to gather in giant portions within the real-world. 2. Security Points: RL requires exploration, and random exploration within the real-world is commonly very very harmful. The robotic can break itself and can by no means be capable of recuperate from that.

May you inform us in regards to the methodology (SLAC) that you simply’ve launched?

So, creating high-fidelity simulations could be very arduous, and straight studying within the real-world can also be actually arduous. What ought to we do? The important thing concept of SLAC is that we are able to use a low-fidelity simulation setting to help subsequent real-world RL. Particularly, SLAC implements this concept in a two-step course of: in step one, SLAC learns a latent motion house in simulation through unsupervised reinforcement studying. Unsupervised RL is a way that permits the robotic to discover a given setting and study task-agnostic behaviors. In SLAC, we design a particular unsupervised RL goal that encourages these behaviors to be protected and structured.

Within the second step, we deal with these discovered behaviors as the brand new motion house of the robotic, the place the robotic does real-world RL for downstream duties akin to wiping whiteboards by making choices on this new motion house. Importantly, this methodology permit us to avoid the 2 largest drawback of real-world RL: we don’t have to fret about questions of safety for the reason that new motion house is pretrained to be at all times protected; and we are able to study in a sample-efficient means as a result of our new motion house is skilled to be very structured.

IMG 1552 scaledThe robotic finishing up the duty of wiping a whiteboard.

How did you go about testing and evaluating your methodology, and what have been a number of the key outcomes?

We check our strategies on an actual Tiago robotic – a excessive degrees-of-freedom, bi-manual cell manipulation, on a sequence of very difficult real-world duties, together with wiping a big whiteboard, cleansing a desk, and sweeping trash right into a bag. These duties are difficult from three features: 1. They’re visuo-motor duties that require processing of high-dimensional picture data. 2. They require the whole-body movement of the robotic (i.e. controlling many degrees-of-freedom on the identical time), and three. They’re contact-rich, which makes it arduous to simulate precisely. On all of those duties, our methodology permits us to study high-performance insurance policies (>80% success fee) inside an hour of real-world interactions. By comparability, earlier strategies merely can’t remedy the duty, and infrequently threat breaking the robotic. So to summarize, beforehand it was merely not potential to resolve these duties through real-world RL, and our methodology has made it potential.

What are your plans for future work?

I feel there may be nonetheless much more to do on the intersection of RL and robotics. My eventual aim is to create really self-improving robots that may study completely by themselves with none human involvement. Extra lately, I’ve been all for how we are able to leverage basis fashions akin to vision-language fashions (VLMs) and vision-language-action fashions (VLAs) to additional automate the self-improvement loop.

About Jiaheng

jeff squared

Jiaheng Hu is a 4th-year PhD scholar at UT-Austin, co-advised by Prof. Peter Stone and Prof. Roberto Martín-Martín. His analysis curiosity is in Robotic Studying and Reinforcement Studying, with the long-term aim of growing self-improving robots that may study and adapt autonomously in unstructured environments. Jiaheng’s work has been revealed at top-tier Robotics and ML venues, together with CoRL, NeurIPS, RSS, and ICRA, and has earned a number of finest paper nominations and awards. Throughout his PhD, he interned at Google DeepMind and Ai2, and is a recipient of the Two Sigma PhD Fellowship.

Learn the work in full

SLAC: Simulation-Pretrained Latent Motion Area for Complete-Physique Actual-World RL, Jiaheng Hu, Peter Stone, Roberto Martín-Martín.


AIhub square 2021

AIhub
is a non-profit devoted to connecting the AI group to the general public by offering free, high-quality data in AI.

AIhub square 2021

AIhub
is a non-profit devoted to connecting the AI group to the general public by offering free, high-quality data in AI.


Smith

Lucy Smith
is Senior Managing Editor for Robohub and AIhub.

Smith

Lucy Smith
is Senior Managing Editor for Robohub and AIhub.

Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the most recent breakthroughs, get unique updates, and join with a world community of future-focused thinkers.
Unlock tomorrow’s developments in the present day: learn extra, subscribe to our e-newsletter, and turn into a part of the NextTech group at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

How Sennheiser elevated PCB testing by 33% with a Robotiq 2F-85 gripper

February 12, 2026

Sven Koenig wins the 2026 ACM/SIGAI Autonomous Brokers Analysis Award

February 11, 2026

Nationwide Robotics Week 2026 Underscores Robotics as a Essential U.S. Business and Workforce Engine

February 11, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

Expertise Leaker Offers Us Greatest Look But on the iPhone 17e

By NextTechFebruary 13, 2026

Apple is predicted to launch the iPhone 17e subsequent week, on February 19, forward of…

Mazagan Seashore & Golf Resort Welcomes Ramadan with a Culinary Expertise Impressed by Moroccan Heritage

February 13, 2026

Methods to Align Giant Language Fashions with Human Preferences Utilizing Direct Desire Optimization, QLoRA, and Extremely-Suggestions

February 13, 2026
Top Trending

Expertise Leaker Offers Us Greatest Look But on the iPhone 17e

By NextTechFebruary 13, 2026

Apple is predicted to launch the iPhone 17e subsequent week, on February…

Mazagan Seashore & Golf Resort Welcomes Ramadan with a Culinary Expertise Impressed by Moroccan Heritage

By NextTechFebruary 13, 2026

In the course of the holy month of Ramadan, Mazagan Seashore &…

Methods to Align Giant Language Fashions with Human Preferences Utilizing Direct Desire Optimization, QLoRA, and Extremely-Suggestions

By NextTechFebruary 13, 2026

On this tutorial, we implement an end-to-end Direct Desire Optimization workflow to…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!