×
Well done. You've clicked the tower. This would actually achieve something if you had logged in first. Use the key for that. The name takes you home. This is where all the applicables sit. And you can't apply any changes to my site unless you are logged in.

Our policy is best summarized as "we don't care about _you_, we care about _them_", no emails, so no forgetting your password. You have no rights. It's like you don't even exist. If you publish material, I reserve the right to remove it, or use it myself.

Don't impersonate. Don't name someone involuntarily. You can lose everything if you cross the line, and no, I won't cancel your automatic payments first, so you'll have to do it the hard way. See how serious this sounds? That's how serious you're meant to take these.

×
Register


Required. 150 characters or fewer. Letters, digits and @/./+/-/_ only.
  • Your password can’t be too similar to your other personal information.
  • Your password must contain at least 8 characters.
  • Your password can’t be a commonly used password.
  • Your password can’t be entirely numeric.

Enter the same password as before, for verification.
Login

Grow A Dic
Define A Word
Make Space
Set Task
Mark Post
Apply Votestyle
Create Votes
(From: saved spaces)
Exclude Votes
Apply Dic
Exclude Dic

Click here to flash read.

We present Bayesian Controller Fusion (BCF): a hybrid control strategy that
combines the strengths of traditional hand-crafted controllers and model-free
deep reinforcement learning (RL). BCF thrives in the robotics domain, where
reliable but suboptimal control priors exist for many tasks, but RL from
scratch remains unsafe and data-inefficient. By fusing uncertainty-aware
distributional outputs from each system, BCF arbitrates control between them,
exploiting their respective strengths. We study BCF on two real-world robotics
tasks involving navigation in a vast and long-horizon environment, and a
complex reaching task that involves manipulability maximisation. For both these
domains, simple handcrafted controllers exist that can solve the task at hand
in a risk-averse manner but do not necessarily exhibit the optimal solution
given limitations in analytical modelling, controller miscalibration and task
variation. As exploration is naturally guided by the prior in the early stages
of training, BCF accelerates learning, while substantially improving beyond the
performance of the control prior, as the policy gains more experience. More
importantly, given the risk-aversity of the control prior, BCF ensures safe
exploration and deployment, where the control prior naturally dominates the
action distribution in states unknown to the policy. We additionally show BCF's
applicability to the zero-shot sim-to-real setting and its ability to deal with
out-of-distribution states in the real world. BCF is a promising approach
towards combining the complementary strengths of deep RL and traditional
robotic control, surpassing what either can achieve independently. The code and
supplementary video material are made publicly available at
https://krishanrana.github.io/bcf.

Click here to read this post out
ID: 42953; Unique Viewers: 0
Unique Voters: 0
Total Votes: 0
Votes:
Latest Change: April 5, 2023, 7:33 a.m. Changes:
Dictionaries:
Words:
Spaces:
Views: 10
CC:
No creative common's license
Comments: