RVT: Robotic View Transformer for 3D Object Manipulation
Overview Video
Summary
For 3D object manipulation, methods that build an explicit 3D representation perform better than those relying only on camera images. But using explicit 3D representations like voxels comes at large computing cost, adversely affecting scalability. We propose RVT, a multi-view transformer for 3D manipulation that is both scalable and accurate. In simulations, we find that a single RVT model works well across 18 RLBench tasks with 249 task variations, achieving 26% higher relative success than existing state-of-the-art method (PerAct). It also trains 36X faster than PerAct for achieving the same performance and achieves 2.3X the inference speed of PerAct. Further, RVT can perform a variety of manipulation tasks in the real world with just a few (~10) demonstrations per task.
Real-World Videos (2X Default Playback Speed)
Put Object in Drawer
Success: Put orange bottle in drawerSuccess: Put orange bottle in drawerSuccess: Put orange bottle in drawerFailure: Put blue marker in drawer
Put Object in Shelf
Success: Put yellow block in top shelfSuccess: Put yellow block in bottom shelfSuccess: Put yellow block in top shelfFailure: Put yellow block in top shelf
Stack Blocks
Success: Put yellow block on blue blockSuccess: Put blue block on red blockSuccess: Put red block on yellow block
Failure: Put green marker in bowlFailure: Put blue marker in bowlFailure: Put green marker in mug
RLBench Simulation Videos
Put in Drawer
Success: put the item in the top drawerSuccess: put the item in the bottom drawerSuccess: put the item in the top drawerFailure: put the item in the middle drawer
Sweep to Dustpan
Success: sweep dirt to the short dustpanSuccess: sweep dirt to the short dustpanSuccess: sweep dirt to the tall dustpanFailure: sweep dirt to the tall dustpan
Meat off Grill
Success: take the steak off the grillSuccess: take the steak off the grillSuccess: take the steak off the grillFailure: take the steak off the grill
Open Drawer
Success: open the top drawerSuccess: open the middle drawerSuccess: open the bottom drawerFailure: open the top drawer
Turn Tap
Success: turn right tapSuccess: turn left tapSuccess: turn left tapSuccess: turn right tap
Close Jar
Success: close the cyan jarSuccess: close the orange jarSuccess; close the navy jarFailure: close the red jar
Drag Stick
Success: use the stick to drag the cube onto the navy targetSuccess: use the stick to drag the cube onto the gray targetSuccess: use the stick to drag the cube onto the red targetSuccess: use the stick to drag the cube onto the silver target
Success: screw in the rose light bulbSuccess: screw in the gray light bulbSuccess: screw in the violet light bulbFailure: screw in the silver light bulb
Slide Block
Success: slide the block to pink targetSuccess: slide the block to yellow targetSuccess: slide the block to green targetFailure: slide the block to pink target
Put in Safe
Success: put the money away in the safe on the top shelfSuccess: put the money away in the safe on the bottom shelfSuccess: put the money away in the safe on the middle shelFailure: put the money away in the safe on the top shelf
Place Wine
Success: stack the wine bottle to the left of the rackSuccess: stack the wine bottle to the middle of the rackSuccess: stack the wine bottle to the right of the rackFailure: stack the wine bottle to the middle of the rack
Put in Cupboard
Success: put the coffee in the cupboardSuccess: put the mustard in the cupboardSuccess: put the chocolate jello in the cupboardFailure: put the coffee in the cupboard
Sort Shape
Success: put the cylinder in the shape sorterSuccess: put the star in the shape sorterSuccess: put the moon in the shape sorterFailure: put the star in the shape sorter
Push Bottons
Success: push the maroon button, then push the green button, then push the navy buttonSuccess: push the maroon buttonSuccess: push the maroon buttonFailure: push the maroon button
Insert Peg
Success: put the ring on the violet spokeSuccess: put the ring on the black spokeFailure: put the ring on the green spokeFailure: put the ring on the azure spoke
Stack Cups
Success: stack the other cups on top of the lime cupSuccess: stack the other cups on top of the gray cupSuccess: stack the other cups on top of the red cupFailure: stack the other cups on top of the maroon cup
Place Cups
Failure: place 3 cups on the cup holderFailure: place 2 cups on the cup holderFailure: place 2 cups on the cup holder