Prototype Meta Headset Contains Custom Silicon for Photorealistic Avatars on Standalone

Researchers at Meta Reality Labs have created a prototype VR headset with a custom-built accelerator chip specifically designed to handle the AI processing needed to render the company's photorealistic Codec Avatars on a standalone headset.

Long before the company changed its name, Meta had been working on its Codec Avatars project, which aims to make nearly photorealistic avatars in VR a reality. Using a combination of on-device sensors (like eye-tracking and mouth-tracking) and AI processing, the system animates a detailed recreation of the user in a realistic way, in real-time.

Or at least that's how it works when you've got high-end PC hardware.

Early versions of the company's Codec Avatars research were backed by the power of an NVIDIA Titan X GPU, which monstrously dwarfs the power available in something like Meta's latest Quest 2 headset.

But the company has moved on to figuring out how to make Codec Avatars possible on low-powered standalone headsets, as evidenced by a paper published alongside last month's 2022 IEEE CICC conference. In the paper, Meta reveals it created a custom chip built on a 7nm process to function as an accelerator specifically for Codec Avatars.

Specially Made

Image courtesy Meta Reality Labs

According to the researchers, the chip is far from off-the-shelf. The team designed it with an essential part of the Codec Avatars processing pipeline in mind: analyzing the incoming eye-tracking images and generating the data needed for the Codec Avatars model. The chip's footprint is a mere 1.6mm².

"The test-chip, fabricated in 7nm technology node, features a Neural Network (NN) accelerator consisting of a 1024 Multiply-Accumulate (MAC) array, 2MB on-chip SRAM, and a 32bit RISC-V CPU," the researchers write.
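To get a feel for why the on-chip SRAM matters, here's a rough back-of-the-envelope sketch checking whether a small eye-tracking CNN's weights could fit entirely inside a 2MB memory like the one described. The layer shapes and 8-bit quantization are illustrative assumptions, not figures from the paper.

```python
# Rough memory-budget sketch: would a small eye-tracking CNN's weights fit
# in 2 MB of on-chip SRAM? Layer shapes below are hypothetical, chosen only
# to illustrate the arithmetic; they are not the model from the paper.

SRAM_BYTES = 2 * 1024 * 1024  # 2 MB on-chip SRAM
BYTES_PER_WEIGHT = 1          # assume 8-bit quantized weights

# (out_channels, in_channels, kernel_h, kernel_w) for hypothetical conv layers
conv_layers = [
    (16, 1, 3, 3),
    (32, 16, 3, 3),
    (64, 32, 3, 3),
]
# (output_dim, input_dim) for hypothetical fully connected layers
fc_layers = [(128, 64 * 8 * 8), (32, 128)]

conv_weights = sum(oc * ic * kh * kw for oc, ic, kh, kw in conv_layers)
fc_weights = sum(out * inp for out, inp in fc_layers)
total_bytes = (conv_weights + fc_weights) * BYTES_PER_WEIGHT

print(f"total weights: {conv_weights + fc_weights:,}")
print(f"model size: {total_bytes / 1024:.1f} KiB "
      f"({100 * total_bytes / SRAM_BYTES:.1f}% of 2 MB SRAM)")
```

Under these made-up shapes the model lands well under the 2MB budget, which is the point of the researchers' re-architecting: a model that fits on-chip never pays the energy cost of fetching weights from external DRAM.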

In turn, they also rebuilt part of the Codec Avatars AI model to take advantage of the chip's specific architecture.

"By re-architecting the Convolutional [neural network] based eye gaze extraction model and tailoring it for the hardware, the entire model fits on the chip to mitigate system-level energy and latency cost of off-chip memory accesses," the Reality Labs researchers write. "By efficiently accelerating the convolution operation at the circuit-level, the presented prototype [chip] achieves 30 frames per second performance with low-power consumption at low form factors."
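The quote's emphasis on "accelerating the convolution operation" comes down to the fact that convolution is, at bottom, a huge pile of multiply-accumulate operations, exactly what a 1024-wide MAC array is built to chew through. The sketch below counts MACs for a single hypothetical conv layer; the layer shape is an assumption for illustration, not taken from the paper.

```python
# Illustrative sketch: convolution reduced to multiply-accumulate (MAC)
# operations, the primitive that a 1024-wide MAC array accelerates.
# The layer shape is a made-up example, not the model from the paper.

def conv2d_macs(in_ch, out_ch, kh, kw, out_h, out_w):
    """MAC operations for one conv layer: every output pixel sums
    in_ch * kh * kw products, for out_ch * out_h * out_w outputs."""
    return out_ch * out_h * out_w * in_ch * kh * kw

# Hypothetical eye-image conv layer: 32->64 channels, 3x3 kernel, 30x40 output
macs = conv2d_macs(32, 64, 3, 3, 30, 40)

# If a 1024-MAC array could retire 1024 MACs per cycle at full utilization,
# the ideal cycle count for this layer is:
cycles = -(-macs // 1024)  # ceiling division

print(f"MACs per frame for this layer: {macs:,}")
print(f"ideal cycles on a 1024-MAC array: {cycles:,}")
```

Even this one toy layer needs tens of millions of MACs per frame, which is why running the whole model 30 times per second on a general-purpose mobile CPU is a poor fit, and a dedicated array is so much more power-efficient.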

The prototype headset is based on Quest 2 | Image courtesy Meta Reality Labs

By accelerating an intensive part of the Codec Avatars workload, the chip not only speeds up the process, it also reduces the power and heat required. It's able to do this more efficiently than a general-purpose CPU thanks to the custom design of the chip, which in turn informed the re-architected software design of the eye-tracking component of Codec Avatars.

One Part of a Pipeline

But the headset's general-purpose CPU (in this case, Quest 2's Snapdragon XR2 chip) doesn't get the day off. While the custom chip handles part of the Codec Avatars encoding process, the XR2 manages the decoding process and renders the actual visuals of the avatar.
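The division of labor described above can be sketched conceptually: the accelerator's job is the encoding side (eye images in, compact avatar-state codes out), while the XR2 decodes those codes and renders the result. Everything here, function names, the 32-dimensional code, the image size, is a stand-in assumption to show the split, not the paper's actual interfaces.

```python
# Conceptual sketch of the encode/decode split: a stand-in "accelerator"
# compresses an eye image into a small code, and a stand-in "XR2" side
# decodes it for rendering. All names and shapes are hypothetical.

def encode_on_accelerator(eye_image):
    """Stand-in for the custom chip: compress an eye image into a small code.
    (A real encoder is a CNN; this just fakes a low-dimensional latent.)"""
    flat = [px for row in eye_image for px in row]
    return [px / 255.0 for px in flat[:32]]  # hypothetical 32-dim code

def decode_and_render_on_xr2(code):
    """Stand-in for the XR2 side: decode the code and render the avatar."""
    return f"rendered avatar frame from {len(code)}-dim code"

eye_image = [[0] * 320 for _ in range(240)]  # hypothetical eye-camera frame
code = encode_on_accelerator(eye_image)
print(decode_and_render_on_xr2(code))
```

The appeal of this split is bandwidth: shipping a tiny latent code between chips is far cheaper than shipping raw camera frames or rendered pixels.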

Image courtesy Meta Reality Labs

The work must have been quite multidisciplinary, as the paper credits 12 researchers, all from Meta's Reality Labs: H. Ekin Sumbul, Tony F. Wu, Yuecheng Li, Syed Shakib Sarwar, William Koven, Eli Murphy-Trotzky, Xingxing Cai, Elnaz Ansari, Daniel H. Morris, Huichu Liu, Doyun Kim, and Edith Beigne.

It's impressive that Meta's Codec Avatars can run on a standalone headset, even if a specialty chip is required. But one thing we don't know is how well the visual rendering of the avatars is handled. The underlying scans of the users are highly detailed and may be too complex to render on Quest 2 in full. It's not clear how much of the 'photorealistic' part of Codec Avatars is preserved in this instance, even if all the underlying pieces are there to drive the animations.

– – — – –

The research represents a practical application of the new compute architecture that Reality Labs' Chief Scientist Michael Abrash recently described as a necessary next step for making the sci-fi vision of XR a reality. He says that moving away from highly centralized processing toward more distributed processing is critical for meeting the power and performance demands of such headsets.

One can imagine a range of XR-specific functions that could benefit from chips specially designed to accelerate them. Spatial audio, for instance, is desirable across the board in XR for added immersion, but realistic sound simulation is computationally expensive (not to mention power hungry!). Positional-tracking and hand-tracking are a critical part of any XR experience; yet another place where designing the hardware and algorithms together could yield substantial benefits in speed and power.

Interested in the cutting edge of XR science? Check out our archives for more breakdowns of interesting research.
