Today Apple finally revealed Vision Pro, its $3500 AR/VR headset launching in 2024. Here’s a rundown of its full specs and features.

Vision Pro is an ultra high-end headset packing in the highest resolution and most sensors ever in an AR/VR product. It introduces new features never before shipped and its visionOS re-thinks the line between 2D, AR, and VR.

Lightweight Design With Tethered Battery

Vision Pro has a custom lightweight aluminum alloy frame supporting a curved “three-dimensionally formed” laminated glass front plate to achieve a thin design.

What also keeps Vision Pro light is the separation of the battery from the headset. Some other headsets like Meta Quest Pro and Pico 4 have their battery in the rear of the strap, but Apple’s design takes it off your head entirely with an external battery tethered to a magnetic connector on the left of the headband.

Apple claims the external battery lasted 2 hours under the following test conditions:

Video playback, internet browsing, spatial video capture, and FaceTime. Video playback tested in conjunction with an Environment, using 2D movie content purchased from the Apple TV app. Internet browsing tested using 20 popular websites. FaceTime tested between two Apple Vision Pro units with Personas enabled. Tested with Wi-Fi associated to a network.

0:00

/

Alternatively, Vision Pro can used perpetually without the battery by plugging it in to a power source. Apple hasn’t yet gone into details about which sources are supported or whether the adapter for this is included.

A Plethora Of Cameras & Sensors

Vision Pro has a total of twelve cameras, four depth sensors, a LiDAR sensor, and six microphones.

0:00

/

Passthrough AR

Six of the twelve cameras are under the front glass.

Two of these six capture high resolution color to provide the headset’s passthrough view of the real world, streaming “over one billion color pixels per second.”

0:00

/

The other four front cameras perform headset positional tracking and other general computer vision tasks.

Hand Tracking

One purpose of the four depth sensors is hand tracking. Apple describes the hand tracking quality as “so precise, it completely frees up your hands from needing clumsy hardware controllers.”

0:00

/

Vision Pro lacks any kind of tracked controllers, though it supports playing 2D games on a virtual screen with a gamepad.

Environment Meshing

The LiDAR sensor is used to perform “real time 3D mapping” of your environment in conjunction with the other front cameras.

0:00

/

Apple claims Vision Pro has a “detailed understanding” of your floors, walls, surfaces, and furniture, which apps can leverage. One example Apple gave was virtual objects casting shadows on real tables, but this only scratches the surface of what should be possible.

Face & Eye Tracking For FaceTime & More

Two downwards facing cameras track your face, while four internal IR cameras beside them track your eyes, helped by a ring of LED illuminators around the lenses.

Vision Pro’s eye tracking serves three purposes: authentication, foveated rendering, and driving your FaceTime avatar.

0:00

/

Apple is calling its new iris scanning authentication OpticID, following the naming scheme of TouchID and FaceID from its other devices. OpticID is how you unlock Vision Pro, and it also works with Apple Pay purchases and password autofill. As with TouchID and FaceID, the biometric data powering OpticID is processed on-device by a Secure Enclave Processor.

Foveated rendering is a technique where only the small region of the display that your eyes are currently looking at is rendered in full resolution, thus freeing up performance since the rest is lower resolution. Freed up GPU resources can be used for better performance, to increase rendering resolution, or to increase graphics settings. It leverages the fact that our eyes only see in high resolution in the very center of the fovea.

0:00

/

Finally, eye tracking combines with the downwards cameras to track your facial expressions in real time to drive your FaceTime Persona, Apple’s take on photorealistic avatars. Meta has been showing off research towards this for over four years now, but it looks like Apple will be the first to ship – albeit not to the same quality of Meta’s research.

R1 Chip For Ultra Low Latency

To fuse the input from all these cameras, sensors, and microphones together, Apple developed a custom chip it calls R1.

Apple claims R1 “virtually eliminates lag, streaming new images to the displays within 12 milliseconds.”

For comparison, the founder of the French headset startup Lynx claims Meta Quest Pro’s passthrough latency is 35-60 milliseconds. It’s unclear if this is a like-for-like comparison though.

AR-VR Digital Crown

Vision Pro only has two physical controls, both on the top. A button to capture “spatial videos” and “spatial photos” at any moment, and a Digital Crown.

Pressing the Digital Crown brings up the system Home View. But turning it controls your level of immersion, all the way from full AR to full VR. If you go halfway, for example, you’ll see VR in front of you and AR behind you.

0:00

/

On existing headsets like Meta Quest and Pico 4, passthrough is a toggle option, meaning you have to choose between full and no immersion. Apple wants to instead let you choose exactly how engaged to be with your real surroundings.

EyeSight & Person Awareness

A completely unique feature of Vision Pro is an external display that shows your eyes to other people in the room and indicates how aware of them you are. Apple calls this technology EyeSight.

When you’re in an AR app EyeSight shows a colored pattern in front of your eyes, and when you’re in a VR app it shows only the pattern with your eyes invisible.

0:00

/

When someone comes close to you, Vision Pro will show a cutout of the person and EyeSight will reveal your eyes to them.

Apple described making sure that you’re “never isolated from the people around you” as one of its “foundational design goals” for Vision Pro, and the company sees it as a clear differentiator to fully opaque headsets like Meta Quests.

Micro OLED Displays With 23 Million Total Pixels

Vision Pro features dual micro-OLED panels with unprecedented pixel density. Apple says each is “the size of a postage stamp” yet together they have 23 million pixels, fewer than was rumored.

Apple didn’t reveal the exact resolution, but 23 million total pixels would suggest a per-eye resolution of around 3400×3400 for a square aspect ratio, or around 3200×3600 for the 9:10 aspect ratio typically used in headsets. We don’t know the exact aspect ratio in Vision Pro, however.

0:00

/

Apple confirmed Vision Pro’s displays support wide color gamut and high dynamic range, but didn’t reveal detailed specs like peak brightness.

M2 Chip For ‘Unparalleled’ Performance

Vision Pro is powered by the same Apple Silicon M2 chip used in recent Macs.

Apple says this delivers “unparalleled standalone performance”, and allows Vision Pro to “maintain a comfortable temperature and run virtually silent.”

Compared to the rumored specs of the next generation Qualcomm Snapdragon chipset Meta Quest 3 will use, Apple’s M2 should deliver roughly 25% faster single-threaded CPU performance, 75% faster multi-threaded, and roughly 15% more GPU power.

However, without knowing the exact clock speeds of the processors in each headset, this is only a very rough comparison.

visionOS With Hand & Eye Control

visionOS is Apple’s custom “spatial operating system” for Vision Pro, and presumably for future headsets in the Vision line too.

Apple describes visionOS as “familiar, yet groundbreaking.” It presents you with floating 2D apps that you scroll through with a flick of your fingers. You select menu items with your eyes by just looking at them, and you use your fingers to click.

Many of Apple’s first party apps and services are available in visionOS, including Notes, Messages, Safari, Keynote, Photos, FaceTime, and Apple Music.

Apple walks us through visionOS

Rather than just existing within their 2D frame, many of Apple’s apps “become spatial,” taking up space around you. In FaceTime group calls for example, each person’s webcam view becomes its own floating rectangle, with spatial audio coming from each person. Apple also gave the example of being able to pull out 3D models from messages into real space.

Vision Pro also lets you expand your Mac’s display into a huge virtual display wirelessly just by looking at it.

A major focus of visionOS is watching movies and TV shows on a huge virtual screen, including support for viewing 3D movies from Apple’s library with depth.

Personalized Ray Traced Spatial Audio

Vision Pro has “audio pods” on its side, each with two drivers. Apple describes it as the “most advanced Spatial Audio system ever.”

If you have an iPhone with a TrueDepth FaceID sensor you can scan your face to enable Personalized Spatial Audio, where the system will tailor the sound to your head and ear geometry for the most precise possible spatial audio.

0:00

/

Vision Pro also uses a technique called Audio Ray Tracing, where it scans the features and materials of your space to “precisely match sound to your room.” This technique is also used in Apple’s HomePod speakers.

Apple claims Vision Pro buyers will be “totally convinced that sounds are coming from the environment around you.”

Pricing & Availability

Apple Vision Pro will go on sale in the US in early 2024 starting at $3500. It’ll be available online and in Apple Stores.

Apple says more countries will get Vision Pro “later next year,” but didn’t disclose exactly which countries.

Write A Comment