I gotta say, these new Personas are good.
The previous beta ones were terrifying Frankenstein monsters. The new ones fooled my boss for 30 minutes.
There's a bit of uncanny valley left, nevertheless. My persona's smile reminds me of the horrible expressions people like to make in Source Filmmaker.
CorridorDigital recently used the tech to assist in remaking the rooftop bullet-time scene from The Matrix. It's used for making the environment instead of modeling it from scratch.
https://www.youtube.com/watch?v=iq5JaG53dho&t=2s
This video might help explain 3D Gaussian splatting: https://www.youtube.com/watch?v=wKgMxrWcW1s Essentially, an entirely new graphics pipeline with different fundamental techniques which allow for higher performance and fidelity compared to... what we did before(?) Cool.
That video didn’t explain what Gaussian splatting is at all, but I did get a minute ad read for some cloud GPU service.
Not quite, it’s just a way to assign a color value to a point in space (think point clouds) based on photogrammetry. It’s voxels on steroids, but it’s still drawn using the same techniques. It’s the magic of creating the splats that’s interesting.
A color value for each point is a good starting place to gain an intuition. Some readers might be interested to know that the color is not constant for each point, but instead dependent on viewing angle. That is part of what allows splats to look realistic. Real objects have some degree of specularity which makes them take on slightly different shades as you move your head.
And since we normally see with binocular vision, a stereoscopic view adds another layer of realism you wouldn't normally perceive otherwise. Each eye sees subsurface scattering differently and integrates in your head.
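To make the view-dependent color idea concrete, here is a minimal sketch of how a splat's color can be evaluated from spherical-harmonic coefficients (Python/NumPy; the array layout and function name are illustrative rather than the actual 3DGS code, and the reference method uses higher-degree harmonics than the degree-1 shown here):

    import numpy as np

    SH_C0 = 0.28209479177387814  # normalization of the degree-0 SH basis
    SH_C1 = 0.4886025119029199   # normalization of the degree-1 SH basis

    def splat_color(sh, view_dir):
        # sh is a (4, 3) array: one DC coefficient plus three degree-1
        # coefficients per RGB channel.
        # view_dir: unit vector from the splat center toward the camera.
        x, y, z = view_dir
        rgb = SH_C0 * sh[0]                                  # view-independent part
        rgb += SH_C1 * (-y * sh[1] + z * sh[2] - x * sh[3])  # view-dependent part
        return np.clip(rgb + 0.5, 0.0, 1.0)                  # offset, then clamp to [0, 1]

Evaluate the same splat from two directions and you get two slightly different shades, which is the move-your-head specularity effect described above.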
The same graphics pipeline is used: rasterization.
Rasterization is a very general term. There is a big difference in practice between the traditional rasterization pipeline and splat rasterizers.
it's kinda like saying "we still show pixels". true but almost totally useless for understanding anything.
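To see what the difference amounts to in practice: a splat rasterizer doesn't shade triangles; it sorts the projected Gaussians by depth and alpha-composites them. A toy per-pixel version (Python; the names are illustrative, and this assumes the 3D Gaussians have already been projected to 2D means and covariances):

    import numpy as np

    def composite_pixel(splats, px):
        # splats: (mean_2d, inv_cov_2d, opacity, rgb) tuples, sorted near-to-far.
        # px: pixel coordinates as a length-2 array.
        color = np.zeros(3)
        transmittance = 1.0
        for mean, inv_cov, opacity, rgb in splats:
            d = px - mean
            # the 2D Gaussian falloff is what turns a point into a soft "splat"
            alpha = opacity * np.exp(-0.5 * d @ inv_cov @ d)
            color += transmittance * alpha * rgb
            transmittance *= 1.0 - alpha
            if transmittance < 1e-4:  # pixel is saturated; stop early
                break
        return color

No triangles, no texture sampling, no fragment shading in the usual sense, which is why "it's still rasterization" is true but not very informative.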
Sorry, but this is a horrible video. The guy just spews superlatives in an annoying voice until 4:30 (of a 6-minute video, mind you), when he finally gives a 10-second "explanation" of Gaussian splatting, which doesn't really explain anything, then jumps to a sponsored ad.
yeah... their older videos are a bit more useful from what I remember (more time spent on the research paper content, etc), but they've become so content-free that I just block the channel outright nowadays. it's the "this changes everything (every time, every day)" hype-channel for graphics.
Tested said similar things about Personas. https://youtu.be/LzZ2j9CAcww?si=IRvxNaNZeBQp7WLV
It’s amazing tech, it’s just a solution looking for a problem.
It feels a bit like the original Segway’s over-engineered solution versus cheap Chinese hoverboards, then the scooters and e-bikes that took over afterwards.
Why would I be paying all this money for this realistic telepresence when my shitbox HP laptop from Walmart has a perfectly serviceable webcam?
Why do we have video call meetings when people mostly just listen and the information is carried via audio?
Why do we have 4K monitors when 1920x1080 is perfectly fine for 99.999% of use cases?
If you look at the world through this lens called "perfect serviceability," you'll think everything is a solution looking for a problem.
> when 1920x1080 is perfectly fine for 99.999% of use cases
A lot of people here work with text all day every day and we would rather work with text that looks like it came out of a laser printer than out of a fax machine.
Of all places, HN should not be the one to casually conflate resolution and DPI!
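The distinction is simple arithmetic: density is resolution divided by physical size, so the same 1920x1080 can be crisp or chunky depending on the panel. A quick check (Python; the diagonal sizes are just examples):

    import math

    def ppi(width_px, height_px, diagonal_inches):
        # pixels along the diagonal / inches along the diagonal
        return math.hypot(width_px, height_px) / diagonal_inches

    print(round(ppi(1920, 1080, 24.0)))   # ~92 PPI on a 24" desktop monitor
    print(round(ppi(1920, 1080, 13.3)))   # ~166 PPI on a 13.3" laptop panel

Same resolution, nearly double the density: that's the laser-printer-versus-fax-machine difference the parent is describing.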
I actually think about this a lot, and I could argue both sides of this. On the one hand, you could look at your list of examples as obvious examples of modern innovation/improvement that enrich our lives. On the other, you could take it as a facetious list that proves the point of GP, as one other commenter apparently already has.
I often think how stupid video call meetings are. Teams video calls are one of the few things that make every computer I own, including my M1 MBP, run the fans at full tilt. I've had my phone give me overheat warnings from showing the tile board of bored faces staring blankly at me. And yeah, honestly, it feels like a solution looking for a problem. I understand that it's not, and that some people are obsessed for various reasons (some more legitimate than others) with recreating the conference room vibe, but still.
And with monitors? This is a far more "spicy" take, but I think 1280x1024 is actually fine. Even 1024x768. Now, I have a 4K monitor at home, so don't get me wrong: I like my high DPI monitor.
But I think past 1024x768, the actual productivity gains from higher resolutions begin to rapidly dwindle. 1920x1080, especially on "small" displays (under 20 inches), can look pretty visually stunning. 4K is definitely nicer, but do we really need it?
I'm not trying to get existential with this, because what do we really "need"? But I think that, objectively, computing is divided into two very broad eras. The first era, ending around the mid 2000s, was marked by year-after-year innovation where 2-4 years brought new features that solved _real problems_, as in, features that gave users new qualitative capabilities. Think 24-bit color vs 8-bit color, or 64-bit vs 32-bit (or even 32-bit vs 16-bit). Having a webcam. Having 5+ hours of battery life on a laptop, with a real backlit AMLCD display. Having more than a few gigabytes of internal storage. Having a generic peripheral bus (USB/firewire). Having PCM audio. Having 3D hardware acceleration...
I'm not prepared to vigorously defend this thesis ;-) but it seems at about 2005-ish, the PC space had reached most of these "core qualitative features". After that, everything became better and faster, quantitatively superior versions of the same thing.
And sometimes yeah, it can feel both like it's all gone to waste on ludicrously inefficient software (Teams...), and sometimes, like modern computing did become a solution in search of a problem, in order to keep selling new hardware and software.
I used my VP extensively recently when working remotely. It's not glamorous, but I used Screen Sharing with a MacBook, which grants you a virtual ultrawide monitor.
Once you're already in VR, it's nice to not have to break out for a meeting, and that's where Personas fit in.
It's not a killer app carrying the product, it's a necessary feature making sure there's not a gap in workflow.
I would not describe creating an experience that feels like you are in the room with a group of people, even allowing cross talk, as a solution looking for a problem. I think it's the thing everyone slowly dying on Zoom calls wishes they could have.
I disagree. Many of us don't use a headset regularly or carry it with us like a phone or laptop; it is an outright inconvenience to use, with only marginal benefits. Businesses won't want one if webcams still do the trick, and users might respond positively but are always priced out of owning one.
If I'm doing work at my desk and I get a Zoom call, there is a 0.00% chance I will go plug in my Vision Pro to answer it. I'm just going to open the app and turn on my webcam, spatial audio be damned.
"Now out of beta"??
Just in time for Vision Pro to go big. Right?