DISPATCH — APRIL 2024
WHEN YOUR GLASSES CAN "SEE" — META AI WITH VISION ON RAY-BAN
TL;DR
Meta announced the rollout of Meta AI with Vision to Ray-Ban Meta smart glasses on April 23, 2024. The update moves smart glasses from voice-only assistants to multimodal AI: you ask a question, the glasses capture what you're looking at, and you get an answer in your ear. That shift—from *searching* to *asking your surroundings*—is frictionless. And frictionless is exactly where the ethics break down.
—
1) WHAT HAPPENED IN APRIL 2024
Meta rolled out three interconnected updates to Ray-Ban Meta smart glasses:
**Meta AI with Vision** — The glasses now capture visual context, process it via cloud, and deliver answers. Examples: translate a menu, identify a plant, get Instagram captions, read signs.
**POV Calling** — Share your view during WhatsApp and Messenger calls. The glasses broadcast what you're seeing to the person on the other end, turning video calling into *presence*.
**New Styles** — Cat-eye frames (Skyler) and a low-bridge fit option signal that the mainstream-adoption strategy is serious. Wearables become infrastructure only when they disappear into daily life.
This is not "AR" in the sci-fi overlay sense. It's something more subtle: a conversational layer attached to your point of view.
And subtle is exactly why the ethics matter.
—
2) HOW "LOOK AND…" WORKS
The interface is deceptively simple:
- You ask a question: "What plant is this?"
- The glasses capture an image of what you're looking at.
- A cloud process (Meta AI) analyzes the image and generates an answer.
- The answer arrives in your ear.
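The loop above can be sketched in code. To be clear: Meta has not published a client API for this pipeline, so every name here (`VisionQuery`, `cloud_analyze`, `look_and_ask`, the fake camera) is hypothetical — a minimal Python sketch of the four steps, not an actual SDK.

```python
from dataclasses import dataclass

@dataclass
class VisionQuery:
    question: str
    image: bytes  # the frame captured from the glasses' camera

def cloud_analyze(query: VisionQuery) -> str:
    # Stand-in for the remote multimodal model. In the real product this is
    # a network round trip — which is exactly why "asking" implies "uploading".
    return (f"Answer to {query.question!r}, "
            f"based on {len(query.image)} bytes of captured environment")

def look_and_ask(question: str, capture_frame) -> str:
    """The four-step loop: ask -> capture -> cloud analysis -> spoken answer."""
    frame = capture_frame()                                # step 2: capture
    answer = cloud_analyze(VisionQuery(question, frame))   # step 3: cloud inference
    return answer                                          # step 4: audio on-device

# Usage with a placeholder camera
fake_camera = lambda: b"\x89PNG..."  # pretend this is a real photo
print(look_and_ask("What plant is this?", fake_camera))
```

Note what the sketch makes visible: the user supplies only a question, but the function signature demands a frame. The environment is captured whether or not it was part of the user's intent.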
That one-sentence loop reveals the ethical core:
**You aren't only asking a question.** You're capturing an environment.
**The environment may include people** who didn't consent to being part of your prompt.
**The capture implies a cloud step** (even if temporary), raising questions about data retention, access controls, and how much rests on policy rather than design.
**The answer can be confidently wrong**, which can matter in real settings (directions, identification, translations).
Wearables turn the classic AI problem (hallucinations, bias, overconfidence) into a social problem: mistakes happen *around other people*.
—
3) WHO THIS HELPS (REALISTICALLY)
This is where the "Augmented Human" part is legit.
**Everyday cognition, offloaded**
- Travelers translating menus, signs, labels without pulling a phone.
- Shoppers identifying products or comparing options in seconds.
- Creators and students capturing context and generating drafts on the go.
**Accessibility (quietly huge)**
Anything that describes, translates, or reads text hands-free can be meaningful for people with limited vision, reading barriers, or situational constraints (carrying bags, navigating crowds, etc.). Even without a display, "audio answers" function like an assistive layer.
**Social connection with less friction**
Sharing your POV isn't just convenience. It's a new language for remote help:
- "Is this ripe?"
- "Which wire do I unplug?"
- "What do you think of this?"
Your camera becomes your sentence.
—
4) WHO THIS CAN HARM (OR PRESSURE)
Wearables don't distribute power evenly.
**Bystanders become background data**
A phone camera is obvious. Glasses are not. Even with a capture LED, most people aren't trained to notice it. The burden shifts to everyone *around* you to stay aware—and that's not a fair burden.
**Service workers and public-facing people**
Retail staff, security guards, teachers, restaurant workers. The moment "glasses cameras" normalize, you get new friction:
- "Are you recording me?"
- "Is this being streamed?"
- "Is an AI analyzing what I said?"
Even if you're not doing any of that, the *uncertainty* is stressful.
**Kids and consent**
POV capture around children raises the stakes quickly: children can't meaningfully consent, and their parents may strongly object.
—
5) THE ETHICS: THE 4 QUESTIONS THAT MATTER
Here's a framework for judging this class of tech.
**Question 1: Noticeability**
Can others reasonably tell what's happening?
Ray-Ban's FAQ notes the glasses have a front-facing capture LED meant to signal photo/video capture. That's a start—but noticeability is not binary. In bright daylight or crowded places, "technically visible" can still be "socially invisible."
*Ethical test:* If you filmed in a room and nobody noticed, the design failed the bystander.
**Question 2: Consent**
Who is forced into your prompt?
The moment AI uses your surroundings as input, "asking" becomes "capturing." It's not just recording; it's interpretation. That's a step up in intimacy because interpretation can produce labels: *age, emotion, identity cues, context guesses*.
*Ethical test:* Would you feel okay being on the other side of the lens?
**Question 3: Reliability**
What happens when the AI is wrong?
The Verge highlighted a truth: the system can be spot-on or confidently wrong. On a phone, you can cross-check quietly. With glasses, you may act on the answer immediately (direction, translation, identification).
*Ethical test:* Does the UI encourage skepticism, or does it feel like authority?
**Question 4: Drift**
How quickly does this become surveillance?
Even if Meta's intent is "useful assistance," the same capabilities can be repurposed—by users, by third-party services, by social norms. History is clear: if a tool can be used for stalking, harassment, or doxxing, someone will try.
*Ethical test:* Are safeguards strong enough to resist the obvious abuse cases?
—
6) MY TAKE: "FACE COMPUTERS" DON'T NEED DISPLAYS TO CHANGE SOCIETY
April 2024 was important because it moved smart glasses from "camera + audio" into something that feels like a new interface category: *contextual AI*.
And contextual AI changes behavior. Not because it's magical, but because it's *frictionless*.
- You will ask more questions because you can.
- You will capture more moments because it's easy.
- You will record more people *incidentally*.
- You may stop thinking of it as "recording," and start thinking of it as "querying reality."
That last shift is the real ethical cliff.
If we want this future without turning every public space into a low-grade surveillance zone, we need strong norms:
- Ask before recording.
- Turn it off in sensitive spaces.
- Default to respecting bystanders.
- Treat "Look and…" like a privilege, not a right.
Because the moment glasses become normal, the only thing left protecting privacy is culture.