feat(addons/objects3d): reusable 3D object detection (2D detect + depth into oriented boxes) by salmanmkc · Pull Request #417 · google/xrblocks

salmanmkc · 2026-06-26T13:52:46Z

adds an objects3d addon that turns 2D detection plus the depth mesh into oriented 3D bounding boxes you can query in world space. it's the pipeline that lived inline in the objects_3d demo, pulled out into a reusable Script so apps and other addons can just ask for detected 3D objects.

Object3DDetector runs the whole thing: snapshot the camera image and depth mesh, run 2D detection, get a per-object segmentation mask, raycast the masked depth samples into world space, fit a yaw-aligned oriented box, fuse across views, and optionally draw debug wireframe boxes. each result is a Detected3DObject with a nearest-surface query, so you can ask for the closest point on an object, which is handy for pointing at it or placing something against it.
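
to make the nearest-surface idea concrete, here is a minimal sketch of a closest-point query against a yaw-aligned box. the `Vec3` and `YawBox` types and the `closestPointOnYawBox` name are hypothetical and only for illustration; the addon's actual Detected3DObject API may look different. it assumes yaw is a rotation about world +Y, and a point already inside the box is returned unchanged.

```typescript
// Hypothetical types for illustration only; the addon's real classes may differ.
interface Vec3 { x: number; y: number; z: number; }

interface YawBox {
  center: Vec3;   // world-space center of the box
  halfSize: Vec3; // half extents along the box's local x, y, z axes
  yaw: number;    // rotation about the world +Y axis, in radians
}

const clamp = (v: number, lo: number, hi: number): number =>
  Math.min(hi, Math.max(lo, v));

// Returns the point of the box closest to p (p itself if p is inside).
function closestPointOnYawBox(box: YawBox, p: Vec3): Vec3 {
  const c = Math.cos(box.yaw);
  const s = Math.sin(box.yaw);

  // Offset from the box center in world space.
  const dx = p.x - box.center.x;
  const dy = p.y - box.center.y;
  const dz = p.z - box.center.z;

  // Undo the yaw so the box is axis-aligned in its own frame.
  const lx = c * dx - s * dz;
  const lz = s * dx + c * dz;

  // Clamp each local coordinate to the box extents.
  const qx = clamp(lx, -box.halfSize.x, box.halfSize.x);
  const qy = clamp(dy, -box.halfSize.y, box.halfSize.y);
  const qz = clamp(lz, -box.halfSize.z, box.halfSize.z);

  // Re-apply the yaw and translate back to world space.
  return {
    x: box.center.x + c * qx + s * qz,
    y: box.center.y + qy,
    z: box.center.z - s * qx + c * qz,
  };
}
```

because the box only rotates about the vertical axis, the query reduces to a 2D rotation plus a per-axis clamp, which is cheap enough to run every frame for pointing or placement.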

the backends are pluggable:

2D detection: gemini (open-vocabulary, needs a key) or mediapipe (on-device COCO, no key, fixed class set).
masks: SAM / slimsam (via @huggingface/transformers) or the mediapipe segmenter.
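
as a rough sketch of how an app might choose between the two 2D detection backends, the snippet below encodes only the trade-offs listed above (key vs no key, open vocabulary vs fixed classes). `DETECTOR_TRAITS` and `chooseDetector` are hypothetical names, the selection policy is an assumption, and the class set passed in is whatever the caller believes the on-device model supports; none of this is the addon's actual configuration API.

```typescript
// Hypothetical helper; not part of the objects3d addon.
type DetectorBackend = 'gemini' | 'mediapipe';

interface BackendTraits {
  needsApiKey: boolean;    // gemini needs a key, mediapipe does not
  openVocabulary: boolean; // gemini accepts arbitrary labels, mediapipe has a fixed class set
}

const DETECTOR_TRAITS: Record<DetectorBackend, BackendTraits> = {
  gemini: { needsApiKey: true, openVocabulary: true },
  mediapipe: { needsApiKey: false, openVocabulary: false },
};

// Assumed policy: prefer the keyless backend whenever every requested label
// is in its fixed class set, otherwise fall back to the open-vocabulary one.
function chooseDetector(
  hasApiKey: boolean,
  wantedLabels: string[],
  fixedClasses: Set<string>,
): DetectorBackend {
  const coveredByFixedSet = wantedLabels.every((label) => fixedClasses.has(label));
  if (coveredByFixedSet) return 'mediapipe';
  if (hasApiKey || !DETECTOR_TRAITS.gemini.needsApiKey) return 'gemini';
  throw new Error('labels outside the fixed class set need an API key');
}
```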

it also exports the pure helpers that are useful on their own: uvToNdc (the snapshot-vs-camera aspect correction), box2dIoU / unionDetections / snapBoxToFloor (2D fusion and floor snapping), and the label categorize helpers (flat / surface / tiny-flat / light buckets).
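
for readers unfamiliar with these helpers, here is a minimal sketch of what 2D IoU, IoU-based merging, and a UV-to-NDC conversion can look like. the names (`iou2d`, `mergeDetections`, `uvToNdcSketch`) and the `Box2D` / `Detection2D` shapes are hypothetical and deliberately differ from the addon's exports, whose signatures are not shown in this PR description. the aspect correction assumes the snapshot and the camera share the same vertical field of view and differ only in width; the real uvToNdc may correct differently.

```typescript
// Hypothetical shapes for illustration; the addon's real types may differ.
interface Box2D { xMin: number; yMin: number; xMax: number; yMax: number; }
interface Detection2D { label: string; box: Box2D; }

// Intersection over union of two axis-aligned 2D boxes.
function iou2d(a: Box2D, b: Box2D): number {
  const w = Math.max(0, Math.min(a.xMax, b.xMax) - Math.max(a.xMin, b.xMin));
  const h = Math.max(0, Math.min(a.yMax, b.yMax) - Math.max(a.yMin, b.yMin));
  const inter = w * h;
  const areaA = (a.xMax - a.xMin) * (a.yMax - a.yMin);
  const areaB = (b.xMax - b.xMin) * (b.yMax - b.yMin);
  const union = areaA + areaB - inter;
  return union > 0 ? inter / union : 0;
}

// Greedy merge: same-label boxes that overlap enough are replaced by their
// enclosing box; everything else is kept as a separate detection.
function mergeDetections(dets: Detection2D[], threshold = 0.5): Detection2D[] {
  const kept: Detection2D[] = [];
  for (const det of dets) {
    const match = kept.find(
      (k) => k.label === det.label && iou2d(k.box, det.box) >= threshold,
    );
    if (match) {
      match.box = {
        xMin: Math.min(match.box.xMin, det.box.xMin),
        yMin: Math.min(match.box.yMin, det.box.yMin),
        xMax: Math.max(match.box.xMax, det.box.xMax),
        yMax: Math.max(match.box.yMax, det.box.yMax),
      };
    } else {
      kept.push({ label: det.label, box: { ...det.box } });
    }
  }
  return kept;
}

// Maps snapshot UV (origin top-left, v down) to camera NDC (origin center,
// y up). Assumes equal vertical FOV, so only x is rescaled by the aspect ratio.
function uvToNdcSketch(
  u: number,
  v: number,
  snapshotAspect: number,
  cameraAspect: number,
): { x: number; y: number } {
  return {
    x: (2 * u - 1) * (snapshotAspect / cameraAspect),
    y: 1 - 2 * v,
  };
}
```

under that assumption, a 4:3 snapshot viewed through a 16:9 camera only spans the middle 75% of the NDC x range, which is why raycasting with uncorrected UVs would land to the side of the detected object.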

the heavy deps (@huggingface/transformers, mediapipe) stay external; the one rollup change just marks transformers external so it isn't bundled.
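
for context, marking a package external in rollup is done through the `external` option, which accepts package ids or a predicate. the sketch below is not the repo's actual config: the `input` path and the `rollupConfigSketch` / `isExternal` names are hypothetical, and the predicate form is just one way to also cover subpath imports.

```typescript
// Packages that should be left as imports instead of being bundled.
const EXTERNAL_PACKAGES: string[] = ['@huggingface/transformers'];

// Predicate in the shape rollup's `external` option accepts: true means
// "do not bundle this id". Matches the package itself and its subpaths.
function isExternal(id: string): boolean {
  return EXTERNAL_PACKAGES.some((pkg) => id === pkg || id.startsWith(pkg + '/'));
}

// Hypothetical config object; in a real rollup config file this would be the
// default export, and the input path here is made up for illustration.
const rollupConfigSketch = {
  input: 'src/addons/objects3d/index.ts',
  external: isExternal,
};
```

with the package external, the app that consumes the addon is responsible for providing it at runtime, which keeps the addon bundle small for users who pick the mediapipe backends instead.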

this is split out of the agenthands branch (#416), which originally grounded its pointing through this pipeline before I switched it to a lighter depth raycast. the addon ships standalone here with colocated vitest specs (label categorization, depth sampling, fusion, and the nearest-surface query); the objects_3d demo can move onto it as a follow-up. lint, tests and build are clean.

salmanmkc added 9 commits June 26, 2026 21:46

build: mark @huggingface/transformers external for objects3d addon

5246fb0

objects3d: add object label categorization

29fadad

objects3d: add depth sampling with uvToNdc aspect correction

77e0eb5

objects3d: add per-category oriented bounding box fitting

9f80ce9

objects3d: add multi-view box fusion

c715d19

objects3d: add slimsam and mediapipe mask backends

3265ff3

objects3d: add debug box-group visuals

3341204

objects3d: add Detected3DObject with nearest-surface query

8a64279

objects3d: add Object3DDetector orchestrator and barrel

d85c966

ruofeidu requested a review from nsalminen June 26, 2026 22:16

ruofeidu and others added 2 commits June 26, 2026 18:16

Merge branch 'main' into feat/objects3d

3b13c9c

Merge branch 'main' into feat/objects3d

82f7f5c

salmanmkc wants to merge 11 commits into google:main from salmanmkc:feat/objects3d
