Boneyard Tools

Object Detector

Find and label every object in a photo. Drop in an image and an AI detector (D-FINE) locates each object, then this tool draws a labeled bounding box around it, gives a count of what it found, and lets you download the annotated image. Drag the confidence slider to hide weak guesses; it re-filters instantly without re-running the model. Everything runs entirely in your browser, so nothing is uploaded. The model downloads once on first use, then is cached.

How to detect objects in a photo

  1. Drop an image in, or click to browse.
  2. Wait a few seconds while the AI scans the photo for objects.
  3. Adjust the confidence slider if you like, then download the annotated image.

Examples

Photo to labeled objects

A photo of two cats on a couch with two TV remotes
Boxes labeled cat, cat, remote, remote with confidence scores

Frequently asked questions

Is my image uploaded anywhere?

No. The AI detection model runs entirely in your browser with WebAssembly. The image is processed on your device and never uploaded. Only the model itself is downloaded, once, then cached for instant reuse.

What kinds of objects can it detect?

It detects 80 common object types from the COCO dataset: people, animals (cats, dogs, birds, horses), vehicles (cars, bikes, buses, planes), and everyday objects (chairs, laptops, bottles, cups, phones, TV remotes, and more).

Which model powers the detection?

D-FINE, a 2024 state-of-the-art real-time object detector. It predicts a labeled bounding box and a confidence score for each object it finds, and runs locally via transformers.js.

What does the confidence slider do?

Each detection has a score from 0 to 1 for how sure the model is. The slider hides any box below the threshold you pick, so you can trim weak or spurious guesses. It re-filters the existing results instantly without re-running the model.

Why does the first run take a moment?

The AI model downloads the first time you use the tool, then is cached for instant reuse. Larger images also take a little longer because more pixels are analyzed.

Related tools