Can I edit multiple objects in the same photo?

Yes. Each detected object is independent. You can remove one, change the color of another, and leave the rest untouched. In BOARD, the edit history stacks per object, so you can undo individual changes without losing earlier edits to other parts of the image.

BOARD Guide · Concepts · May 2026

What is object-oriented AI photo editing?

Most photo editors treat your image as a grid of pixels. You select regions, paint masks, stack layers, and hope the result blends. Object-oriented editors treat the image as a collection of things (a dog, a person, a wall, a sign) and let you address each one by name. This is what that looks like in practice, with the honest limits.

Photo on an easel showing three labelled objects — mug, plant, book — with the mug selected by an amber bounding box, illustrating object-oriented photo editing

The short definition. Object-oriented photo editing is a workflow where the unit of edit is a named object: a person, a chair, a wall, a sign. Not a pixel region. Not a layer. Not a prompt. The editor detects every object on upload, makes each one tappable, and exposes three verbs (Remove, Edit, Replace). You point; the AI handles the mask, the blend, and the fill.

Why a new category was needed

Photo editing grew up around two ideas: pixels and layers. Both come from the 1990s, when processing power was the limit. You drew a selection, you painted a mask, you stacked layers. Each operation told the computer which pixels to touch and which to leave alone. The skill of editing was, for the most part, the skill of making good selections.

That skill is hard to learn. If you have ever tried to mask a person's hair out of a busy background, you know the problem. The tools are precise. They demand the same from you. Most beginners quit before they get a clean cutout. The one-tap mobile editors (Google Magic Eraser, Samsung Object Eraser, Apple Clean Up) hid the selection step behind a single tap, and that is why they took off.

Those tools only do one thing, though. Erase. And each one only runs on its own platform. No general-purpose editor was built on the broader idea: that the image is a set of objects, and you address them by pointing.

The core shift: from pixels to objects

Take a single edit: change the color of that jacket. Here is how the same job runs in each paradigm.

Pixel-first vs object-first: changing a jacket color
	Pixel-first (Photoshop, GIMP)	Object-first (BOARD, native Magic Eraser)
Step 1	Open the image, duplicate the layer	Open the image; objects already detected
Step 2	Pick a selection tool (lasso, quick select, magic wand)	Tap the jacket
Step 3	Carefully outline the jacket; refine the edge	Type or pick the new color
Step 4	Apply a hue/saturation adjustment to the selection	Done
Step 5	Check the edges; touch up; merge layers; export	—
Time to result	2–10 minutes (skill-dependent)	5–10 seconds
Skill required	Selection tools, masks, blend modes	Knowing what you want changed

The object-first version is faster for one reason: the selection work you used to do has already happened by the time you arrive. Detection runs on upload.

The three verbs of an object-oriented editor

Once every object is detected and tappable, the editor needs a small vocabulary of things it can do to a selected object. Most object-oriented editors converge on three verbs.

Remove

The selected object is masked, and the area it occupied is filled with what the model thinks belongs in the background. Used for tourists, power lines, photobombers, distracting clutter.

Edit

The selected object stays where it is but changes: color, material, expression, size, style. You supply a short instruction in plain English. Nothing else in the photo moves.

Replace

You swap the selected object for a different one, supplied as a description or a reference image. Geometry, lighting, and shadow follow the original scene.

Remove and Edit ship in most tools today. Replace is the newest of the three and the hardest for the model. Matching a swapped object's lighting and shadows to the scene is still an open research problem. In BOARD, Replace is an internal-only feature while the team measures quality at scale.

What makes the detection work

Object-oriented editing was not possible in 2010 for one reason. Nothing could reliably detect "the dog" or "the person in the red jacket" in a photo. Generic segmentation models existed, but they returned messy outlines. Anything past the obvious foreground fell apart.

Three things changed:

Vision-language models (Claude's vision API, Gemini, GPT-4V) can describe what's in an image with near-human accuracy.
Segmentation models like SAM3 can produce pixel-accurate masks from a single point click or text prompt.
Generative inpainting models (the same lineage that powers Photoshop's Generative Fill) can fill the gap left when something is removed.

An object-oriented editor is those three stacked together. Detection finds the objects. Segmentation produces a clean mask the moment you tap. Inpainting fills whatever you removed. None of them is the editor on its own. The editor is the interaction model that hides all three behind a tap.

How BOARD implements the model

BOARD runs in a browser at app.brd.ing. The interaction loop has four steps.

Upload a photo

Detection runs on the server in 1–2 seconds and returns a list of objects at three zoom levels: subjects (the dog), parts (the dog's ear), and details (a single tooth).

Tap an object

A pixel-accurate mask is generated on demand. You see the selected region highlight so you can confirm you have the right thing.

Pick a verb

Remove, Edit, or Replace. Edit takes a short text instruction ("make it red", "give it sunglasses"). Replace takes a reference image.

Confirm

The result renders inline in 3–8 seconds. Detected objects are persistent for the session, so you can edit one, undo, edit a different one, and the original detection still applies. Every change is reversible.

Where the approach struggles

The approach is not a fit for everything. Four jobs still belong with the older tools.

Atmospheric effects

Fog, haze, color grading, exposure changes. None of these are objects. They're properties of the whole scene. A pixel-first tool with global adjustments is still the right call.

Single-pixel touch-ups

Removing one stray hair, fixing a single hot pixel, dust-spotting a film scan. The mask granularity isn't built for this. A precision brush in Photoshop still wins.

Generating from nothing

If you want a photo that doesn't exist yet (a dragon in a forest), this isn't the tool. Object-oriented editors modify what's already in the frame.

Geometry-heavy compositing

Cutting a person out of one photo and placing them into another, matching perspective and lens distortion. Still a layer-based workflow.

For most everyday cleanup (removing a stranger, changing a wall color, taking out a sign), the object-first workflow is faster by an order of magnitude. For specialist work, the older tools hold up.

The short version

Object-oriented photo editing reorganizes the editor around what instead of where. You tell the editor which thing you mean by tapping it. You tell it what to do with a short verb. Everything else (the mask, the blend, the fill) is the editor's job. It is the closest a photo editor has come to the way people actually think about their photos: scenes full of things, not grids of pixels.

If you want to try it on one of your own photos, BOARD runs in any browser at brd.ing with 5 free edits and no signup required. Or read how AI photo cleanup works, the free no-signup workflow, or how it compares to Apple Clean Up and Google Magic Eraser.

One thing to try first. Pick a photo that is nearly perfect except for one distracting object: a sign, a wire, a person at the edge of frame. That is the cleanest test of whether the object-first workflow fits how you actually edit. Save the complex compositing jobs for a layer-based tool.

Frequently asked

What does object-oriented mean in photo editing?

The editor treats your image as a collection of named objects (a dog, a car, a person, a wall) instead of a grid of pixels. You tap the object you want to change, then pick a command: Remove, Edit, or Replace. The AI handles masking, blending, and background fill in one step. There are no brushes, layers, or selection tools.

How is this different from Photoshop's Generative Fill?

Generative Fill is a prompt-driven tool: you draw a selection with the lasso or marquee, type a description of what you want, and Adobe's model generates pixels into that selection. Object-oriented editing skips the manual selection step. The editor detects every object in the image automatically, and you tap the one you want to modify. The prompt (if any) describes the object's new state, not the pixels.

Do I need to know how to mask or make selections?

No. Masking and selecting are what object-oriented editors do for you. The AI scans the photo on upload, detects every distinct object, and makes each one tappable. You point at the thing you want to change; the editor handles the mask. This is the core difference from layer-based tools like Photoshop, Affinity Photo, or GIMP.

What can object-oriented editing not do well?

Three things, honestly. First, anything that isn't a discrete object (fog, gradient color shifts, overall exposure). Second, fine touch-ups on a single pixel scale (a layer-based tool with a precision brush is still better). Third, generating entirely new content from scratch — object-oriented editors modify what's already there.

Is this the same as AI photo editing?

AI photo editing is the umbrella term. Object-oriented is one approach inside it. Other AI approaches include prompt-only generators (DALL-E, Midjourney), filter-based AI (Lightroom's Lens Blur, Photoshop's Neural Filters), and one-tap full-image transformations. Object-oriented editing specifically refers to the workflow where the unit of work is a named object, not a region, a prompt, or the whole image.

Try the object-first workflow on one of your photos.

Tap a thing, pick a verb. No masks, no layers, no signup. 5 edits free.

Clean Up a Photo Free →