Professional Documents
Culture Documents
Context: in a crowd
We can add more detail to this description to paint a more vivid picture. In most cases,
I recommend adding the detail with the object-action-context format. This makes the
A concise way to add detail: A painted Black Lives Matter cardboard sign being
A redundant way to add detail: Black Lives Matter sign being held in a crowd.
Behind them, there is a building made of stone. The crowd is blurred out. The sign is
You can see that the redundant example repeats a lot of words and skips from one
thing to another, making it longer and more disorienting. The object-action-context
format helps fixate on the primary focus, and then slowly expands to get the full
picture.
However, sometimes there is so much detail that adding it will cause one big run-on
sentence. In this case, it’s better to keep the first sentence short and in an object-
action-context format. This way, people can get a basic understanding and skip
in a crowd of people. In the middle of the sign, there’s an illustration of the earth with
2 raised fists on either side that have a rainbow pride square background, a trans pride
Object: person
Context: crowd
To be even more descriptive, you can go into the meaning of the visual imagery. This is
where image descriptions are useful to sighted people too because they provide an
specifically Black power as popularized by the Black Panther Party in the 1960s. The
rainbow pride flag has rainbow stripes and symbolizes general LGBTQ pride. The
trans pride flag has pink, blue, and white stripes and celebrates pride for the trans and
This is obviously a longer commitment and might not be necessary if you have written
other content in a surrounding article, or if you’d rather reserve your energy from
Digital images don’t just exist in a vacuum, they have surrounding context on
whatever app or website they are found on. We often come across images when we
read news articles, scroll through our friends’ social media, go online shopping, etc.
Description: dog sitting in a field of autumn leaves
For an image on a news article, that description works reasonably well. It’s objective,
concise, and descriptive while also quite brief. However, if the photo is on a dog
breeder website that has images of several different breeds of dogs, it doesn’t actually
communicate much useful information (especially if every description just says “dog”).
For that reason, I’d include the breed and more visual descriptions of the dog.
Dog breeder website description: light brown pitbull dog breed with a black nose
If I am an artist who took the photo of the dog and want to showcase my skills and
evoke some emotion, I’d probably want to add more detailed descriptions. This is of
course subjective based on the type of artist I am and how I want the audience to
receive my work.
Artist’s website description: a photo of a light brown dog with short ears and a
pink tongue sticking out, sitting in a field of autumn leaves. The dog is in focus while
the sunlight reflects off the light orange and yellow leaves, fading into the background.
If the image is on a site or app with several hundred images, perhaps for an animal
science textbook or a flashcards app, then it’s better to optimize for quick and easy
scanning of content. Perhaps “dog in field” or even just “dog” will suffice in that
context.
This section is new and very much in progress. If you have additional thoughts about
how context impacts image descriptions, feel free to reach out! (Contact info at the
bottom.)
Describing race and gender
In general, I recommend describing race and gender if (1) it’s relevant to the image,
(2) if you know what their identity is, and (3) if the way you’re describing it is
It’s relevant if it’s an essential part of the message — such as a joke specifically about
identity or celebrating the radical visibility of marginalized people. It’s known if the
people in the photo have confirmed their identity. It’s consistent if you’re describing
everyone’s race and not just people of color (rather assuming white as the default).
For example, the stock photo project Disabled and Here has created a library
specifically of disabled BIPOC (Black and Indigenous People of Color). They have also
“Don’t bring” all with graphic illustrations. In the light purple “What to wear” section,
it says, “nondescript, solid color, layered clothing; cover identifying tattoos, goggles &
mask, emergency contacts written down, heat resistant gloves, and tie your hair up.”
In the medium purple “What to bring” section, it says, “water for drinking & tear gas,
snacks, cash/change & ID, washcloth, bandages & first aid supplies, ear plugs, and
protest signs. In the light red “Don’t bring” section, it says, “cell phone without first
turning off Face/Touch ID, going on airplane mode, and disabling data, jewelry,
I choose not to go into detail describing visuals if the text essentially already describes
them and if the description is already pretty lengthy. In this case, the goal of the image
distracting.
A few last tips
Because images are just a collection of pixels and vectors, text is
not available to screen readers (sometimes screen readers can
pick up text, but this doesn’t appear to be consistent). Make sure
to transcribe all the text in the image if you’re providing a
description.
Most social media platforms these days offer an alt text option.
From what I have heard from the community, it’s actually better
to put the description in the caption instead. This way, it’s
also visible to screen magnifier users and sighted people who
might need or benefit from the description. (It’s actually unclear
to me whether we should also add a full description or
abbreviated version in the alt tag if we are adding to the caption.
If you have thoughts or ideas about this, please reach out!
Contact info at the bottom).