What Is WCAG 1.2.1?
If your website has a podcast clip, a voicemail recording, or a silent video showing a process, this criterion is talking to you.
WCAG 1.2.1 says that if you have audio-only or video-only prerecorded content, you need to provide an alternative. Not a vague summary — an actual equivalent that gives users the same information.
Audio-only means things like a recorded podcast episode, a speech clip, or an audio interview — no video involved. Video-only means a silent demonstration video, an animation with no narration, or a recorded screen capture that shows a process but says nothing.
The rule exists because someone who is deaf cannot hear your podcast. Someone who is blind cannot follow your silent product demo. Without an alternative, they get nothing.
| Field | Details |
| WCAG Criterion | 1.2.1 |
| Conformance Level | Level A |
| Principle | Perceivable |
| WCAG Version | 2.1 |
| Official Reference | https://www.w3.org/WAI/WCAG21/Understanding/1-2-1 |
The Concept in Plain Terms
Think of it this way. If you recorded yourself walking someone through a recipe with no visuals — just your voice — a deaf user gets zero information from that clip. Now imagine you uploaded a silent time-lapse of you making that recipe — a blind user gets nothing from that either.
The solution for audio-only is a text transcript. For video-only, you can either write a transcript-style text description of what happens, or you can add an audio track that describes the visual content.
The alternative has to be genuinely equivalent. That means it covers the same information in the same order. A one-sentence summary of a ten-minute audio recording does not pass this criterion.
Who Does This Actually Affect?
The people who need this most are:
- Deaf and hard of hearing users who cannot access audio-only content
- Blind and low-vision users who cannot follow visual-only videos
- People in environments where they cannot play audio or video — think open-plan offices, public transport, waiting rooms
- People who process written information better than audio
- Search engines — transcripts make your content indexable
This is not a niche problem. Around 1.5 billion people globally have some degree of hearing loss. And plenty of fully sighted people choose to read rather than watch or listen.
Common Failures
No alternative at all
The most common failure: an audio clip or silent video sits on the page with no transcript, no description, nothing. It looks fine to someone who can use it. To everyone else, it’s a dead end.
A summary instead of an equivalent
Providing a one-paragraph summary of a 20-minute audio file does not meet this criterion. The alternative needs to deliver the same informational value as the original content.
Alternative exists but is hard to find
Burying the transcript link three pages away, or hiding it in a collapsed accordion with no clear label, fails in practice even if it technically exists. The alternative needs to be clearly associated with the content it covers.
Alternative is out of date
The audio gets updated. The transcript doesn’t. Now they no longer match. This is a common maintenance failure, especially on sites with regularly updated content.
How to Fix It
For audio-only content
Write a full text transcript. This means:
- Include everything that is said — speaker names if there are multiple speakers, all dialogue, key background audio that conveys meaning
- Format it clearly — timestamps are helpful for long recordings, headings help navigation
- Place it close to the audio player, or link to it with a clear, descriptive label (e.g., ‘Read full transcript’)
For video-only content
You have two options:
- Write a text description: Document what happens visually, in sequence. This works well for simple process videos.
- Add a narrated audio track: Record a description of the visual content and attach it as an audio alternative.
The text description is usually faster and more practical for most organisations. Keep it specific and sequential — don’t paraphrase or summarise.
| Real-world example |
| Quick example: A silent product demo video shows a user clicking through a checkout process. The text alternative should describe each screen in order: what the user sees, what they click, what happens next. Not: ‘This video shows our checkout flow.’ |
Edge Cases Worth Knowing
A few situations that trip people up:
- Decorative video-only content: If the video is purely decorative — background animation that conveys no information — it does not require an alternative under 1.2.1. But be honest with yourself about whether it truly conveys nothing.
- Content provided as an alternative to text: If your audio or video is itself provided as an alternative to existing text content, it does not need a further alternative. This avoids a circular requirement.
- Live vs prerecorded: This criterion only applies to prerecorded content. Live audio and video is covered under different criteria.
How to Test for 1.2.1
- Find every audio-only or video-only element on the page
- For audio-only: check whether a full text transcript exists and is clearly accessible
- For video-only: check whether a text description or audio alternative exists
- Read or listen to the alternative — does it provide the same information as the original?
- Check the alternative is close to the content and clearly labelled
There is no automated test that can confirm your transcript is accurate and equivalent — this requires a human review.
Why This Matters Beyond a Checkbox
Transcripts don’t just help users with disabilities. They get indexed by search engines, which means your audio and video content becomes searchable. They let people skim for the part they need. They work in environments where audio is off limits.
Getting this right is not a significant technical lift — it’s mostly a content task. And it’s one of those fixes that benefits your whole audience, not just the people you might think of as needing accessibility support.
Quick Fix Checklist
- All prerecorded audio-only content has a full text transcript
- All prerecorded video-only content has a text description or audio alternative
- Alternatives are clearly labelled and easy to find near the original content
- Alternatives are equivalent — same information, same order, not just a summary
- Alternatives are kept up to date when the original content changes
Frequently Asked Questions
Does a YouTube auto-generated caption count as an alternative?
No. Auto-generated captions are for synchronised audio description (covered under 1.2.2), not for the text transcript required by 1.2.1. They are also often inaccurate. A proper transcript needs to be reviewed for accuracy.
What if the video has both audio and visuals?
If your video has both a visual track and an audio track, it’s not video-only — it’s a full multimedia presentation. 1.2.1 does not apply, but other criteria do (1.2.2 for captions, 1.2.3 or 1.2.5 for audio descriptions).
How detailed does a video description need to be?
Detailed enough that someone listening to or reading it gets the same understanding as someone watching the video. If the video shows three steps in a process, the description should cover all three steps in order.
Do I need to host the transcript on the same page?
Not necessarily — a clearly labelled link to a separate transcript page is fine. But it needs to be obvious, close to the content, and accessible in its own right. Don’t hide it.