OfficePad

Guides/How to Redact a PDF Properly (and Why a Black Box Isn't Enough)

How to Redact a PDF Properly (and Why a Black Box Isn't Enough)

"Redacting" a document sounds like a simple visual operation — cover up the sensitive part, send the file. But a PDF is not a sheet of paper, and a shape drawn on top of text does not necessarily remove that text from the file underneath. Journalists, lawyers and government agencies have all, at various points, published "redacted" documents where the blacked-out text could be recovered in seconds by anyone who selected it, copied it, or opened the file in a different viewer. Understanding what redaction actually needs to do — and checking that it did it — takes only a little extra effort, and it is the difference between a document that is safe to share and one that only looks that way.

Why a black box on its own is not redaction

When you draw a filled rectangle over a paragraph in many editors, what you have added is a new shape on top of the page — a separate object that happens to visually cover the text below it. The original text is still there, in the page's content stream, exactly as before: still selectable, still searchable, and still extractable by copying the "covered" area, opening the file in a text editor, or running it through any tool that reads a PDF's underlying content rather than its rendered appearance.

This is exactly how a number of high-profile redaction failures have happened: a document looks correctly redacted on screen and in print, but the "removed" names, numbers or paragraphs are still sitting in the file, one Ctrl+A / Ctrl+C away from being read by the next person who opens it.

What real redaction does instead

Proper redaction removes the underlying content — the actual text characters or image data — from the page, not just hides it visually. After a real redaction, selecting the area where the text used to be selects nothing, searching the document for the redacted word returns no results, and there is no content left to extract by any method, because there is genuinely nothing left there.

In OfficePad's editor, the redaction tool works this way: marking an area for redaction removes the content underneath it from the page, leaving a solid block in its place rather than a shape drawn on top of intact content. The result is a document where the redacted information is gone, not merely covered.

The places redaction often gets missed

The visible page is not the only place sensitive information can hide in a PDF. Document metadata — author name, company, software used, sometimes even file paths from the original computer — is stored separately from the page content and is not touched by redacting page text. Comments, form field values, and earlier versions embedded for "undo" history can also retain information that has been removed from the visible pages.

If a document went through several rounds of editing before reaching its final form, it is also worth checking whether any of that editing history persists. The safest approach for a genuinely sensitive document is to treat redaction as a multi-step process: redact the visible content, then check the document's properties for anything identifying that should not be there, and consider exporting a clean final copy rather than continuing to edit the same file indefinitely.

A quick way to check your work

Before sending a redacted PDF anywhere, do the test that someone trying to recover the information would do first: try to select the text under the redacted areas, and try searching the document for the words you redacted. If selection and search both come up empty, the underlying content is genuinely gone. If you can still select or search it — even if it looks fine visually — the information is still in the file and the redaction needs to be redone properly rather than papered over a second time.

For documents where the stakes are high — legal filings, responses to information requests, anything containing personal or financial data — this thirty-second check is worth doing every time, regardless of which tool produced the redaction.

Tools mentioned in this guide