Claude Cowork · Document Analysis

Document Analysis

Read any document in minutes. Extract exactly what matters. Cross-reference multiple sources. Get structured insights, not just summaries. Claude transforms how professionals interact with documents at every level.

📑 Contracts📊 Reports🔬 Research Papers📜 Policies🔎 Comparison

What Claude Can Do With Documents

Claude can process any text-based document you upload (PDF, DOCX, TXT, code files, CSVs) or paste directly. With a 200K token context window, it can process documents up to ~150,000 words in a single session — that's a full book, a lengthy legal contract, or an entire technical specification.

🔍

Comprehension & Summarization

Extract the core argument, main points, and key takeaways. Get executive summaries calibrated to any audience level.

🏷️

Classification & Categorization

Classify sections by type, priority, risk level, or topic. Tag and organize content at scale.

🔴

Risk & Gap Identification

Flag missing clauses, unusual terms, contradictions between sections, and areas requiring expert review.

🔄

Cross-Document Comparison

Upload multiple documents and ask Claude to compare, reconcile, or identify conflicts between them.

📋

Structured Extraction

Pull out specific data points in structured format — dates, parties, obligations, financial terms — ready for spreadsheet or database input.

Q&A Against Documents

Ask specific questions and get answers grounded in the document. "Does this contract give us the right to subcontract?" → direct answer with citation.

Document Types and Approaches

Document TypeBest Claude TaskKey Prompt Pattern
📑 Legal ContractsRisk flagging, clause comparison, obligation extraction"Flag deviations from standard [contract type] terms"
📊 Business ReportsExecutive summary, key metrics extraction, trend identification"Extract the 5 most actionable insights for a [role]"
🔬 Research PapersMethodology summary, findings extraction, limitation identification"Summarize using IMRAD structure in plain English"
📜 Policy DocumentsRequirement extraction, compliance gap analysis, plain-language summary"What do I need to do to comply with Section 4?"
📧 Email ThreadsSentiment analysis, decision extraction, summary for delegation"Summarize this thread and identify all open commitments"
📰 News/ArticlesBias identification, fact extraction, perspective synthesis"Summarize the claims and distinguish facts from opinion"
💻 Technical DocsImplementation summary, prerequisite extraction, risk flags"What do I need to do before Step 3 in this guide?"

Core Analysis Techniques

Technique 1: The Layered Summary

Get multiple depth levels of summary in one prompt — perfect for understanding a document at the right depth for different audiences:

Layered Summary Prompt
[DOCUMENT PASTED OR ATTACHED]

Summarize this document at three levels:
1. TWEET SUMMARY (under 280 characters — what's the one thing?)
2. EXECUTIVE SUMMARY (3-4 sentences — for a busy CEO who needs context to make a decision)
3. ANALYSIS SUMMARY (500 words — for the team that will implement or act on this)

End with: The 3 most important questions this document raises but doesn't answer.

Technique 2: Structured Q&A Against a Document

Document Q&A — Grounded Analysis
[DOCUMENT ATTACHED / PASTED]

Answer only based on the document above. For each answer:
- Quote the relevant section (with section number if available)
- Provide your plain-English interpretation
- Note if the document is ambiguous on this point

Questions:
1. What are our specific obligations under this agreement?
2. What happens if either party fails to perform?
3. What rights to our data does the other party receive?
4. How can this agreement be terminated, and at what cost?
5. Are there any auto-renewal clauses?

If the document doesn't address a question, say so explicitly — DO NOT answer from general knowledge.

Technique 3: Red Team the Document

Adversarial Document Review
[DOCUMENT ATTACHED / PASTED]

You are a skeptical [ROLE — e.g., "investor", "opposing counsel", "board member"] reviewing this document.

1. What are the document's 3 strongest arguments or most favorable terms?
2. What are its 3 biggest weaknesses, gaps, or risks?
3. What questions would you immediately ask after reading this?
4. What is NOT in this document that should be?
5. If you were negotiating against this, what would you target first?

Be direct and specific — cite the exact sections that concern you.

Multi-Document Comparison

Claude's large context window makes it uniquely capable at cross-document analysis — uploading multiple files and asking Claude to synthesize across them:

Multi-Document Comparison Prompt
I'm uploading [N] documents. Analyze them as a set. DOCUMENT 1: [label — e.g., "Current vendor contract"] [paste content] DOCUMENT 2: [label — e.g., "Proposed new contract"] [paste content] DOCUMENT 3: [label — e.g., "Industry standard template"] [paste content] Compare across: 1. KEY DIFFERENCES: What changed between Document 1 and 2? (table format) 2. GAPS VS. STANDARD: What protections in Document 3 are missing from Document 2? 3. MOST FAVORABLE: Which document is most favorable to us in [each key area]? 4. RECOMMENDATION: Should we sign Document 2 as-is, negotiate it, or reject it? Highlight any differences in: payment terms, liability limits, IP ownership, termination rights, data usage.

Structured Data Extraction

For high-volume document processing (multiple contracts, many research papers, batch email review), use structured extraction prompts that produce machine-readable outputs:

Contract Data Extraction — JSON Output
[CONTRACT PASTED]

Extract the following fields from this contract. Return ONLY a JSON object — no explanation:

{
  "effective_date": "",
  "parties": {"party_a": "", "party_b": ""},
  "contract_value": "",
  "payment_terms": "",
  "term_length": "",
  "auto_renewal": true/false,
  "renewal_notice_days": null,
  "termination_for_convenience": true/false,
  "termination_notice_days": null,
  "liability_cap": "",
  "ip_ownership": "",
  "data_usage_rights": "",
  "governing_law": "",
  "dispute_resolution": "",
  "high_risk_flags": ["list any clauses that warrant legal review"]
}
💡 Batch Processing Tip

For processing many documents (e.g., reviewing a vendor portfolio of 30 contracts), use this JSON extraction approach for each document. The consistent structure means you can paste all outputs into a spreadsheet and immediately see which contracts have auto-renewal, which have unlimited liability, etc. — without reading every page word-by-word.

Corporate Document Workflows

🏢

Due Diligence

M&A teams use Claude to triage hundreds of documents in a data room — extracting key terms, flagging anomalies, and building a structured issues log before expensive lawyers review the flagged items.

📋

Policy Development

HR and compliance teams use Claude to compare existing policies against regulatory requirements, identify gaps, and draft updated language that is legally sound and employee-readable.

📰

Research Synthesis

Research teams upload batches of papers and have Claude create a structured literature map — what's been proven, what's disputed, what gaps exist, and which papers are most methodologically sound.

📑

RFP/RFI Processing

Procurement teams use Claude to process vendor RFP responses — extracting capability claims, comparing them against requirement criteria, and generating a structured evaluation scorecard.

Expert Tips for Document Analysis

💡 Pro Tip 1 — Tell Claude What You're Looking For

"Summarize this contract" produces a generic summary. "Summarize this contract for a CFO who needs to understand the financial obligations, caps on liability, and payment terms" produces a financially-focused analysis. Always specify your role and what you'll use the output for.

💡 Pro Tip 2 — Ground Claude to the Document

Add "Based ONLY on the document provided. Do not draw on outside knowledge. If the document doesn't address a question, say so explicitly." This prevents Claude from filling gaps with plausible-sounding but unverified information — critical for legal and compliance work.

💡 Pro Tip 3 — Use Section References

Ask Claude to cite the exact section for every claim: "For each point you make, reference the specific section, paragraph, or page number." This makes it easy to verify Claude's analysis against the original document.

⚠️ Know the Limits

Claude reads text — it cannot process images embedded in PDFs (charts, scanned pages, signatures). It also cannot access linked URLs within documents. For scanned documents, use an OCR tool first to convert to text. For anything with legal or financial consequence, use Claude's analysis as a first pass and have professionals verify critical findings.