Remove duplicate H1 titles from MDX content files
Starlight automatically renders the frontmatter title as the page's H1, so a duplicate # Title in the body is redundant. Removed it from 29 content files across all sections.
parent 43d82d095e
commit 0bea793a09
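The cleanup described in the commit message could be scripted roughly as below. This is a minimal sketch, not the method actually used for this commit: the helper names `strip_duplicate_h1` and `rewrite_tree` are hypothetical, and it assumes each file opens with a `---` delimited frontmatter block and that the duplicate title is the first heading in the body.

```python
import re
from pathlib import Path

# Matches a YAML frontmatter block at the very top of the file.
FRONTMATTER = re.compile(r"\A---\n.*?\n---\n", re.DOTALL)
FENCE = "`" * 3  # marker that opens or closes a fenced code block


def strip_duplicate_h1(text: str) -> str:
    """Drop the first body '# Title' line (plus one following blank line)
    when the frontmatter already defines a title. Hypothetical helper."""
    match = FRONTMATTER.match(text)
    if match is None or not re.search(r"^title:", match.group(0), re.MULTILINE):
        return text  # no frontmatter title, so the body H1 is not a duplicate
    lines = text[match.end():].split("\n")
    in_fence = False
    for i, line in enumerate(lines):
        if line.startswith(FENCE):
            in_fence = not in_fence  # skip '#' comments inside code fences
        elif not in_fence and line.startswith("#"):
            if line.startswith("# "):
                # Remove the H1 and, if present, the blank line right after it.
                end = i + 2 if i + 1 < len(lines) and lines[i + 1] == "" else i + 1
                del lines[i:end]
            break  # only the first heading can be the duplicated title
    return match.group(0) + "\n".join(lines)


def rewrite_tree(root: Path) -> int:
    """Apply strip_duplicate_h1 to every .mdx file under root; return the
    number of files changed. The root directory is the caller's assumption."""
    changed = 0
    for path in sorted(root.rglob("*.mdx")):
        original = path.read_text(encoding="utf-8")
        updated = strip_duplicate_h1(original)
        if updated != original:
            path.write_text(updated, encoding="utf-8")
            changed += 1
    return changed
```

Calling `rewrite_tree(Path("src/content/docs"))` would process a typical Starlight content directory; that path is an assumption, since the file paths are not shown on this page. Running it a second time should change nothing, because the function stops at the first heading it finds.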
@@ -5,8 +5,6 @@ description: How Milton Waddams became the patron saint of legacy document proce
 
 import { Aside } from '@astrojs/starlight/components';
 
-# The Backstory
-
 > *"I was told I could listen to the radio at a reasonable volume from nine to eleven..."*
 
 ## The Relocation
@@ -5,8 +5,6 @@ description: The open source projects and cultural references that make mcwaddam
 
 import { Aside, Card, CardGrid, LinkCard } from '@astrojs/starlight/components';
 
-# Credits & Attributions
-
 > *"It's not just about me and my dream of doing nothing."*
 
 mcwaddams stands on the shoulders of giants — both technical and cinematic.
@@ -5,8 +5,6 @@ description: Found a bug? Have an idea? We want to hear it.
 
 import { Aside, Card, CardGrid, LinkCard } from '@astrojs/starlight/components';
 
-# Feedback
-
 > *"Excuse me, I believe you have my bug report..."*
 
 We want to hear from you — bugs, feature requests, or just letting us know what works.
@@ -5,8 +5,6 @@ description: Who's got the most pieces of documentation flair?
 
 import { Aside, Card, CardGrid } from '@astrojs/starlight/components';
 
-# Flair Leaderboard
-
 <div style="display: flex; justify-content: center; margin: 2rem 0;">
 <img src="/chotchkies-logo.webp" alt="Chotchkie's Bar & Grill" style="max-width: 300px; border-radius: 8px;" />
 </div>
@@ -5,8 +5,6 @@ description: How mcwaddams processes Office documents behind the scenes.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Architecture
-
 > *"So I was sitting in my cubicle today, and I realized... every document format is worse than the one before it."*
 
 mcwaddams is designed around a single principle: **never silently fail**. When extraction works, you get content. When it doesn't, you get a clear explanation why.
@@ -5,8 +5,6 @@ description: How mcwaddams tries multiple methods to extract your documents.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Fallback Strategy
-
 > *"We fixed the glitch."*
 
 When the primary extraction method fails, mcwaddams automatically tries alternatives.
@@ -5,8 +5,6 @@ description: The architectural decision behind mcwaddams's modular structure.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Why Mixins?
-
 > *"What would you say... you do here?"*
 
 mcwaddams uses Python mixins to organize 20 tools into logical groups without creating multiple MCP servers.
@@ -5,8 +5,6 @@ description: How MCP resources enable on-demand document access.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Resource System
-
 > *"I was told I could fetch chapters at a reasonable rate..."*
 
 The MCP resource system lets you access parts of indexed documents without reprocessing.
@@ -5,8 +5,6 @@ description: Get statistical analysis and data quality insights from spreadsheet
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Analyze Excel Data
-
 > *"I did nothing and it was everything I thought it could be."*
 
 Get comprehensive insights from Excel spreadsheets including statistics, data types, and quality checks.
@@ -5,8 +5,6 @@ description: Transform Word documents into clean Markdown.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Convert to Markdown
-
 > *"Why should I change? The document format's the one who sucks."*
 
 Convert Word documents to Markdown while preserving structure, headings, lists, and tables.
@@ -5,8 +5,6 @@ description: Get structured table data from Word documents in multiple formats.
 
 import { Aside, Steps, Tabs, TabItem } from '@astrojs/starlight/components';
 
-# Extract Tables from Word Documents
-
 > *"We need to talk about your table extraction..."*
 
 Word documents often contain important data locked in tables. mcwaddams extracts them as structured data you can actually use.
@@ -5,8 +5,6 @@ description: Use mcwaddams without installing anything locally.
 
 import { Aside, Code, Tabs, TabItem, Card, CardGrid } from '@astrojs/starlight/components';
 
-# Connect to Hosted Server
-
 > *"I was told there would be no installation..."*
 
 Don't want to install anything? Connect to our hosted mcwaddams server via HTTP.
@@ -5,8 +5,6 @@ description: Work with documents that exceed token limits.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Handle Pagination
-
 > *"Yeah, I'm gonna need you to go ahead and come in on Saturday for page 2..."*
 
 Documents over 25,000 tokens are automatically paginated. Use cursors to fetch subsequent pages.
@@ -5,8 +5,6 @@ description: Extract documents directly from HTTP/HTTPS URLs.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Process URLs
-
 > *"We fixed the glitch... by caching the download."*
 
 All tools accept HTTP/HTTPS URLs directly. Files are cached for 1 hour.
@@ -5,8 +5,6 @@ description: Get mcwaddams up and running in under a minute.
 
 import { Tabs, TabItem, Aside, Steps } from '@astrojs/starlight/components';
 
-# Installation
-
 > *"PC Load Letter? What the f*** does that mean?"*
 
 Don't worry. This is simpler than fixing the printer.
@@ -5,8 +5,6 @@ description: Extract your first document in 60 seconds.
 
 import { Aside, Steps, Code } from '@astrojs/starlight/components';
 
-# Quick Start
-
 > *"I'll be honest with you, I love his music. I do. I'm a Michael Bolton fan."*
 
 Let's get you extracting documents faster than you can say "TPS report cover sheet."
@@ -5,8 +5,6 @@ description: Specialized tools for Excel spreadsheet processing.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Excel Tools
-
 These 3 tools are specialized for `.xlsx`, `.xls`, and `.csv` files.
 
 ## analyze_excel_data
@@ -5,8 +5,6 @@ description: Complete list of supported Office document formats.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Format Support
-
 mcwaddams supports all major Microsoft Office formats, both modern and legacy.
 
 ## Modern Formats (2007+)
@@ -5,8 +5,6 @@ description: Resource URIs for on-demand document access.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# MCP Resources
-
 After indexing a document, access content via MCP resource URIs.
 
 ## Resource URI Schemes
@@ -5,8 +5,6 @@ description: Complete reference for all 20 mcwaddams MCP tools.
 
 import { Aside, Badge, Tabs, TabItem, Card, CardGrid } from '@astrojs/starlight/components';
 
-# All Tools Reference
-
 > *"I'm going to need you to go ahead and read the documentation..."*
 
 mcwaddams provides **20 tools** organized into three categories. Each tool follows the same pattern: pass a file path (local or URL), get structured data back.
@@ -5,8 +5,6 @@ description: Tools that work across all Office document formats.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Universal Tools
-
 These 7 tools work with all supported formats: `.docx`, `.doc`, `.xlsx`, `.xls`, `.pptx`, `.ppt`, `.csv`.
 
 ## extract_text
@@ -5,8 +5,6 @@ description: Specialized tools for Word document processing.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Word Tools
-
 These 10 tools are specialized for `.docx` and `.doc` files.
 
 ## convert_to_markdown
@@ -5,8 +5,6 @@ description: Test coverage philosophy and metrics.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Coverage Report
-
 > *"Looks like you've been missing a lot of work lately."*
 > *"I wouldn't say I've been missing it, Bob."*
 
@@ -5,8 +5,6 @@ description: "Did you get the memo about the TPS reports?"
 
 import { Aside, Badge, Card, CardGrid } from '@astrojs/starlight/components';
 
-# TPS Reports: Test Dashboard
-
 > *"Yeah, I'm gonna need you to go ahead and come in on Saturday..."*
 
 <Aside type="danger" title="TPS = Testing Painful Stuff">
@@ -5,8 +5,6 @@ description: "301 random Office documents walked into a bar..."
 
 import { Aside, Badge, Card, CardGrid } from '@astrojs/starlight/components';
 
-# Torture Test Results
-
 > *"I'm gonna need you to come in on Saturday... and Sunday too."*
 
 We grabbed 301 random Office documents from a real filesystem — no cherry-picking, no sanitizing, just raw production files from someone's decade-old archive.
@@ -5,8 +5,6 @@ description: Extract text from an Office document in 60 seconds.
 
 import { Aside, Steps, Code, Tabs, TabItem } from '@astrojs/starlight/components';
 
-# Your First Extraction
-
 > *"I'll be honest with you, I love extracting documents. I do. I'm a mcwaddams fan."*
 
 Let's get you extracting documents faster than you can say "TPS report cover sheet."
@@ -5,8 +5,6 @@ description: Efficiently access huge documents without loading everything at onc
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Indexing Large Documents
-
 > *"It's not that I'm lazy, it's that I just don't care about loading 500 pages at once."*
 
 For documents over 25,000 tokens, the indexing system enables on-demand fetching through MCP resources.
@@ -5,8 +5,6 @@ description: Handle .doc, .xls, and .ppt files from the basement archives.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Working with Legacy Formats
-
 > *"I was told I could keep my legacy documents at a reasonable location from nine to eleven..."*
 
 Legacy formats (`.doc`, `.xls`, `.ppt`) require special handling. mcwaddams uses OLE Compound Document parsing to extract content from files dating back to 1997.
@@ -5,8 +5,6 @@ description: Access document content through the MCP resource protocol.
 
 import { Aside } from '@astrojs/starlight/components';
 
-# Using MCP Resources
-
 > *"The thing is, Bob, it's not that I'm lazy, it's that I just don't want to load everything."*
 
 MCP resources let you fetch specific parts of indexed documents without reprocessing.