๐บ๏ธ PhantomBlackOP Sitemap Generator



๐ง Overview
This repo powers a dynamic โsitemap.xmlโ generator that:
- ๐ Crawls a rendered sitemap page built with Google Sites
- ๐
Extracts precise last modified times from Google Sites metadata (
data-last-updated-at-time)
- ๐ฆ Decodes tweet timestamps using Twitter/X Snowflake ID logic
- โจ Gracefully assigns UTC fallback dates to non-crawlable links (like
mailto: or .app)
- ๐ Auto-generates and commits sitemap via GitHub Actions daily at 03:00 UTC
๐ ๏ธ Tech Stack
- Python 3.11
- Playwright for headless browsing and metadata scraping
- BeautifulSoup for DOM parsing
- GitHub Actions for CI/CD automation
โก Usage
To run locally:
python scripts/generate_sitemap.py
Sample output:
โ
Extracted 57 links.
https://www.trevorion.io/home โ lastmod: 2025-07-16T17:34:08.726000Z
...
โ
Sitemap successfully written to sitemap.xml
๐ฅ Credits
Created and curated by Trevor Grech (@Trevorion)
Home of the mythkeeperโs flame, daily experiments, and structured defiance.