Skip to content

Crawls and exports Classic Google Sites into a .docx file, preserving recipe titles and bullet lists. Written in Python.

Notifications You must be signed in to change notification settings

blueheron786/google-sites-to-google-doc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“ Google Sites to Google Doc

Classic Google Sites used to be awesome. Now ... not so much.

This tool helps you export your Classic Google Site into a .docx document, with:

  • βœ… One page per site section
  • βœ… Headers preserved
  • βœ… Bulleted lists retained
  • βœ… Easily copy-pasteable into Google Docs

πŸš€ Usage

  1. Set the site suffix
    Open main.py and set the SITE_SUFFIX value to match your Classic Google Site URL:

    SITE_SUFFIX = "yoursite"
    # If your site is https://sites.google.com/site/yoursite/
  2. Install dependencies

    pip install -r requirements.txt
  3. Run the converter

    python main.py
  4. Open and convert

    • Open the generated yoursite.docx in an office suite (e.g., LibreOffice, Word)
    • Insert a Table of Contents on the first page
    • Save it and upload to Google Docs
    • Use File > Save as Google Docs

⚠️ Limitations

  • 🧭 No auto-generated ToC: You'll need to insert a table of contents manually using a Word processor (LibreOffice, Word, etc.) before uploading to Google Docs.
  • πŸ“„ Limited formatting: Only titles, paragraphs, and bullet lists are extracted. No images, sub-headings, or fancy styling. Modify the script if you need richer formatting.
  • πŸ” Classic Sites only: This script only supports Classic Google Sites (i.e., URLs like sites.google.com/site/...). It won’t work with the newer Google Sites format.

πŸ™Œ Contributions

Pull requests welcome!

If you know how to:

  • Programmatically generate a proper ToC that Google Docs recognizes
  • Handle images or sub-headings
  • Improve site traversal or formatting fidelity

... feel free to open an issue or submit a PR!

About

Crawls and exports Classic Google Sites into a .docx file, preserving recipe titles and bullet lists. Written in Python.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Languages