The Little Falls, VA website advocates for renaming Falls Church, Virginia to Little Falls. It is built with the Hugo framework using the Vex Hugo theme.
- Clone this repository
- Install dependencies with `npm install`
- Run the development server with `npm run dev`
- View the site at http://localhost:1313
To build the site for production:

```bash
npm run build
```

The built site will be in the `public` directory.
This site is deployed on Netlify. Any push to the main branch will trigger a new build and deployment.
When writing research articles, use the following citation shortcode format:

```
{{< cite url="URL" title="Full Title of Source" >}}
```
Single citation:

```
Falls Church became a microcosm of Southern defiance {{< cite url="https://example.com/article#:~:text=relevant,text,here" title="Article Title - Publication Name" >}}.
```

Multiple citations in sequence:

```
This fact is supported by multiple sources {{< cite url="https://example1.com" title="First Source" >}} {{< cite url="https://example2.com" title="Second Source" >}}.
```
- Always include both the `url` and `title` parameters
- For URLs with text fragments (`#:~:text=`), include the full URL to reference specific quotes
- Place citations immediately after the relevant text, before any punctuation
- For multiple citations supporting the same statement, place them together without spacing
- Citations will render as clickable chips showing the domain name in uppercase
- Hovering over a citation will reveal the full title
- Citations in the Sources section should follow the same format
- Be specific with text fragments in URLs to point to exact quotes
- Use descriptive titles that include both the article name and publication
- Place citations logically to clearly indicate which statements they support
- When citing multiple sources, order them by relevance or chronologically
- Include a Sources section at the end of each article listing all citations
This repository contains tools to help with historical research about Falls Church, Virginia.
These scripts help extract text from PDF documents and add metadata front matter.
- `extract_pdf_text.py`: Basic extraction with limited OCR (first 30 pages)
- `extract_full_pdf.py`: Advanced extraction with batch processing for large PDFs
```bash
# Process a specific PDF with OCR for all pages:
python3 extract_full_pdf.py --pdf "YourPDFFile.pdf" --ocr

# Process a specific page range (for large PDFs):
python3 extract_full_pdf.py --pdf "YourPDFFile.pdf" --ocr --start-page 0 --max-pages 60

# Process the next batch:
python3 extract_full_pdf.py --pdf "YourPDFFile.pdf" --ocr --start-page 60 --max-pages 60
```
- Python 3.9+
- Tesseract OCR (`brew install tesseract`)
- Poppler (`brew install poppler`)
- Python packages: pytesseract, pdf2image, pdfminer.six
```bash
# Install required system dependencies
brew install tesseract poppler

# Install required Python packages
python3 -m pip install pytesseract pdf2image pdfminer.six
```
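As an optional sanity check after installing, a short script like the following (hypothetical, not part of the repository) confirms that Tesseract, Poppler, and the Python packages are all reachable:

```python
# Hypothetical sanity check (not a repository script): verify the OCR toolchain.
import shutil

import pytesseract
from pdf2image import convert_from_path  # import check; needs Poppler's pdftoppm on PATH
from pdfminer.high_level import extract_text  # import check for pdfminer.six

assert shutil.which("tesseract"), "Tesseract not found on PATH"
assert shutil.which("pdftoppm"), "Poppler (pdftoppm) not found on PATH"
print("Tesseract version:", pytesseract.get_tesseract_version())
print("Python packages imported successfully.")
```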
The `process_iiif_manifest.py` script processes IIIF manifest URLs from digital libraries (such as the Mary Riley Styles Public Library) and creates markdown files with metadata and image information.
- Extracts comprehensive metadata from IIIF manifests
- Creates markdown files with structured front matter
- Includes links to original images and manifests
- Uses descriptive titles for filenames
- Provides placeholders for adding historical significance notes
```bash
# Process a single IIIF manifest URL
python3 process_iiif_manifest.py "https://iiif.quartexcollections.com/mrspl/iiif/e3970652-8b7e-40e9-a9c9-d6dde46c2b42/manifest"

# Process multiple IIIF manifest URLs
python3 process_iiif_manifest.py "URL1" "URL2" "URL3"

# Process a list of URLs from a file
python3 process_iiif_manifest.py $(cat manifest_urls.txt)
```
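If a URL list grows long, a small wrapper (hypothetical, not part of the repository) can feed the script one URL at a time so a single bad manifest does not stop the whole batch:

```python
# Hypothetical wrapper (not a repository script): process manifest URLs one at
# a time from manifest_urls.txt, reporting any that fail.
import subprocess
from pathlib import Path

for url in Path("manifest_urls.txt").read_text().split():
    result = subprocess.run(["python3", "process_iiif_manifest.py", url])
    if result.returncode != 0:
        print(f"Failed to process: {url}")
```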
The script creates markdown files in the `.research/images/` directory with:
- Front matter containing all available metadata
- A description section
- A complete metadata section listing all fields
- Direct links to the full-resolution image
- Links to the original IIIF manifest
- A section for adding historical significance notes
The script extracts and includes these fields in the front matter when available:
- `title`: The title of the image
- `date`: The date of the image
- `subject`: The subject(s) of the image
- `creator`: The creator/photographer
- `location`: The place where the image was taken
- `format`: The format of the original (e.g., Photographs)
- `source`: The collection source
- `identifier`: The unique identifier
- `description`: A description of the image
- `color`: Color information (b/w or color)
- `dimensions`: The dimensions of the original
- `digitized`: Always set to True
- `manifest_url`: The URL of the IIIF manifest
- `image_url`: The URL to the full-resolution image
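To illustrate where these fields come from, here is a minimal sketch (not the repository script) of pulling a few of them out of a manifest with `requests`. It assumes the library serves IIIF Presentation API 2.x manifests; field handling in the actual script may differ.

```python
# Minimal sketch (not the repository script) of turning a IIIF manifest into
# YAML front matter. Assumes a IIIF Presentation API 2.x manifest structure.
import requests


def manifest_to_front_matter(manifest_url: str) -> str:
    manifest = requests.get(manifest_url, timeout=30).json()
    fields = {"title": manifest.get("label", ""), "manifest_url": manifest_url}
    # The 2.x "metadata" block is a list of label/value pairs (Date, Subject, ...).
    for entry in manifest.get("metadata", []):
        fields[str(entry.get("label", "")).strip().lower()] = entry.get("value", "")
    # The first canvas image, when present, points at the full-resolution file.
    try:
        canvas = manifest["sequences"][0]["canvases"][0]
        fields["image_url"] = canvas["images"][0]["resource"]["@id"]
    except (KeyError, IndexError):
        pass
    lines = [f'{key}: "{value}"' for key, value in fields.items()]
    return "---\n" + "\n".join(lines) + "\n---\n"


if __name__ == "__main__":
    print(manifest_to_front_matter(
        "https://iiif.quartexcollections.com/mrspl/iiif/"
        "e3970652-8b7e-40e9-a9c9-d6dde46c2b42/manifest"
    ))
```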
- Python 3.6+
- Python packages: requests
```bash
# Install required Python packages
python3 -m pip install requests
```
I need help extracting text from historical PDF documents. I have the following PDFs:
[List your PDFs here]
I've previously used scripts called extract_pdf_text.py and extract_full_pdf.py to extract text from PDFs and add front matter.
The scripts work by:
1. Attempting standard PDF text extraction first
2. Using OCR (Tesseract) if standard extraction fails
3. Processing large documents in batches (10 pages at a time)
4. Adding metadata front matter to the output markdown files
I'd like to extract all pages from [specific PDF] and add the following front matter:
- title: "[Title]"
- creator: "[Creator]"
- date: "[Date]"
- format: "[Format]"
- subject: "[Subject]"
- identifier: "[Identifier]"
- source: "[Source URL]"
Can you help me:
1. Update the script if needed
2. Run the extraction process
3. Combine the output into a single markdown file with proper front matter
I need to create markdown files with metadata from these IIIF manifest URLs:
[List your IIIF manifest URLs here]
I've previously used a script called process_iiif_manifest.py that:
1. Extracts metadata from IIIF manifests
2. Creates markdown files with detailed front matter
3. Includes links to the original images and manifests
4. Creates descriptive filenames based on the title
Can you help me:
1. Process these manifest URLs
2. Organize the resulting markdown files in my .research/images/ directory
3. Check for any errors or missing metadata