DA Caselaw Document Processing

AWS Lambda function for document processing (privacy, metadata stripping, etc.).

Local Development and Testing

1. Run Tests in Docker container (matches CI/CD)

# From project root
./run-tests.sh

This builds the test Docker image with all required system dependencies (including pdfcpu for PDF processing) and runs the complete test suite in the same environment used in CI/CD, ensuring consistency between local development and deployment.

2. Test Lambda Locally

You can test the Lambda locally using Docker:

script/server

will start the server and

script/upload document_cleanser_lambda/test-event.json

will submit a valid Lambda event payload to your handler (see AWS docs for examples).

Output will be at the same filename with .output.json appended.

3. Linting and Pre-commit

Install and run pre-commit hooks to ensure code quality:

pip install pre-commit
pre-commit install
pre-commit run --all-files

4. Updating dependencies with poetry

Dependencies should be managed by renovate (see renovate.json). If you need to update dependencies, run poetry update; see pyproject.toml

5. Development Guidelines

Ensure all tests pass locally before opening a PR via ./script/test

Follow repo and code style guidelines.

Document new environment variables or requirements in the README.

Release process

Update the code
- Create a branch release/v{major}.{minor}.{patch}
- Update the version number in document_cleanser_lambda/lambda_function.py
- Update CHANGELOG.md for the release
- Commit and push
- Open a PR from that branch to main
- Get approval on the PR
Create a GitHub Release
- Create a new tag on main with the same version number.
- Generate release notes
- Publish the release
Deploy to production
- Go to the docker-build-and-deploy action
- Run a workflow using the newly tagged released against production
- Get approval for the action

Deployment

Terraform configuration

Detailed CI documentation

Changes to the main branch are deployed to staging.
Creating a new Github version and running a GitHub action is required for deploying to production.

Name		Name	Last commit message	Last commit date
Latest commit History 834 Commits
.github		.github
docs		docs
document_cleanser_lambda		document_cleanser_lambda
script		script
terraform		terraform
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.secrets.baseline		.secrets.baseline
CHANGELOG.md		CHANGELOG.md
LICENCE		LICENCE
README.md		README.md
pytest.ini		pytest.ini
renovate.json		renovate.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DA Caselaw Document Processing

Local Development and Testing

1. Run Tests in Docker container (matches CI/CD)

2. Test Lambda Locally

3. Linting and Pre-commit

4. Updating dependencies with poetry

5. Development Guidelines

Release process

Deployment

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

DA Caselaw Document Processing

Local Development and Testing

1. Run Tests in Docker container (matches CI/CD)

2. Test Lambda Locally

3. Linting and Pre-commit

4. Updating dependencies with poetry

5. Development Guidelines

Release process

Deployment

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages