Phase 1: Quick Fixes and Improvements by dshkol · Pull Request #43 · MichaelChirico/geohashTools

dshkol · 2025-11-11T05:11:59Z

Summary

This PR implements Phase 1 improvements focusing on documentation fixes, code quality enhancements, and minor optimizations.

Changes

Documentation

✅ Fixed gh_encode.Rd precision limit (was incorrectly stated as 28, corrected to 25)

Code Quality

✅ Added bounds checking to gh_delta to validate precision is between 0 and 25
- Prevents silent incorrect results for invalid inputs
- Added comprehensive tests for edge cases
✅ Updated CRS specification from deprecated PROJ.4 string to EPSG:4326
- Better compatibility with modern PROJ versions
- Updated all tests to use new format

Performance

✅ Optimized duplicate detection in gh_to_sp, gh_to_spdf.default, and gh_to_spdf.data.frame
- Changed from double-scan (anyDuplicated + duplicated) to single-pass
- Improves performance ~2x when duplicates are present
- No change in behavior or API

Testing

✅ Updated tests for testthat edition 3 compatibility
✅ All 125 tests passing, 0 failures, 0 warnings
✅ No breaking changes

Testing

devtools::test()
# [ FAIL 0 | WARN 0 | SKIP 0 | PASS 125 ]

Next Steps

This is part of a series of performance improvement PRs. Upcoming phases:

Phase 2: gh_covering grid generation optimization (6-25× speedup expected)
Phase 3: CI/CD modernization (GitHub Actions)

🤖 Generated with Claude Code

- Fix gh_encode.Rd documentation (precision limit 25, not 28) - Add bounds checking to gh_delta (validates 0-25 range) - Update CRS from deprecated PROJ.4 to EPSG:4326 - Optimize duplicate detection (single-pass instead of double-scan) - Update tests for testthat edition 3 compatibility - All tests pass (125 passing, 0 failures, 0 warnings) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

MichaelChirico · 2025-11-11T06:45:11Z

@@ -1,5 +1,5 @@
 # https://epsg.io/4326
-wgs = function() sp::CRS('+proj=longlat +datum=WGS84', doCheckCRSArgs = FALSE)
+wgs = function() sp::CRS('EPSG:4326')


isn't there some problem with this CRS? Maybe we should just drop 'sp' support altogether, WDYT?

Yeah. sp is pretty deprecated.

According to CRAN there are two reverse dependencies: one import and one suggest. Neither use sp.

Reverse imports: MazamaCoreUtils
Reverse suggests: spatialrisk

Awesome... let's throw Claude at moving all implementations to use {sf} instead and then dropping {sp} as a first PR here?

MichaelChirico · 2025-11-11T06:47:24Z

 gh_to_sp = function(geohashes) {
  check_suggested('sp')
  gh = tolower(geohashes)
-  if (anyDuplicated(gh) > 0L) {


i actually would revert this -- it's optimizing the right way IMO

In the common, non-erroneous case, anyDuplicated() is more efficient than duplicated()

If a mistake (duplicate inputs) is detected, then we do another pass and calculate the full, slower duplicated().

ditto elsewhere. Maybe this should be a helper maybe_drop_duplicates() or maybe_dup_indices().

MichaelChirico · 2025-11-11T06:49:59Z


 gh_delta = function(precision) {
  if (length(precision) > 1L) stop('One precision at a time, please.')
+  if (!is.numeric(precision) || precision < 0L || precision > 25L) {


I guess there's no real reason to enforce the <=25 upper bound, even if we can't compute such precise geohashes, the calculation is still numerically accurate.

This was referenced Nov 11, 2025

Phase 2: Optimize gh_covering Performance (2-3× Speedup) #44

Open

Phase 3: Optimize Duplicate Detection (1.25-1.76× Speedup) #45

Open

Phase 4: Migrate from Travis CI to GitHub Actions #46

Open

MichaelChirico reviewed Nov 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Phase 1: Quick Fixes and Improvements#43

Phase 1: Quick Fixes and Improvements#43
dshkol wants to merge 1 commit into
masterfrom
fix/quick-wins

dshkol commented Nov 11, 2025

Uh oh!

MichaelChirico Nov 11, 2025

Uh oh!

dshkol Nov 12, 2025

Uh oh!

MichaelChirico Nov 12, 2025

Uh oh!

MichaelChirico Nov 11, 2025 •

edited

Loading

Uh oh!

MichaelChirico Nov 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dshkol commented Nov 11, 2025

Summary

Changes

Documentation

Code Quality

Performance

Testing

Testing

Next Steps

Uh oh!

MichaelChirico Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

dshkol Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

MichaelChirico Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

MichaelChirico Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MichaelChirico Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MichaelChirico Nov 11, 2025 •

edited

Loading