-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EDTF Full Date Processor #141
base: 2.x
Are you sure you want to change the base?
Conversation
This has been on my TODO for ages. Very excited to see this! At a quick glance, it looks like it doesn't handle Xs in the date, or seasonal dates. I think any Xs in the year could just be converted to 0s. For example, should 199X be transformed to 1990 for sorting purposes? For seasons, I think a mapping to a single day would probably work best. From https://www.loc.gov/standards/datetime/ I think some of these could be a simple map, for example, 1990-25 could probably map to 1990-04-01 rather than determining the first day of spring for the given year, which should be close enough for sorting purposes. This would likely be better for winter as well, since winter 2025 should sort to the start of 2025, not the end. It also needs to support dates with one or more ~, ?, and/or %. But for the purposes of sorting, we could just strip those out before running the code you have now. Something like this should work |
I’ve implemented all the requested changes:
|
Awesome, thanks. I will try to test this as soon as possible |
Actually, I wonder if this code is doing the same thing as controlled_access_terms/src/EDTFUtils.php Line 338 in eddcbe9
If it is, maybe we can simplify this PR by reusing that existing code? |
I have just looked into this functionality and note that a lot of code overlaps. However, before proceeding, I want to clarify a couple of differences:
only selects the first date in a set rather than processing all values. My current functionality, however, stores all dates and only sorts based on the earliest date when sorting. How would you like me to proceed with this? |
We have a tech call tomorrow. It's open to everyone, if you would like to join, but I'm also happy to bring this to the group and report back on how to tackle this. You can find the link in the Islandora Slack if you would like to join us. My thoughts are that maybe we can implement your code into the existing functionality as long as that doesn't break anything, and then in this PR we could just call the existing function (with your updates) to generate the date for Solr. I also want to ask about the season mapping they are using, because for sorting purposes I don't think it will handle winter properly (#142) |
Talking about winter sorting... the EDTF spec itself is ambiguous. The ISO standard (that adopted the EDTF spec) states
Therefore, by my reading:
which is the current sort behavior, not the proposed sorting
However, I admit that the spec is still rather ambiguous. |
I think giving an option to customize what the season matches to is a reasonable option. Granted, it might be a bit much to require @francisayyad03 to add into the PR. Perhaps we focus on this as it is and then you could do a follow-up PR, @joshdentremont? Or you could fork @francisayyad03 's branch and add in the code which supercedes this one? We would credit both of you on the commit. That said, this PR makes some interesting conversions of valid EDTF dates into the Solr DatePoint format:
Granted, most of the weird ones aren't valid EDTF values, but a few are. (This list was taken from the ISO 8601 tests.) So, what should the code do with invalid EDTF strings? Give a weird value as it does or should we be throwing warnings? At least the 'Y' prefixed year one should be fixed. Note, I wanted to respond this morning, but I was hit with emergency system stuff until now. Unfortunately, I will be out tomorrow through next week for surgery/recovery; so I won't be able to review until I get back. |
@seth-shaw-asu @francisayyad03 Sounds good to me. If we push this through without the option to customize seasons, I can definitely add it in later in another PR. |
@seth-shaw-asu I think it's probably fine if the dates have weird mappings if they are dates that the edtf widget won't accept, but I agree that we should fix any that are valid edtf. Do you get similar results when using the ISO function That being said, if it's simple to piggy back off the way the widget detects whether the edtf is valid, maybe it would make sense to set all invalid edtf to some date in the distant past, or the distant future? |
Are we expected to support all EDTF-valid dates, or is supporting Level 0 and Level 1 sufficient? For context, Level 2 features like |
Invalid dates (or even valid EDTF dates that aren't supported) can probably be skipped if you generate a warning in the Drupal logger. I would expect at least levels 0 & 1 to be supported; but in any case the plugin configuration form's text should be very clear about what is not supported. |
@francisayyad03 we just discussed this again at the tech call and this is what came up:
Thanks again |
GitHub Issue: #140
What does this Pull Request do?
This PR introduces the updated EDTF Date Processor code to index full EDTF dates (including partial or multiple dates) in Solr, rather than just extracting the year. It consolidates date logic within the controlled_access_terms module, enabling more precise date-based sorting and filtering in Drupal’s Search API.
What's new?
Adds a new EDTFDateProcessor class that:
This change does not introduce new dependencies but does require updating the search index configuration to use this processor.
Existing code dealing strictly with “year-only” fields remains unaffected.
How should this be tested?
Enable EDTF Date Processor
Configure the Processor Settings
Add the
edtf_dates
Field to Your Indexedtf_dates
and save.Enter Test Content with Various EDTF Formats
Go to any content type that uses an EDTF field (as configured in step 2).
Add or edit content, then set the EDTF field to one of the following:
2012-12-12
2012-12-12T00:00:00Z
inedtf_dates
.2012-12
2012-12-01T00:00:00Z
.2012
2012-01-01T00:00:00Z
.{2013-12-12, 2014-09-09}
2013-12-12T00:00:00Z
2014-09-09T00:00:00Z
edtf_dates
.Check the Indexed Results in Solr
edtf_dates
field for the content item.Additional Notes:
Sets for Multiple Dates:
If you wish to allow multiple dates (e.g.,
{2013-12-12, 2014-09-09}
), ensure your EDTF field is configured to accept sets. The EDTF Date Processor will then index each date separately into theedtf_dates
array.Merged Dates from Multiple Fields:
For each field you configure in the EDTF Date Processor settings, all valid dates found are added to
edtf_dates
, sorted in ascending order. This means if multiple EDTF fields are selected, the resultingedtf_dates
field will contain dates from all of them.Sorting by edtf_dates:
If you configure your index or views to sort by
edtf_dates
, the earliest date (i.e., the first element in the sortededtf_dates
array) determines the sort order.Open Interval Settings:
If you specify a start year (
open_start_year
) and end year (open_end_year
), any dates falling outside that range (and not ignored by your configuration) will be excluded fromedtf_dates
. This can help narrow the indexed date range if you only need a specific window of time.Interested parties
Tag (@ mention) interested parties or, if unsure, @Islandora/committers
@Islandora/committers