Update documentation according to `v28` changelog #287

tharropoulos · 2025-01-30T12:55:45Z

Change Summary

Stemming and Word Handling

Custom Stemming Dictionaries
- Added new stemming.md documentation
- Added navigation menu item for stemming
- Added dictionary-based word mapping functionality
- Updated collection schema with stem_dictionary parameter
- Added pre-made English plurals dictionary with download link
Field-Level Token Controls
- Added field-level token_separators and symbols_to_index configuration
- Updated collections schema with field-level parameter documentation
- Added examples in search tips guide
- Documented precedence rules over collection-level settings

Search Result Sorting

Random Sorting
- Added _rand() sorting parameter documentation
- Documented seed value behavior and use cases
- Added examples for both seeded and unseeded random sorting
- Included tips for timestamp usage and combinations
Pivot-Based Sorting
- Added pivot sorting parameter documentation
- Documented ascending/descending pivot behavior
- Added examples with timestamp-based pivot sorting
- Described use cases like reference point proximity
Decay Function Sorting
- Added documentation for gauss, linear, and exp decay functions
- Included implementation details and parameter descriptions
- Added timestamp-based decay sorting examples
- Documented best practices for each decay type
Text Match Score Bucketing
- Added bucket_size parameter documentation
- Described grouping of results into relevance buckets
- Added examples showing bucketing with secondary sorting
- Documented bucket size behavior and recommendations

Search Enhancement

Hybrid Search Re-ranking
- Added rerank_hybrid_matches parameter documentation
- Updated vector search documentation with re-ranking behavior
- Added examples showing score differences with re-ranking
- Expanded semantic search guide with detailed explanations

Geographic Features

Geographic Polygons
- Added new geopolygon field type
- Documented polygon area storage and point-in-polygon queries
- Added creation and search examples for polygon territories
- Updated field types documentation

Collection Management

Collection Truncation
- Added truncate collection endpoint documentation
- Implemented code examples in all supported languages
- Added sample response format
- Documented difference between truncate and delete operations

PR Checklist

I have read and signed the Contributor License Agreement.

- add new `stemming.md` documentation explaining basic and custom stemming - add stemming menu item in navigation config.js - update collections schema docs with custom stemming functionality - update FAQs with custom stemming example and explanation

- add documentation for `_rand()` sorting parameter - document seed value behavior and constraints - add examples of random sorting with and without seeds - include tips about timestamp usage and combining with other sorts

- add documentation for `rerank_hybrid_matches` parameter - update vector-search.md with re-ranking behavior and examples - expand semantic-search guide with detailed re-ranking explanation - add code samples showing score differences with re-ranking

- add documentation for `pivot` sorting parameter - describe ascending and descending pivot sort behavior - include example with timestamp pivot sorting - document use cases and combination with other sort fields

- add disclaimer about rules-based stemming limitations - explain potential side effects with brand names and locations - clarify impact on search relevance for specialized content

- add decay function sorting documentation with gauss, linear, exp functions - include implementation details and parameter descriptions - add examples using timestamp-based decay sorting - document best practices and tips for each decay function

- add section about pre-made stemming dictionaries - include download link for english plurals dictionary - document benefits of using pre-made dictionary vs algorithmic stemming

- add support for field-level `token_separators` and `symbols_to_index` config - update collections schema documentation with field-level parameters - add example in search tips guide - clarify precedence over collection-level settings

- add `bucket_size` parameter for text match score sorting - implement grouping of results into relevance buckets - add examples demonstrating bucketing with secondary sort criteria - document bucket size behavior and best practices

- add truncate collection endpoint documentation - implement code examples in all supported languages - include sample response format - explain difference between truncate and delete operations

- add new `geopolygon` field type - implement polygon area storage and point-in-polygon queries - update field types documentation with geopolygon details - add examples of creating and searching polygon territories

- add control over fuzzy filter_by candidates limit - update documentation for filter parameters - add parameter description and default value - document use case for prefix filtering control

- add GET /operations/schema_changes endpoint documentation - include sample response showing progress metrics - document validation and alteration status tracking - explain empty response behavior for no changes

- add API endpoint for updating embedding model API keys - document PATCH request format for key updates - include example for OpenAI embedding model - add warning about required field parameters

- document default cosine similarity metric - explain distance_threshold behavior in different contexts - add details about sorting with distance thresholds - include examples of threshold usage

- Add explanation of `buckets` and `bucket_size` parameters in API docs - Restructure ranking documentation for better readability - Add detailed examples for both bucketing approaches

tharropoulos added 15 commits January 30, 2025 14:54

docs(sort_by): add random sorting functionality

0657780

- add documentation for `_rand()` sorting parameter - document seed value behavior and constraints - add examples of random sorting with and without seeds - include tips about timestamp usage and combining with other sorts

docs(sort_by): add pivot sorting functionality

347ee47

- add documentation for `pivot` sorting parameter - describe ascending and descending pivot sort behavior - include example with timestamp pivot sorting - document use cases and combination with other sort fields

docs(stemming): clarify porter stemming behavior

6b16e7b

- add disclaimer about rules-based stemming limitations - explain potential side effects with brand names and locations - clarify impact on search relevance for specialized content

docs(stemming): add pre-made english plurals dictionary

7a1bcbe

- add section about pre-made stemming dictionaries - include download link for english plurals dictionary - document benefits of using pre-made dictionary vs algorithmic stemming

docs(sort_by): add text match score bucketing

c18b90e

- add `bucket_size` parameter for text match score sorting - implement grouping of results into relevance buckets - add examples demonstrating bucketing with secondary sort criteria - document bucket size behavior and best practices

docs(collections): add collection truncate operation

43b16a2

- add truncate collection endpoint documentation - implement code examples in all supported languages - include sample response format - explain difference between truncate and delete operations

docs(geo-poly): add support for geographic polygons

b19a8b9

- add new `geopolygon` field type - implement polygon area storage and point-in-polygon queries - update field types documentation with geopolygon details - add examples of creating and searching polygon territories

docs(search): add max_filter_by_candidates parameter

6466790

- add control over fuzzy filter_by candidates limit - update documentation for filter parameters - add parameter description and default value - document use case for prefix filtering control

docs(collections): add schema change status endpoint

b1eb0f4

- add GET /operations/schema_changes endpoint documentation - include sample response showing progress metrics - document validation and alteration status tracking - explain empty response behavior for no changes

docs(vector): add remote model API key update support

bb2ebf5

- add API endpoint for updating embedding model API keys - document PATCH request format for key updates - include example for OpenAI embedding model - add warning about required field parameters

docs(vector): clarify distance metrics behavior

ca96eee

- document default cosine similarity metric - explain distance_threshold behavior in different contexts - add details about sorting with distance thresholds - include examples of threshold usage

tharropoulos marked this pull request as ready for review January 31, 2025 07:23

tharropoulos requested a review from kishorenc January 31, 2025 07:23

tharropoulos added 2 commits January 31, 2025 10:01

docs(search): enhance documentation for text match score bucketing

87b0b91

- Add explanation of `buckets` and `bucket_size` parameters in API docs - Restructure ranking documentation for better readability - Add detailed examples for both bucketing approaches

docs(buckets): fix secondary field emphasis mention

98d63d0

kishorenc approved these changes Jan 31, 2025

View reviewed changes

kishorenc merged commit d8f49f0 into typesense:v28.0 Jan 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update documentation according to `v28` changelog #287

Update documentation according to `v28` changelog #287

tharropoulos commented Jan 30, 2025 •

edited

Loading

Update documentation according to v28 changelog #287

Update documentation according to v28 changelog #287

Conversation

tharropoulos commented Jan 30, 2025 • edited Loading

Change Summary

Stemming and Word Handling

Search Result Sorting

Search Enhancement

Geographic Features

Collection Management

PR Checklist

Update documentation according to `v28` changelog #287

Update documentation according to `v28` changelog #287

tharropoulos commented Jan 30, 2025 •

edited

Loading