Releases: pathwaycom/pathway
Releases · pathwaycom/pathway
v0.8.4
Fixed
- Pathway will only require
LiteLLMpackage, if you use one of the wrappers forLiteLLM. - Retries are implemented in
pw.io.airbyte.read. - State processing protocol is updated in
pw.io.airbyte.read.
v0.8.3
Added
- New parameters of
pw.UDFclass andpw.udfdecorator:return_type,deterministic,propagate_none,executor,cache_strategy. - The LLM Xpack now provides integrations with LlamaIndex and LangChain for running the Pathway VectorStore server.
Changed
- Subclassing
UDFSyncandUDFAsyncis deprecated.UDFshould be subclassed to create a new UDF. - Passing keyword arguments to
pw.apply,pw.apply_with_type,pw.apply_asyncis deprecated. In the future, they'll be used for configuration, not passing data to the function.
Fixed
- Fixed a minor bug with
Table.groupby()method which sometimes prevented of accessing certain columns in the followingreduce(). - Fixed warnings from using OpenAI Async embedding model in the VectorStore in Colab.
v0.8.2
Added
%:ztimezone format code tostrptime.- Support for Airbyte connectors
pw.io.airbyte.
v0.8.1
Added
- Introduced the
send_alertsfunction in thepw.io.slacknamespace, enabling users to send messages from a specified column directly to a Slack channel. - Enhanced the
pw.io.http.rest_connectorby introducing an additional argument calledrequest_validator. This feature empowers users to validate payloads and raise anHTTP 400error if necessary.
Fixed
- Addressed an issue in
pw.io.xpacks.llm.VectorStoreServerwhere the computation of the last modification timestamp for an indexed document was incorrect.
Changed
- Improved the behavior of
pw.io.kafka.write. It now includes retries when sending data to the output topic encounters failures.
v0.8.0
Added
pw.io.http.rest_connectornow supports multiple HTTP request types.pw.io.http.PathwayWebservernow allows Cross-Origin Resource Sharing (CORS) to be enabled on newly added endpoints- Wrappers for LiteLLM and HuggingFace chat services and SentenceTransformers embedding service are now added to Pathway xpack for LLMs.
Changed
pw.runnow includes an additional parameterruntime_typecheckingthat enables strict type checking at runtime.- Embedders in pathway.xpacks.llm.embedders now correctly process empty strings as queries.
- BREAKING:
pw.runandpw.run_allnow only accept keyword arguments.
Fixed
pw.Durationcan now be returned from User-Defined Functions (UDFs) or used as a constant value without resulting in errors.pw.io.debezium.readnow correctly handles tables that do not have a primary key.
v0.7.10
Added
pw.io.http.rest_connectorcan now generate Open API 3.0.3 schema that will be returned by the route/_schema.- Wrappers for OpenAI Chat and Embedding services are now added to Pathway xpack for LLMs.
- A vector indexing pipeline that allows querying for the most similar documents. It is available as class
VectorStoreas part of Pathway xpack for LLMs.
Fixed
pw.debug.table_from_markdownnow uses schema parameter (when set) to properly assign simple types (int, bool, float, str, bytes) and optional simple types to columns.
v0.7.9
Changed
pw.io.http.rest_connectornow also accepts port as a string for backwards compatibility.
v0.7.8
Added
- Support for comparisons of tuples has been added.
- Standalone versions of methods such as
pw.groupby,pw.join,pw.join_inner,pw.join_left,pw.join_right, andpw.join_outerare now available. - The
absfunction from Python can now be used on Pathway expressions. - The
asof_joinmethod now has configurable temporal behavior. Thebehaviorparameter can be used to pass the configuration. - The state of the
deduplicateoperator can now be persisted.
Changed
interval_joincan now work with intervals of zero length.- The
pw.io.http.rest_connectorcan now open multiple endpoints on the same port using a newpw.io.http.PathwayWebserverclass. - The
pw.xpacks.connectors.sharepoint.readandpw.io.gdrive.readmethods now support the size limit for a single object. If set, it will exclude too large files and won't read them.
v0.7.7
Added
- pathway.xpacks.llm.splitter.TokenCountSplitter.
v0.7.6
New Features
Conversion Methods in pw.Json
- Introducing new methods for strict conversion of
pw.Jsonto desired types within a UDF body:as_int()as_float()as_str()as_bool()as_list()as_dict()
DateTime Functionality
- Added
table.col.dt.utc_from_timestampmethod: CreatesDateTimeUtcfrom timestamps represented asints orfloats. - Enhanced the
table.col.dt.timestampmethod with a newunitargument to specify the unit of the returned timestamp.
Experimental Features
- Introduced an experimental xpack with a Microsoft SharePoint input connector.
Enhancements
Improved JSON Handling
- Index operator (
[]) can now be directly applied topw.Jsonwithin UDFs to access elements of JSON objects, arrays, and strings.
Expanded Timestamp Functionality
- Enhanced the
table.col.dt.from_timestampmethod to createDateTimeNaivefrom timestamps represented asints orfloats. - Deprecated not specifying the
unitargument of thetable.col.dt.timestampmethod.
KNNIndex Enhancements
KNNIndexnow supports returning computed distances.- Added support for cosine similarity in
KNNIndex.
Deprecated Features
- The
offsetargument ofpw.stdlib.temporal.slidingandpw.stdlib.temporal.tumblingis deprecated. Useorigininstead, as it represents a point in time, not a duration.
Bug Fixes
DateTime Fixes
- Sliding window now works correctly with UTC Datetimes.
asof_join Improvements
- Temporal column in
asof_joinno longer has to be namedt. asof_joinincludes rows with equal times for all values of thedirectionparameter.
Fixed Issues
- Fixed an issue with
pw.io.gdrive.read: Shared folders support is now working seamlessly.