Skip to content

Commit b35f806

Browse files
author
xiadong
committed
feat: expand ORC writer capabilities
1 parent 206f628 commit b35f806

10 files changed

Lines changed: 1165 additions & 42 deletions

File tree

README.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -12,16 +12,18 @@ See the [documentation](https://docs.rs/orc-rust/latest/orc_rust/) for examples
1212

1313
## Supported features
1414

15-
This crate currently only supports reading ORC files into Arrow arrays. Write support is planned
16-
(see [Roadmap](#roadmap)). The below features listed relate only to reading ORC files.
17-
At this time, we aim to support the [ORCv1](https://orc.apache.org/specification/ORCv1/) specification only.
15+
This crate supports reading ORC files into Arrow arrays and writing flat Arrow
16+
`RecordBatch`es to ORC files. At this time, we aim to support the
17+
[ORCv1](https://orc.apache.org/specification/ORCv1/) specification only.
1818

1919
- Read synchronously & asynchronously (using Tokio)
2020
- All compression types (Zlib, Snappy, Lzo, Lz4, Zstd)
2121
- All ORC data types
2222
- All encodings
2323
- Rudimentary support for retrieving statistics
2424
- Retrieving user metadata into Arrow schema metadata
25+
- Write Arrow arrays synchronously, with an async writer API for async sinks
26+
- Writer compression, row group indexes, bloom filters, and column statistics
2527

2628
## Roadmap
2729

@@ -32,9 +34,8 @@ The following lists the rough roadmap for features to be implemented, from highe
3234

3335
- Performance enhancements
3436
- Predicate pushdown
35-
- Row indices
36-
- Bloom filters
37-
- Write from Arrow arrays
37+
- Complete row index seek positions
38+
- Nested type write support
3839
- Encryption
3940

4041
A non-Arrow API interface is not planned at the moment. Feel free to raise an issue if there is such
@@ -120,4 +121,3 @@ To regenerate/update the [proto.rs](src/proto.rs) file, execute the [regen.sh](r
120121
```shell
121122
./regen.sh
122123
```
123-

0 commit comments

Comments
 (0)