Description
How to contributing to pyiceberg-0.9.0
Step 1 to find problem statement?
- fork or sync repo
- clone or pull locally
- run setup and tests
- find scope for improvement on issues or slack
- reach back to community for help
A. Scope: Issue-1223 on version 0.7.1 Oct 8
User wants a Count rows as a metadata-only operation.
The python-iceberg repository has released 0.8.1 version on Nov 19 2024.
The library support a function called inspect that can help a user quickly get insights on the table.metadata
- test partition row count using inspect
Step 2: to understand root cause analytics
- Write a test case to recreate the issue locally
- Find source of exception, lack of function
- Handle the test in pytest
- Proposing solution to the community on the issue and slack
- create a pr on your folk
B. Create a use case to understand the issue
With the 0.8.1 release a new feature got integrated that gives inspects the table using metadata only.
Inspects
0.7.1: Fix delete to trace existing manifests when a data file is partially rewritten
so even when we are rewriting the data partially, we still need to add the new manifestentries as "existing" entries in order to track the new data files that are re-written.
these files are unaffected by the delete and should be kept in the manifest as an existing entry.
0.7.1 pytest: tests/intergration/test_writes/test_writes.py
-
test_delete_threshold()
-
load minio catalog
-
create schema
-
partition specification
-
clean environment for testing
-
exception handling
-
create table
-
generate test data
-
design test
Let's try it out and understand root cause of this issue