Using chunksize gives `TypeError: 'TextFileReader' object does not support item assignment` 

We've been using `python-dwca-reader` with no problems loading about 13k  occurrences. We now need to scale it up to load about 3.25m occurrences.

Changing the code from:
```
        core_df = dwca.pd_read('occurrence.txt', parse_dates=True)
```
to:
```
        for chunk in dwca.pd_read('occurrence.txt', parse_dates=True, chunksize=10):
        ...
```
causes the error:
```
    ...
    for chunk in dwca.pd_read('occurrence.txt', parse_dates=True, chunksize=10):
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/opt/asdf/installs/python/3.11.7/lib/python3.11/site-packages/dwca/read.py", line 209, in pd_read
    df[shorten_term(field['term'])] = field_default_value
    ~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: 'TextFileReader' object does not support item assignment
```

Looking at [`gbif-alert`](https://github.com/riparias/gbif-alert/blob/main/dashboard/management/commands/import_observations.py#L213), I see that you're using `enumerate(dwca)` rather than reading it in chunks, so I'll give that a try.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using chunksize gives `TypeError: 'TextFileReader' object does not support item assignment` #106

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Using chunksize gives TypeError: 'TextFileReader' object does not support item assignment #106

Description

Activity