Skip to content

Dependencies in setup.py have module conflicts. #1106

Open
@unsatisfying

Description

Background

Dependencies in setup.py have module conflicts.

Description

There are multiple dependencies mentioned in the setup.py file(the -> means the indirect dependencies):

tfds-nightly
mesh-tensorflow[transformer] -> tensorflow-datasets

To my knowledge, there seems to be a conflict between these two packages due to their shared module name, “tensorflow-datasets”.

During the pip installation process, both of these packages are installed simultaneously. However, pip does not isolate these two packages, but rather installs them both in the site-packages folder. This results in the same module (list below) from the latter installed package overwriting the one installed by the previous package. This overriding behavior, as one can imagine, may result in some functional errors.

# Note that these modules have different content in the two package
'tensorflow_datasets/core/as_dataframe.py'
'tensorflow_datasets/scripts/cli/build.py'
'tensorflow_datasets/core/dataset_builder_test.py'
'tensorflow_datasets/core/splits.py'
'tensorflow_datasets/core/file_adapters_test.py'
'tensorflow_datasets/core/file_adapters.py'
'tensorflow_datasets/core/dataset_builder.py'
'tensorflow_datasets/core/dataset_builders/huggingface_dataset_builder.py'
'tensorflow_datasets/version.py'

Steps to Reproduce

pip install xxx

Desired Change

Indeed, it is not an ideal behavior for modules to be overwritten, even if they are not actively used or if the overwritten module is the one being called. It introduces uncertainty and can cause issues in the long run, especially if there are changes or updates to the overwritten modules in future development. It is generally recommended to avoid such conflicts and ensure that only the necessary and compatible dependencies are declared in the requirements to maintain a stable and predictable environment for the project.

We believe that although this project can only modify direct dependencies and indirect dependencies are a black box, it is possible to add additional explanations rather than directly declaring both conflicting packages in the requirements.txt file. Or maybe you can check the dependencies and remove the redundant dependencies from the requirements.txt.

Adding extra explanations or documentation about the potential conflicts and the need to choose only one of the conflicting packages can help developers understand the issue and make informed decisions. Including a clear instruction or warning in the project’s documentation can guide users to choose the appropriate package based on their specific requirements.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions