Skip to content

Conversation

@CRZbulabula
Copy link
Contributor

@CRZbulabula CRZbulabula commented Nov 26, 2025

This PR introduces significant improvements in the model storage, loading, and inference pipeline management for better extensibility, efficiency, and ease of use. The changes include the refactoring of model storage to support a wider range of models, streamlining the model loading process, and the introduction of a unified inference pipeline. These improvements aim to optimize model management, reduce memory usage, and enhance the overall inference workflow.

  • Model Storage Refactoring

    • Extended Support for Models: The system now supports not only built-in models like TimerXL and Sundial but also allows the integration of fine-tuned and user-defined models.
    • Unified Model Management: A new model management system enables model registration, deletion, and loading from both local paths and Hugging Face.
    • Code Optimization: Redundant code from previous versions has been removed, and hard-coded model management has been replaced by a more flexible approach that integrates seamlessly with the Hugging Face Transformers ecosystem.
  • Model Loading Refactoring

    • Simplified Model Loading: The previous custom loading logic with complex if...else... conditions has been replaced by a unified model loading interface, simplifying the process.
    • Automatic Model Type Detection: The system now automatically detects the model type and selects the appropriate loading method, supporting models from Transformers, sktime, and PyTorch.
    • Lazy Loading: The PR introduces lazy loading for Python modules, eliminating the need to load multiple modules at startup, reducing initialization time and memory consumption.
  • Inference Pipeline Addition

    • Unified Inference Workflow: The introduction of the Inference Pipeline encapsulates the entire model inference process, offering a standardized interface for preprocessing, inference, and post-processing.
    • Support for Multiple Tasks: The pipeline is versatile, supporting various inference tasks such as prediction, classification, and dialogue-based tasks.

@CRZbulabula CRZbulabula marked this pull request as draft November 26, 2025 09:41
@codecov
Copy link

codecov bot commented Nov 26, 2025

Codecov Report

❌ Patch coverage is 15.38462% with 154 lines in your changes missing coverage. Please review.
✅ Project coverage is 38.92%. Comparing base (100c5a3) to head (9a726eb).
⚠️ Report is 11 commits behind head on master.

Files with missing lines Patch % Lines
...ache/iotdb/db/protocol/client/an/AINodeClient.java 1.88% 104 Missing ⚠️
...apache/iotdb/commons/client/ClientPoolFactory.java 0.00% 14 Missing ⚠️
...ion/config/executor/ClusterConfigTaskExecutor.java 9.09% 10 Missing ⚠️
...relational/function/tvf/ForecastTableFunction.java 11.11% 8 Missing ⚠️
...he/iotdb/db/queryengine/plan/udf/UDTFForecast.java 0.00% 7 Missing ⚠️
...ecution/operator/process/ai/InferenceOperator.java 0.00% 3 Missing ⚠️
...e/client/async/AsyncAINodeHeartbeatClientPool.java 0.00% 2 Missing ⚠️
...ode/procedure/impl/node/RemoveAINodeProcedure.java 0.00% 2 Missing ⚠️
...che/iotdb/db/protocol/client/ConfigNodeClient.java 0.00% 2 Missing ⚠️
.../db/protocol/client/DataNodeClientPoolFactory.java 95.00% 1 Missing ⚠️
... and 1 more
Additional details and impacted files
@@             Coverage Diff              @@
##             master   #16819      +/-   ##
============================================
+ Coverage     38.76%   38.92%   +0.15%     
  Complexity      207      207              
============================================
  Files          5006     5012       +6     
  Lines        332106   332171      +65     
  Branches      42232    42286      +54     
============================================
+ Hits         128756   129287     +531     
+ Misses       203350   202884     -466     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

remove useless codes in IoTDB
Update AINodeInstanceManagementIT.java
@sonarqubecloud
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants