- Active Mining of Data Streams [2004, SIAM]: It estimates the error of the model on the new data stream without knowing the true class labels. When significantly higher error is suspected, it investigates the true class labels of a selected number of examples in the most recent data stream to verify the suspected higher error. They took decision trees as the base model. The error is basing on the leaf statistic (probability distribution on each node) and the validation loss. (146)
- Active Learning from Data Streams [2007, ICDM]: Use a classifier ensemble to predict the labels. A Minimal Variance principle of the ensemble classifiers is introduced to guide instance labeling from data streams. In addition, a weight updating rule of ensemble classifiers is derived to ensure that our instance labeling process can adaptively adjust to dynamic drifting concepts in the data. (103)
- Unbiased Online Active Learning in Data Streams [2011, SIGKDD]: Binary classification problem. Select the instance with a higher entropy above the threshold. (117)
- Active Learning With Drifting Streaming Data [2014, TNNLS]: R-VAR. Address that many of the previous approaches apply static active learning to batches, which is not possible in data streams where historical instances cannot be stored in memory. This work proposed a framework, which not only select decide if to query the current instance, but also detect the corresponding concept drift. If the change is detect, switch the prediction function to the one trained while the change warning is signaled. They use a uncertainty sampling with a dynamic threshold and randomization. (258)
- Active and adaptive ensemble learning for online activity recognition from data streams [2017, Knowledge-Based Systems]: Multi-classification setting. Use ensemble classifiers and the weights are adjusted after each selection. Use uncertainty sampling with adaptive threshold. (25)
- Active learning for classifying data streams with unknown number of classes [2018, Neural Networks]: SAL. Also take into account possible emergence and fading of classes, known as concept evolution. Dirichlet mixture models and the stick breaking process are adopted and adapted to meet the requirements of online learning. Only labels of samples that are expected to reduce the expected future error are queried. (25)
- Adaptive Ensemble Active Learning for Drifting Data Stream Mining [2019, IJCAI]: EAL-MAB. A plug-in solution, capable of working with most of existing streaming ensemble classifiers. They this process as a Multi-Armed Bandit problem, obtaining an efficient and adaptive ensemble active learning procedure by selecting the most competent classifier from the pool for each query. Guide the instance selection by measuring the generalization capabilities of the classifiers. The proposed approach selects dynamically the most competent classifier to be responsible for the query decision. Selecting instances for label query by measuring the increase in generalization capabilities of the classifier according to a metric m on a separate validation set V. (3)
- QActor: On-line Active Learning for Noisy Labeled Stream Data [Arxiv, 2020]
- Active Learning for Deep Neural Networks on Edge Devices [2021, Arxiv]
- Online Active Learning for Drifting Data Streams [2021, TNNLS]
- A comprehensive active learning method for multiclass imbalanced data streams with concept drift [2021, KBS]
- Stream‐based active learning for sliding windows under the influence of verification latency [2021, Machine Learning]
- Online Active Learning with Surrogate Loss Functions [2021, NeurIPS]: ALPS. Query the current instance based on the disagreement between hypotheses and to construct pseudo-labels fot the non-queried instance. The models are trained on both labeled and pseudo-labeled data.
- Mining Drifting Data Streams on a Budget: Combining Active Learning with Self-Labeling [2021]
- Online Active Learning with Dynamic Marginal Gain Thresholding [2022]
- Online active classification via margin‐based and feature‐based label queries [2022, Machine Learning]
- Reinforced Meta Active Learning [2022]: Treat AL as an agent.
- Online active learning for an evolving fuzzy neural classifier based on data density and specificity [2022, Neurocomputing]
- Improved Algorithms for Neural Active Learning [2022]
- An Online Active Broad Learning Approach for Real-Time Safety Assessment of Dynamic Systems in Nonstationary Environments [2022, TNNLS]: Online incremental update the model.
- Clustering-based Active Learning Classification towards Data Stream [2023, ACM Trans. Intell. Syst. Technol.]
- An active learning budget-based oversampling approach for partially labeled multi-class imbalanced data streams [2023, SAC]
- Adaptive Selective Sampling for Online Prediction with Experts [2023]
- Streaming Active Learning with Deep Neural Networks [2023]
- Active Cost-aware Labeling of Streaming Data [2023, AISTATS]
- Continual Active Adaptation to Evolving Distributional Shifts [2022, CVPR]
- Few-Shot Continual Active Learning by a Robot [2022, NeurIPS]
- Meta-learning for dynamic tuning of active learning on stream classification [2023, Pattern Recognition]
The feature space expands over time.
- Online Passive-Aggressive Active Learning for Trapezoidal Data Streams [2022, TNNLS]