每月论文更新 - 2025年09月02日

## 最后更新：2025-09-02 00:09
**本次更新执行命令**
```
D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8
```

**参数详解**
- 关键词：`efficient RL`, `partial observable markov decision process/pomdp`, `sparse reward reinforcement learning`, `casual RL/counterfactual RL/casual reinforcement learning`, `causal inference/causal discovery/counterfactual reasoning`, `video super resolution`, `knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding`, `combinatorial game theory/xiangqi/chinese chess`, `code llm`, `speech recognition`, `zero shot tracking/few shot tracking/pose tracking/pose estimation`, `text to 3d/image to 3d/text to texture`, `automated theorem proving/interactive theorem proving/formal verification`
- 排除关键词：`multi-agent`, `multiagent`
- 每关键词最大结果：`8`
- 目标领域：`cs`, `stat`
- 每关键词重试次数：`3`


## 论文汇总（201篇）

**更好的阅读体验请访问 [Github页面](https://github.com/dbsxdbsx/MyAutoPapers)。**


### 1. efficient RL
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[rStar2-Agent: Agentic Reasoning Technical Report](http://arxiv.org/abs/2508.20722v1)** | 2025-08-28 |
| **2** | **[M2IO-R1: An Efficient RL-Enhanced Reasoning Framework for Multimodal Retrieval Augmented Multimodal Generation](http://arxiv.org/abs/2508.06328v1)** | 2025-08-08 |
| **3** | **[Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle](http://arxiv.org/abs/2508.05612v2)** | 2025-08-07 |
| **4** | **[MindSpeed RL: Distributed Dataflow for Scalable and Efficient RL Training on Ascend NPU Cluster](http://arxiv.org/abs/2507.19017v1)** | 2025-07-25 |
| **5** | **[Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models](http://arxiv.org/abs/2507.17107v2)** | 2025-07-23 |
| **6** | **[Efficient RL for optimizing conversation level outcomes with an LLM-based tutor](http://arxiv.org/abs/2507.16252v1)** | 2025-07-22 |
| **7** | **[Efficient RL Training for Reasoning Models via Length-Aware Optimization](http://arxiv.org/abs/2505.12284v2)** | 2025-05-18 |
| **8** | **[RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$](http://arxiv.org/abs/2306.15909v6)** | 2023-06-28 |
### 2. partial observable markov decision process/pomdp
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Convergence of regularized agent-state-based Q-learning in POMDPs](http://arxiv.org/abs/2508.21314v1)** | 2025-08-29 |
| **2** | **[Uncertainty-Resilient Active Intention Recognition for Robotic Assistants](http://arxiv.org/abs/2508.19150v1)** | 2025-08-26 |
| **3** | **[A coalgebraic perspective on predictive processing](http://arxiv.org/abs/2508.16877v1)** | 2025-08-23 |
| **4** | **[Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling](http://arxiv.org/abs/2508.16876v2)** | 2025-08-23 |
| **5** | **[Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction](http://arxiv.org/abs/2508.15128v1)** | 2025-08-20 |
| **6** | **[Towards Agent-based Test Support Systems: An Unsupervised Environment Design Approach](http://arxiv.org/abs/2508.14135v1)** | 2025-08-19 |
| **7** | **[Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation](http://arxiv.org/abs/2508.11204v1)** | 2025-08-15 |
| **8** | **[Sensitivity of Filter Kernels and Robustness to Incorrect Transition and Measurement Kernel Perturbations in Partially Observable Stochastic Control](http://arxiv.org/abs/2508.10658v2)** | 2025-08-14 |
| **9** | **[Learning-Enabled Adaptive Power Capping Scheme for Cloud Data Centers](http://dx.doi.org/10.1109/TSG.2025.3598070)** | 2025-08-09 |
| **10** | **[Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs](http://arxiv.org/abs/2505.09518v3)** | 2025-05-14 |
| **11** | **[Hierarchical Object-Oriented POMDP Planning for Object Rearrangement](http://arxiv.org/abs/2412.01348v3)** | 2024-12-02 |
| **12** | **[Maintenance Optimization for Asset Networks with Unknown Degradation Parameters](http://arxiv.org/abs/2410.18246v2)** | 2024-10-23 |
| **13** | **[Pessimistic Iterative Planning with RNNs for Robust POMDPs](http://arxiv.org/abs/2408.08770v4)** | 2024-08-16 |
### 3. sparse reward reinforcement learning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning](http://arxiv.org/abs/2508.18420v1)** | 2025-08-25 |
| **2** | **[SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning](http://arxiv.org/abs/2506.01096v2)** | 2025-06-01 |
| **3** | **[DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning](http://arxiv.org/abs/2505.19850v1)** | 2025-05-26 |
| **4** | **[STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs](http://arxiv.org/abs/2505.15804v3)** | 2025-05-21 |
| **5** | **[Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model](http://arxiv.org/abs/2503.11339v2)** | 2025-03-14 |
| **6** | **[Hedging with Sparse Reward Reinforcement Learning](http://arxiv.org/abs/2503.04218v1)** | 2025-03-06 |
| **7** | **[Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations](http://arxiv.org/abs/2412.01114v2)** | 2024-12-02 |
| **8** | **[Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning](http://arxiv.org/abs/2309.04459v2)** | 2023-09-08 |
### 4. casual RL/counterfactual RL/casual reinforcement learning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL](http://arxiv.org/abs/2502.12436v3)** | 2025-02-18 |
| **2** | **[Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation](http://arxiv.org/abs/2012.09092v1)** | 2020-12-16 |
### 5. causal inference/causal discovery/counterfactual reasoning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions](http://arxiv.org/abs/2508.21742v1)** | 2025-08-29 |
| **2** | **[Treatment effects at the margin: Everyone is marginal](http://arxiv.org/abs/2508.21583v1)** | 2025-08-29 |
| **3** | **[ORCA: ORchestrating Causal Agent](http://arxiv.org/abs/2508.21304v1)** | 2025-08-29 |
| **4** | **[ChainReaction! Structured Approach with Causal Chains as Intermediate Representations for Improved and Explainable Causal Video Question Answering](http://arxiv.org/abs/2508.21010v1)** | 2025-08-28 |
| **5** | **[Understanding and evaluating computer vision models through the lens of counterfactuals](http://arxiv.org/abs/2508.20881v1)** | 2025-08-28 |
| **6** | **[When Is Causal Inference Possible? A Statistical Test for Unmeasured Confounding](http://arxiv.org/abs/2508.20366v1)** | 2025-08-28 |
| **7** | **[Stochastic Gradients under Nuisances](http://arxiv.org/abs/2508.20326v1)** | 2025-08-28 |
| **8** | **[MOCHA: Discovering Multi-Order Dynamic Causality in Temporal Point Processes](http://arxiv.org/abs/2508.18873v1)** | 2025-08-26 |
| **9** | **[Explainable Counterfactual Reasoning in Depression Medication Selection at Multi-Levels (Personalized and Population)](http://arxiv.org/abs/2508.17207v1)** | 2025-08-24 |
| **10** | **[Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management](http://arxiv.org/abs/2508.16352v1)** | 2025-08-22 |
| **11** | **[A Logic of Stability: Formalizing Similarity in Counterfactual Reasoning](http://arxiv.org/abs/2508.12502v1)** | 2025-08-17 |
| **12** | **[ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks](http://arxiv.org/abs/2508.10956v1)** | 2025-08-14 |
| **13** | **[Inference on Nonlinear Counterfactual Functionals under a Multiplicative IV Model](http://arxiv.org/abs/2507.15612v2)** | 2025-07-21 |
| **14** | **[Boosting Temporal Sentence Grounding via Causal Inference](http://arxiv.org/abs/2507.04958v2)** | 2025-07-07 |
| **15** | **[Causal Feedback Discovery using Convergence Cross Mapping from Sea Ice Data](http://arxiv.org/abs/2505.09001v2)** | 2025-05-13 |
| **16** | **[What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning](http://arxiv.org/abs/2503.21055v6)** | 2025-03-27 |
| **17** | **[Causal resilience curves: A data-driven framework for quantifying the spatiotemporal impacts of metro service disruptions](http://arxiv.org/abs/2310.07514v2)** | 2023-10-11 |
| **18** | **[Sophisticated Learning: A novel algorithm for active learning during model-based planning](http://arxiv.org/abs/2308.08029v2)** | 2023-08-15 |
| **19** | **[Robust Universal Inference For Misspecified Models](http://arxiv.org/abs/2307.04034v4)** | 2023-07-08 |
| **20** | **[Integrating Large Language Model for Improved Causal Discovery](http://arxiv.org/abs/2306.16902v2)** | 2023-06-29 |
| **21** | **[A Survey on Causal Discovery: Theory and Practice](http://dx.doi.org/10.1016/j.ijar.2022.09.004)** | 2023-05-17 |
| **22** | **[Identifiability of causal graphs under nonadditive conditionally parametric causal models](http://arxiv.org/abs/2303.15376v6)** | 2023-03-27 |
### 6. video super resolution
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Structural Damage Detection Using AI Super Resolution and Visual Language Model](http://arxiv.org/abs/2508.17130v1)** | 2025-08-23 |
| **2** | **[Trajectory-aware Shifted State Space Models for Online Video Super-Resolution](http://arxiv.org/abs/2508.10453v1)** | 2025-08-14 |
| **3** | **[QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution](http://arxiv.org/abs/2508.04485v1)** | 2025-08-06 |
| **4** | **[Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework](http://arxiv.org/abs/2508.04090v1)** | 2025-08-06 |
| **5** | **[Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution](http://arxiv.org/abs/2508.00471v1)** | 2025-08-01 |
| **6** | **[RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution](http://arxiv.org/abs/2507.19138v1)** | 2025-07-25 |
| **7** | **[UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space](http://arxiv.org/abs/2505.19958v2)** | 2025-05-26 |
| **8** | **[Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution](http://arxiv.org/abs/2410.11506v3)** | 2024-10-15 |
### 7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Improving Biomedical Knowledge Graph Quality: A Community Approach](http://arxiv.org/abs/2508.21774v1)** | 2025-08-29 |
| **2** | **[Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering](http://arxiv.org/abs/2508.21773v1)** | 2025-08-29 |
| **3** | **[Geospatial Question Answering on Historical Maps Using Spatio-Temporal Knowledge Graphs and Large Language Models](http://arxiv.org/abs/2508.21491v1)** | 2025-08-29 |
| **4** | **[A Knowledge Distillation-empowered Adaptive Federated Reinforcement Learning Framework for Multi-Domain IoT Applications Scheduling](http://arxiv.org/abs/2508.21328v1)** | 2025-08-29 |
| **5** | **[MyGO: Memory Yielding Generative Offline-consolidation for Lifelong Learning Systems](http://arxiv.org/abs/2508.21296v1)** | 2025-08-29 |
| **6** | **[Addressing accuracy and hallucination of LLMs in Alzheimer's disease research through knowledge graphs](http://arxiv.org/abs/2508.21238v1)** | 2025-08-28 |
| **7** | **[Efficient Large-Scale Cross-Domain Sequential Recommendation with Dynamic State Representations](http://arxiv.org/abs/2508.20945v1)** | 2025-08-28 |
| **8** | **[Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision](http://arxiv.org/abs/2508.20729v1)** | 2025-08-28 |
| **9** | **[Unified Multi-task Learning for Voice-Based Detection of Diverse Clinical Conditions](http://arxiv.org/abs/2508.20717v1)** | 2025-08-28 |
| **10** | **[MobileCLIP2: Improving Multi-Modal Reinforced Training](http://arxiv.org/abs/2508.20691v1)** | 2025-08-28 |
| **11** | **[Enhancing Semantic Document Retrieval- Employing Group Steiner Tree Algorithm with Domain Knowledge Enrichment](http://arxiv.org/abs/2508.20543v1)** | 2025-08-28 |
| **12** | **[Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification](http://arxiv.org/abs/2508.20461v1)** | 2025-08-28 |
| **13** | **[KG-CQR: Leveraging Structured Relation Representations in Knowledge Graphs for Contextual Query Retrieval](http://arxiv.org/abs/2508.20417v2)** | 2025-08-28 |
| **14** | **[ATMS-KD: Adaptive Temperature and Mixed Sample Knowledge Distillation for a Lightweight Residual CNN in Agricultural Embedded Systems](http://arxiv.org/abs/2508.20232v1)** | 2025-08-27 |
| **15** | **[Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities](http://arxiv.org/abs/2508.19597v1)** | 2025-08-27 |
| **16** | **[Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions](http://arxiv.org/abs/2508.18725v1)** | 2025-08-26 |
| **17** | **[Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning](http://arxiv.org/abs/2508.17905v1)** | 2025-08-25 |
| **18** | **[CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation](http://arxiv.org/abs/2508.17324v1)** | 2025-08-24 |
| **19** | **[Information Ecosystem Reengineering via Public Sector Knowledge Representation](http://arxiv.org/abs/2508.15916v1)** | 2025-08-21 |
| **20** | **[Transplant Then Regenerate: A New Paradigm for Text Data Augmentation](http://arxiv.org/abs/2508.14723v1)** | 2025-08-20 |
| **21** | **[Statistical Comparative Analysis of Semantic Similarities and Model Transferability Across Datasets for Short Answer Grading](http://arxiv.org/abs/2508.15837v1)** | 2025-08-19 |
| **22** | **[Semantic Discrepancy-aware Detector for Image Forgery Identification](http://arxiv.org/abs/2508.12341v1)** | 2025-08-17 |
| **23** | **[Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics](http://arxiv.org/abs/2508.11017v2)** | 2025-08-14 |
| **24** | **[Physical Autoregressive Model for Robotic Manipulation without Action Pretraining](http://arxiv.org/abs/2508.09822v3)** | 2025-08-13 |
| **25** | **[What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge](http://arxiv.org/abs/2508.08344v2)** | 2025-08-11 |
| **26** | **[Pr$^2$R: Information-Fused and Style-Aware Privacy-Preserving Replay for Lifelong Person Re-Identification](http://arxiv.org/abs/2508.01587v2)** | 2025-08-03 |
| **27** | **[Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess](http://arxiv.org/abs/2507.00726v3)** | 2025-07-01 |
| **28** | **[On the Fundamental Impossibility of Hallucination Control in Large Language Models](http://arxiv.org/abs/2506.06382v5)** | 2025-06-04 |
| **29** | **[Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning](http://arxiv.org/abs/2505.17464v3)** | 2025-05-23 |
| **30** | **[FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization](http://arxiv.org/abs/2505.02515v3)** | 2025-05-05 |
| **31** | **[Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness](http://arxiv.org/abs/2504.05163v2)** | 2025-04-07 |
| **32** | **[VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models](http://arxiv.org/abs/2503.19530v3)** | 2025-03-25 |
| **33** | **[Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning](http://arxiv.org/abs/2503.08751v2)** | 2025-03-11 |
| **34** | **[Retrieval-Augmented Machine Translation with Unstructured Knowledge](http://arxiv.org/abs/2412.04342v2)** | 2024-12-05 |
| **35** | **[Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models](http://arxiv.org/abs/2411.07820v3)** | 2024-11-12 |
| **36** | **[SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models](http://arxiv.org/abs/2411.02433v3)** | 2024-11-01 |
| **37** | **[Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off](http://arxiv.org/abs/2402.14648v4)** | 2024-02-22 |
### 8. combinatorial game theory/xiangqi/chinese chess
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning](http://arxiv.org/abs/2507.12215v1)** | 2025-07-16 |
| **2** | **[On 3-terminal positions in Hex](http://arxiv.org/abs/2507.08247v2)** | 2025-07-11 |
| **3** | **[A number game reconciliation](http://arxiv.org/abs/2507.04717v1)** | 2025-07-07 |
| **4** | **[Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search](http://arxiv.org/abs/2506.15880v1)** | 2025-06-18 |
| **5** | **[Circular Game Coloring of Signed Graphs](http://arxiv.org/abs/2505.21586v1)** | 2025-05-27 |
| **6** | **[Computational and Algebraic Structure of Board Games](http://arxiv.org/abs/2503.01850v1)** | 2025-02-18 |
| **7** | **[RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community](http://dx.doi.org/10.1145/3706598.3714236)** | 2025-02-17 |
| **8** | **[Temperatures of Robin Hood](http://arxiv.org/abs/2501.07239v1)** | 2025-01-13 |
| **9** | **[On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory](http://arxiv.org/abs/2501.04412v2)** | 2025-01-08 |
| **10** | **[Complete Implementation of WXF Chinese Chess Rules](http://arxiv.org/abs/2412.17334v1)** | 2024-12-23 |
| **11** | **[Maker-Breaker on Galton-Watson trees](http://arxiv.org/abs/2412.08334v2)** | 2024-12-11 |
| **12** | **[Relationship between misère NIM and two-player GOISHI HIROI](http://arxiv.org/abs/2412.03996v1)** | 2024-12-05 |
| **13** | **[The Game Value of Sequential Compounds of Integers and Stars](http://arxiv.org/abs/2411.08611v1)** | 2024-11-13 |
| **14** | **[Mastering Chinese Chess AI (Xiangqi) Without Search](http://arxiv.org/abs/2410.04865v1)** | 2024-10-07 |
| **15** | **[XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi](http://arxiv.org/abs/2407.04678v1)** | 2024-07-05 |
| **16** | **[Shogi and Frieze group](http://arxiv.org/abs/2401.08591v2)** | 2023-11-15 |
| **17** | **[JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games](http://arxiv.org/abs/2308.04719v1)** | 2023-08-09 |
| **18** | **[Niel's Chess -- Rules for Xiangqi](http://arxiv.org/abs/2311.12181v2)** | 2023-06-27 |
| **19** | **[On the complexity of Dark Chinese Chess](http://arxiv.org/abs/2112.02989v1)** | 2021-12-06 |
### 9. code llm
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[RepoMark: A Code Usage Auditing Framework for Code Large Language Models](http://arxiv.org/abs/2508.21432v1)** | 2025-08-29 |
| **2** | **[The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion](http://arxiv.org/abs/2508.16131v1)** | 2025-08-22 |
| **3** | **[Hallucination in LLM-Based Code Generation: An Automotive Case Study](http://arxiv.org/abs/2508.11257v1)** | 2025-08-15 |
| **4** | **[VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models](http://arxiv.org/abs/2508.09945v1)** | 2025-08-13 |
| **5** | **[A Taxonomy of Inefficiencies in LLM-Generated Python Code](http://arxiv.org/abs/2503.06327v3)** | 2025-03-08 |
| **6** | **[RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation](http://arxiv.org/abs/2502.09183v2)** | 2025-02-13 |
| **7** | **[HAFix: History-Augmented Large Language Models for Bug Fixing](http://arxiv.org/abs/2501.09135v2)** | 2025-01-15 |
| **8** | **[Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs](http://arxiv.org/abs/2405.20179v4)** | 2024-05-30 |
### 10. speech recognition
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Towards Improved Speech Recognition through Optimized Synthetic Data Generation](http://arxiv.org/abs/2508.21631v1)** | 2025-08-29 |
| **2** | **[NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration](http://arxiv.org/abs/2508.21566v1)** | 2025-08-29 |
| **3** | **[Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?](http://dx.doi.org/10.1109/LSP.2025.3602636)** | 2025-08-28 |
| **4** | **[Benchmarking Large Pretrained Multilingual Models on Québec French Speech Recognition](http://arxiv.org/abs/2508.21193v1)** | 2025-08-28 |
| **5** | **[OLMoASR: Open Models and Data for Training Robust Speech Recognition Models](http://arxiv.org/abs/2508.20869v1)** | 2025-08-28 |
| **6** | **[Generative Annotation for ASR Named Entity Correction](http://arxiv.org/abs/2508.20700v1)** | 2025-08-28 |
| **7** | **[MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening](http://arxiv.org/abs/2508.20513v1)** | 2025-08-28 |
| **8** | **[OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset](http://arxiv.org/abs/2301.06375v2)** | 2023-01-16 |
### 11. zero shot tracking/few shot tracking/pose tracking/pose estimation
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning](http://arxiv.org/abs/2508.21363v1)** | 2025-08-29 |
| **2** | **[PHD: Personalized 3D Human Body Fitting with Point Diffusion](http://arxiv.org/abs/2508.21257v1)** | 2025-08-28 |
| **3** | **[COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans](http://arxiv.org/abs/2508.20920v1)** | 2025-08-28 |
| **4** | **[Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation](http://arxiv.org/abs/2508.20830v1)** | 2025-08-28 |
| **5** | **[ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments](http://arxiv.org/abs/2508.21096v1)** | 2025-08-27 |
| **6** | **[WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization](http://arxiv.org/abs/2508.19544v1)** | 2025-08-27 |
| **7** | **[PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation](http://arxiv.org/abs/2508.17239v2)** | 2025-08-24 |
| **8** | **[6-DoF Object Tracking with Event-based Optical Flow and Frames](http://arxiv.org/abs/2508.14776v1)** | 2025-08-20 |
| **9** | **[DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects](http://arxiv.org/abs/2508.11950v1)** | 2025-08-16 |
| **10** | **[Visuomotor Grasping with World Models for Surgical Robots](http://arxiv.org/abs/2508.11200v1)** | 2025-08-15 |
| **11** | **[Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM](http://arxiv.org/abs/2504.04844v2)** | 2025-04-07 |
| **12** | **[PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation](http://arxiv.org/abs/2504.02617v2)** | 2025-04-03 |
| **13** | **[Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation](http://arxiv.org/abs/2503.11652v2)** | 2025-03-14 |
| **14** | **[Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator](http://dx.doi.org/10.1109/LRA.2024.3519856)** | 2024-12-04 |
| **15** | **[OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB](http://arxiv.org/abs/2410.06694v2)** | 2024-10-09 |
| **16** | **[Faster Model Predictive Control via Self-Supervised Initialization Learning](http://arxiv.org/abs/2408.03394v2)** | 2024-08-06 |
| **17** | **[Matching Anything by Segmenting Anything](http://arxiv.org/abs/2406.04221v1)** | 2024-06-06 |
| **18** | **[Input-Output Extension of Underactuated Nonlinear Systems](http://arxiv.org/abs/2403.03117v5)** | 2024-03-05 |
| **19** | **[Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection](http://arxiv.org/abs/2308.04789v2)** | 2023-08-09 |
| **20** | **[Zero-Shot Anomaly Detection with Pre-trained Segmentation Models](http://arxiv.org/abs/2306.09269v1)** | 2023-06-15 |
| **21** | **[APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD](http://arxiv.org/abs/2305.17382v3)** | 2023-05-27 |
| **22** | **[Unifying Tracking and Image-Video Object Detection](http://arxiv.org/abs/2211.11077v2)** | 2022-11-20 |
| **23** | **[Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations](http://arxiv.org/abs/2206.10695v1)** | 2022-06-21 |
| **24** | **[The Multi-speaker Multi-style Voice Cloning Challenge 2021](http://arxiv.org/abs/2104.01818v1)** | 2021-04-05 |
### 12. text to 3d/image to 3d/text to texture
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View](http://arxiv.org/abs/2508.19508v1)** | 2025-08-27 |
| **2** | **[Structural Energy-Guided Sampling for View-Consistent Text-to-3D](http://arxiv.org/abs/2508.16917v1)** | 2025-08-23 |
| **3** | **[MV-RAG: Retrieval Augmented Multiview Diffusion](http://arxiv.org/abs/2508.16577v1)** | 2025-08-22 |
| **4** | **[Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality](http://arxiv.org/abs/2508.12498v1)** | 2025-08-17 |
| **5** | **[CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion](http://arxiv.org/abs/2508.11603v1)** | 2025-08-15 |
| **6** | **[Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors](http://arxiv.org/abs/2508.09629v1)** | 2025-08-13 |
| **7** | **[TexTailor: Customized Text-aligned Texturing via Effective Resampling](http://arxiv.org/abs/2506.10612v1)** | 2025-06-12 |
| **8** | **[CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx](http://arxiv.org/abs/2506.04931v1)** | 2025-06-05 |
| **9** | **[MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection](http://arxiv.org/abs/2505.04594v5)** | 2025-05-07 |
| **10** | **[SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models](http://arxiv.org/abs/2504.18684v2)** | 2025-04-25 |
| **11** | **[CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading](http://arxiv.org/abs/2504.06856v1)** | 2025-04-09 |
| **12** | **[Text-to-3D Generation using Jensen-Shannon Score Distillation](http://arxiv.org/abs/2503.10660v3)** | 2025-03-08 |
| **13** | **[ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models](http://arxiv.org/abs/2501.17895v1)** | 2025-01-28 |
| **14** | **[Improving Viewpoint Consistency in 3D Generation via Structure Feature and CLIP Guidance](http://arxiv.org/abs/2412.02287v4)** | 2024-12-03 |
| **15** | **[Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation](http://arxiv.org/abs/2411.16185v2)** | 2024-11-25 |
| **16** | **[MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D](http://arxiv.org/abs/2411.02336v1)** | 2024-11-04 |
| **17** | **[3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation](http://arxiv.org/abs/2410.18974v2)** | 2024-10-24 |
| **18** | **[Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control](http://arxiv.org/abs/2410.06985v1)** | 2024-10-09 |
| **19** | **[Localized Gaussian Splatting Editing with Contextual Awareness](http://dx.doi.org/10.1109/WACV61041.2025.00509)** | 2024-07-31 |
| **20** | **[REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment](http://arxiv.org/abs/2405.18525v2)** | 2024-05-28 |
| **21** | **[FlashTex: Fast Relightable Mesh Texturing with LightControlNet](http://arxiv.org/abs/2402.13251v3)** | 2024-02-20 |
### 13. automated theorem proving/interactive theorem proving/formal verification
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Verifying Probabilistic Regions of Attraction with Neural Lyapunov Functions for Stochastic Systems](http://arxiv.org/abs/2508.21213v1)** | 2025-08-28 |
| **2** | **[Formal Modeling and Verification of the Algorand Consensus Protocol in CADP](http://arxiv.org/abs/2508.19452v2)** | 2025-08-26 |
| **3** | **[Formal Verification of Physical Layer Security Protocols for Next-Generation Communication Networks (extended version)](http://arxiv.org/abs/2508.19430v2)** | 2025-08-26 |
| **4** | **[MoveScanner: Analysis of Security Risks of Move Smart Contracts](http://arxiv.org/abs/2508.17964v2)** | 2025-08-25 |
| **5** | **[Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs](http://arxiv.org/abs/2508.15878v1)** | 2025-08-21 |
| **6** | **[Repairing General Game Descriptions (extended version)](http://arxiv.org/abs/2508.10438v1)** | 2025-08-14 |
| **7** | **[TPTP World Infrastructure for Non-classical Logics](http://arxiv.org/abs/2508.09318v1)** | 2025-08-12 |
| **8** | **[Policy Design in Zero-Trust Distributed Networks: Challenges and Solutions](http://arxiv.org/abs/2508.04526v2)** | 2025-08-06 |
| **9** | **[Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction](http://arxiv.org/abs/2508.03613v1)** | 2025-08-05 |
| **10** | **[StepFun-Prover Preview: Let's Think and Verify Step by Step](http://arxiv.org/abs/2507.20199v3)** | 2025-07-27 |
| **11** | **[An ACL2s Interface to Z3](http://dx.doi.org/10.4204/EPTCS.423.10)** | 2025-07-25 |
| **12** | **[The AlphaPhysics Term Rewriting System for Marking Algebraic Expressions in Physics Exams](http://arxiv.org/abs/2507.18337v2)** | 2025-07-24 |
| **13** | **[Leveraging LLMs for Formal Software Requirements -- Challenges and Prospects](http://arxiv.org/abs/2507.14330v3)** | 2025-07-18 |
| **14** | **[Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization](http://arxiv.org/abs/2507.07399v2)** | 2025-07-10 |
| **15** | **[Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance](http://arxiv.org/abs/2507.07052v1)** | 2025-07-02 |
| **16** | **[Software is infrastructure: failures, successes, costs, and the case for formal verification](http://arxiv.org/abs/2506.13821v3)** | 2025-06-15 |
| **17** | **[APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning](http://arxiv.org/abs/2505.05758v3)** | 2025-05-09 |
| **18** | **[TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving](http://arxiv.org/abs/2504.15780v2)** | 2025-04-22 |
| **19** | **[Automated Discovery of Tactic Libraries for Interactive Theorem Proving](http://arxiv.org/abs/2503.24036v2)** | 2025-03-31 |
| **20** | **[LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction](http://arxiv.org/abs/2502.17925v2)** | 2025-02-25 |
| **21** | **[Proving the Coding Interview: A Benchmark for Formally Verified Code Generation](http://arxiv.org/abs/2502.05714v1)** | 2025-02-08 |
| **22** | **[Learning Rules Explaining Interactive Theorem Proving Tactic Prediction](http://arxiv.org/abs/2411.01188v1)** | 2024-11-02 |
| **23** | **[A Certified Proof Checker for Deep Neural Network Verification in Imandra](http://arxiv.org/abs/2405.10611v2)** | 2024-05-17 |


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

每月论文更新 - 2025年09月02日 #26

最后更新：2025-09-02 00:09

论文汇总（201篇）

1. efficient RL

2. partial observable markov decision process/pomdp

3. sparse reward reinforcement learning

4. casual RL/counterfactual RL/casual reinforcement learning

5. causal inference/causal discovery/counterfactual reasoning

6. video super resolution

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

8. combinatorial game theory/xiangqi/chinese chess

9. code llm

10. speech recognition

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

12. text to 3d/image to 3d/text to texture

13. automated theorem proving/interactive theorem proving/formal verification

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

序号	标题	日期
1	rStar2-Agent: Agentic Reasoning Technical Report	2025-08-28
2	M2IO-R1: An Efficient RL-Enhanced Reasoning Framework for Multimodal Retrieval Augmented Multimodal Generation	2025-08-08
3	Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle	2025-08-07
4	MindSpeed RL: Distributed Dataflow for Scalable and Efficient RL Training on Ascend NPU Cluster	2025-07-25
5	Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models	2025-07-23
6	Efficient RL for optimizing conversation level outcomes with an LLM-based tutor	2025-07-22
7	Efficient RL Training for Reasoning Models via Length-Aware Optimization	2025-05-18
8	RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$	2023-06-28

序号	标题	日期
1	Convergence of regularized agent-state-based Q-learning in POMDPs	2025-08-29
2	Uncertainty-Resilient Active Intention Recognition for Robotic Assistants	2025-08-26
3	A coalgebraic perspective on predictive processing	2025-08-23
4	Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling	2025-08-23
5	Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction	2025-08-20
6	Towards Agent-based Test Support Systems: An Unsupervised Environment Design Approach	2025-08-19
7	Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation	2025-08-15
8	Sensitivity of Filter Kernels and Robustness to Incorrect Transition and Measurement Kernel Perturbations in Partially Observable Stochastic Control	2025-08-14
9	Learning-Enabled Adaptive Power Capping Scheme for Cloud Data Centers	2025-08-09
10	Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs	2025-05-14
11	Hierarchical Object-Oriented POMDP Planning for Object Rearrangement	2024-12-02
12	Maintenance Optimization for Asset Networks with Unknown Degradation Parameters	2024-10-23
13	Pessimistic Iterative Planning with RNNs for Robust POMDPs	2024-08-16

序号	标题	日期
1	LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning	2025-08-25
2	SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning	2025-06-01
3	DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning	2025-05-26
4	STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs	2025-05-21
5	Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model	2025-03-14
6	Hedging with Sparse Reward Reinforcement Learning	2025-03-06
7	Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations	2024-12-02
8	Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning	2023-09-08

序号	标题	日期
1	Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL	2025-02-18
2	Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation	2020-12-16

序号	标题	日期
1	Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions	2025-08-29
2	Treatment effects at the margin: Everyone is marginal	2025-08-29
3	ORCA: ORchestrating Causal Agent	2025-08-29
4	ChainReaction! Structured Approach with Causal Chains as Intermediate Representations for Improved and Explainable Causal Video Question Answering	2025-08-28
5	Understanding and evaluating computer vision models through the lens of counterfactuals	2025-08-28
6	When Is Causal Inference Possible? A Statistical Test for Unmeasured Confounding	2025-08-28
7	Stochastic Gradients under Nuisances	2025-08-28
8	MOCHA: Discovering Multi-Order Dynamic Causality in Temporal Point Processes	2025-08-26
9	Explainable Counterfactual Reasoning in Depression Medication Selection at Multi-Levels (Personalized and Population)	2025-08-24
10	Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management	2025-08-22
11	A Logic of Stability: Formalizing Similarity in Counterfactual Reasoning	2025-08-17
12	ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks	2025-08-14
13	Inference on Nonlinear Counterfactual Functionals under a Multiplicative IV Model	2025-07-21
14	Boosting Temporal Sentence Grounding via Causal Inference	2025-07-07
15	Causal Feedback Discovery using Convergence Cross Mapping from Sea Ice Data	2025-05-13
16	What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning	2025-03-27
17	Causal resilience curves: A data-driven framework for quantifying the spatiotemporal impacts of metro service disruptions	2023-10-11
18	Sophisticated Learning: A novel algorithm for active learning during model-based planning	2023-08-15
19	Robust Universal Inference For Misspecified Models	2023-07-08
20	Integrating Large Language Model for Improved Causal Discovery	2023-06-29
21	A Survey on Causal Discovery: Theory and Practice	2023-05-17
22	Identifiability of causal graphs under nonadditive conditionally parametric causal models	2023-03-27

序号	标题	日期
1	Structural Damage Detection Using AI Super Resolution and Visual Language Model	2025-08-23
2	Trajectory-aware Shifted State Space Models for Online Video Super-Resolution	2025-08-14
3	QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution	2025-08-06
4	Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework	2025-08-06
5	Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution	2025-08-01
6	RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution	2025-07-25
7	UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space	2025-05-26
8	Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution	2024-10-15

序号	标题	日期
1	Improving Biomedical Knowledge Graph Quality: A Community Approach	2025-08-29
2	Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering	2025-08-29
3	Geospatial Question Answering on Historical Maps Using Spatio-Temporal Knowledge Graphs and Large Language Models	2025-08-29
4	A Knowledge Distillation-empowered Adaptive Federated Reinforcement Learning Framework for Multi-Domain IoT Applications Scheduling	2025-08-29
5	MyGO: Memory Yielding Generative Offline-consolidation for Lifelong Learning Systems	2025-08-29
6	Addressing accuracy and hallucination of LLMs in Alzheimer's disease research through knowledge graphs	2025-08-28
7	Efficient Large-Scale Cross-Domain Sequential Recommendation with Dynamic State Representations	2025-08-28
8	Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision	2025-08-28
9	Unified Multi-task Learning for Voice-Based Detection of Diverse Clinical Conditions	2025-08-28
10	MobileCLIP2: Improving Multi-Modal Reinforced Training	2025-08-28
11	Enhancing Semantic Document Retrieval- Employing Group Steiner Tree Algorithm with Domain Knowledge Enrichment	2025-08-28
12	Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification	2025-08-28
13	KG-CQR: Leveraging Structured Relation Representations in Knowledge Graphs for Contextual Query Retrieval	2025-08-28
14	ATMS-KD: Adaptive Temperature and Mixed Sample Knowledge Distillation for a Lightweight Residual CNN in Agricultural Embedded Systems	2025-08-27
15	Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities	2025-08-27
16	Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions	2025-08-26
17	Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning	2025-08-25
18	CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation	2025-08-24
19	Information Ecosystem Reengineering via Public Sector Knowledge Representation	2025-08-21
20	Transplant Then Regenerate: A New Paradigm for Text Data Augmentation	2025-08-20
21	Statistical Comparative Analysis of Semantic Similarities and Model Transferability Across Datasets for Short Answer Grading	2025-08-19
22	Semantic Discrepancy-aware Detector for Image Forgery Identification	2025-08-17
23	Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics	2025-08-14
24	Physical Autoregressive Model for Robotic Manipulation without Action Pretraining	2025-08-13
25	What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge	2025-08-11
26	Pr$^2$R: Information-Fused and Style-Aware Privacy-Preserving Replay for Lifelong Person Re-Identification	2025-08-03
27	Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess	2025-07-01
28	On the Fundamental Impossibility of Hallucination Control in Large Language Models	2025-06-04
29	Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning	2025-05-23
30	FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization	2025-05-05
31	Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness	2025-04-07
32	VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models	2025-03-25
33	Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning	2025-03-11
34	Retrieval-Augmented Machine Translation with Unstructured Knowledge	2024-12-05
35	Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models	2024-11-12
36	SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models	2024-11-01
37	Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off	2024-02-22

序号	标题	日期
1	Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning	2025-07-16
2	On 3-terminal positions in Hex	2025-07-11
3	A number game reconciliation	2025-07-07
4	Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search	2025-06-18
5	Circular Game Coloring of Signed Graphs	2025-05-27
6	Computational and Algebraic Structure of Board Games	2025-02-18
7	RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community	2025-02-17
8	Temperatures of Robin Hood	2025-01-13
9	On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory	2025-01-08
10	Complete Implementation of WXF Chinese Chess Rules	2024-12-23
11	Maker-Breaker on Galton-Watson trees	2024-12-11
12	Relationship between misère NIM and two-player GOISHI HIROI	2024-12-05
13	The Game Value of Sequential Compounds of Integers and Stars	2024-11-13
14	Mastering Chinese Chess AI (Xiangqi) Without Search	2024-10-07
15	XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi	2024-07-05
16	Shogi and Frieze group	2023-11-15
17	JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games	2023-08-09
18	Niel's Chess -- Rules for Xiangqi	2023-06-27
19	On the complexity of Dark Chinese Chess	2021-12-06

序号	标题	日期
1	RepoMark: A Code Usage Auditing Framework for Code Large Language Models	2025-08-29
2	The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion	2025-08-22
3	Hallucination in LLM-Based Code Generation: An Automotive Case Study	2025-08-15
4	VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models	2025-08-13
5	A Taxonomy of Inefficiencies in LLM-Generated Python Code	2025-03-08
6	RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation	2025-02-13
7	HAFix: History-Augmented Large Language Models for Bug Fixing	2025-01-15
8	Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs	2024-05-30

序号	标题	日期
1	Towards Improved Speech Recognition through Optimized Synthetic Data Generation	2025-08-29
2	NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration	2025-08-29
3	Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?	2025-08-28
4	Benchmarking Large Pretrained Multilingual Models on Québec French Speech Recognition	2025-08-28
5	OLMoASR: Open Models and Data for Training Robust Speech Recognition Models	2025-08-28
6	Generative Annotation for ASR Named Entity Correction	2025-08-28
7	MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening	2025-08-28
8	OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset	2023-01-16

序号	标题	日期
1	Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning	2025-08-29
2	PHD: Personalized 3D Human Body Fitting with Point Diffusion	2025-08-28
3	COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans	2025-08-28
4	Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation	2025-08-28
5	ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments	2025-08-27
6	WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization	2025-08-27
7	PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation	2025-08-24
8	6-DoF Object Tracking with Event-based Optical Flow and Frames	2025-08-20
9	DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects	2025-08-16
10	Visuomotor Grasping with World Models for Surgical Robots	2025-08-15
11	Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM	2025-04-07
12	PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation	2025-04-03
13	Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation	2025-03-14
14	Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator	2024-12-04
15	OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB	2024-10-09
16	Faster Model Predictive Control via Self-Supervised Initialization Learning	2024-08-06
17	Matching Anything by Segmenting Anything	2024-06-06
18	Input-Output Extension of Underactuated Nonlinear Systems	2024-03-05
19	Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection	2023-08-09
20	Zero-Shot Anomaly Detection with Pre-trained Segmentation Models	2023-06-15
21	APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD	2023-05-27
22	Unifying Tracking and Image-Video Object Detection	2022-11-20
23	Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations	2022-06-21
24	The Multi-speaker Multi-style Voice Cloning Challenge 2021	2021-04-05

序号	标题	日期
1	DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View	2025-08-27
2	Structural Energy-Guided Sampling for View-Consistent Text-to-3D	2025-08-23
3	MV-RAG: Retrieval Augmented Multiview Diffusion	2025-08-22
4	Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality	2025-08-17
5	CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion	2025-08-15
6	Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors	2025-08-13
7	TexTailor: Customized Text-aligned Texturing via Effective Resampling	2025-06-12
8	CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx	2025-06-05
9	MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection	2025-05-07
10	SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models	2025-04-25
11	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	2025-04-09
12	Text-to-3D Generation using Jensen-Shannon Score Distillation	2025-03-08
13	ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models	2025-01-28
14	Improving Viewpoint Consistency in 3D Generation via Structure Feature and CLIP Guidance	2024-12-03
15	Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation	2024-11-25
16	MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D	2024-11-04
17	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	2024-10-24
18	Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control	2024-10-09
19	Localized Gaussian Splatting Editing with Contextual Awareness	2024-07-31
20	REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment	2024-05-28
21	FlashTex: Fast Relightable Mesh Texturing with LightControlNet	2024-02-20

序号	标题	日期
1	Verifying Probabilistic Regions of Attraction with Neural Lyapunov Functions for Stochastic Systems	2025-08-28
2	Formal Modeling and Verification of the Algorand Consensus Protocol in CADP	2025-08-26
3	Formal Verification of Physical Layer Security Protocols for Next-Generation Communication Networks (extended version)	2025-08-26
4	MoveScanner: Analysis of Security Risks of Move Smart Contracts	2025-08-25
5	Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs	2025-08-21
6	Repairing General Game Descriptions (extended version)	2025-08-14
7	TPTP World Infrastructure for Non-classical Logics	2025-08-12
8	Policy Design in Zero-Trust Distributed Networks: Challenges and Solutions	2025-08-06
9	Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction	2025-08-05
10	StepFun-Prover Preview: Let's Think and Verify Step by Step	2025-07-27
11	An ACL2s Interface to Z3	2025-07-25
12	The AlphaPhysics Term Rewriting System for Marking Algebraic Expressions in Physics Exams	2025-07-24
13	Leveraging LLMs for Formal Software Requirements -- Challenges and Prospects	2025-07-18
14	Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization	2025-07-10
15	Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance	2025-07-02
16	Software is infrastructure: failures, successes, costs, and the case for formal verification	2025-06-15
17	APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning	2025-05-09
18	TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving	2025-04-22
19	Automated Discovery of Tactic Libraries for Interactive Theorem Proving	2025-03-31
20	LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction	2025-02-25
21	Proving the Coding Interview: A Benchmark for Formally Verified Code Generation	2025-02-08
22	Learning Rules Explaining Interactive Theorem Proving Tactic Prediction	2024-11-02
23	A Certified Proof Checker for Deep Neural Network Verification in Imandra	2024-05-17

每月论文更新 - 2025年09月02日 #26

Description

最后更新：2025-09-02 00:09

论文汇总（201篇）

1. efficient RL

2. partial observable markov decision process/pomdp

3. sparse reward reinforcement learning

4. casual RL/counterfactual RL/casual reinforcement learning

5. causal inference/causal discovery/counterfactual reasoning

6. video super resolution

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

8. combinatorial game theory/xiangqi/chinese chess

9. code llm

10. speech recognition

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

12. text to 3d/image to 3d/text to texture

13. automated theorem proving/interactive theorem proving/formal verification

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions