每月论文更新 - 2025年10月02日

## 最后更新：2025-10-02 00:09
**本次更新执行命令**
```
D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8
```

**参数详解**
- 关键词：`efficient RL`, `partial observable markov decision process/pomdp`, `sparse reward reinforcement learning`, `casual RL/counterfactual RL/casual reinforcement learning`, `causal inference/causal discovery/counterfactual reasoning`, `video super resolution`, `knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding`, `combinatorial game theory/xiangqi/chinese chess`, `code llm`, `speech recognition`, `zero shot tracking/few shot tracking/pose tracking/pose estimation`, `text to 3d/image to 3d/text to texture`, `automated theorem proving/interactive theorem proving/formal verification`
- 排除关键词：`multi-agent`, `multiagent`
- 每关键词最大结果：`8`
- 目标领域：`cs`, `stat`
- 每关键词重试次数：`3`


## 论文汇总（202篇）

**更好的阅读体验请访问 [Github页面](https://github.com/dbsxdbsx/MyAutoPapers)。**


### 1. efficient RL
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation](http://arxiv.org/abs/2509.15965v1)** | 2025-09-19 |
| **2** | **[TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference](http://arxiv.org/abs/2509.15110v2)** | 2025-09-18 |
| **3** | **[Gradient Free Deep Reinforcement Learning With TabPFN](http://arxiv.org/abs/2509.11259v1)** | 2025-09-14 |
| **4** | **[SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning](http://arxiv.org/abs/2509.09674v1)** | 2025-09-11 |
| **5** | **[RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use](http://arxiv.org/abs/2509.06980v1)** | 2025-08-31 |
| **6** | **[rStar2-Agent: Agentic Reasoning Technical Report](http://arxiv.org/abs/2508.20722v1)** | 2025-08-28 |
| **7** | **[Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals](http://arxiv.org/abs/2506.02281v2)** | 2025-06-02 |
| **8** | **[Efficient RL Training for Reasoning Models via Length-Aware Optimization](http://arxiv.org/abs/2505.12284v2)** | 2025-05-18 |
### 2. partial observable markov decision process/pomdp
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Accelerating Transformers in Online RL](http://arxiv.org/abs/2509.26137v1)** | 2025-09-30 |
| **2** | **[Model-Based Reinforcement Learning under Random Observation Delays](http://arxiv.org/abs/2509.20869v1)** | 2025-09-25 |
| **3** | **[Assistive Decision-Making for Right of Way Navigation at Uncontrolled Intersections](http://arxiv.org/abs/2509.18407v1)** | 2025-09-22 |
| **4** | **[Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling](http://arxiv.org/abs/2508.16876v3)** | 2025-08-23 |
| **5** | **[Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling](http://arxiv.org/abs/2508.04282v2)** | 2025-08-06 |
| **6** | **[PIGDreamer: Privileged Information Guided World Models for Safe Partially Observable Reinforcement Learning](http://arxiv.org/abs/2508.02159v2)** | 2025-08-04 |
| **7** | **[Mixing Any Cocktail with Limited Ingredients: On the Structure of Payoff Sets in Multi-Objective POMDPs and its Impact on Randomised Strategies](http://arxiv.org/abs/2502.18296v2)** | 2025-02-25 |
| **8** | **[Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning](http://arxiv.org/abs/2408.07192v3)** | 2024-08-13 |
| **9** | **[Contributions on complexity bounds for Deterministic Partially Observed Markov Decision Process](http://arxiv.org/abs/2301.08567v2)** | 2023-01-20 |
### 3. sparse reward reinforcement learning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?](http://arxiv.org/abs/2509.03790v2)** | 2025-09-04 |
| **2** | **[LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning](http://arxiv.org/abs/2508.18420v1)** | 2025-08-25 |
| **3** | **[SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning](http://arxiv.org/abs/2506.01096v2)** | 2025-06-01 |
| **4** | **[DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning](http://arxiv.org/abs/2505.19850v1)** | 2025-05-26 |
| **5** | **[STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs](http://arxiv.org/abs/2505.15804v3)** | 2025-05-21 |
| **6** | **[Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model](http://arxiv.org/abs/2503.11339v2)** | 2025-03-14 |
| **7** | **[Hedging with Sparse Reward Reinforcement Learning](http://arxiv.org/abs/2503.04218v1)** | 2025-03-06 |
| **8** | **[Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations](http://arxiv.org/abs/2412.01114v2)** | 2024-12-02 |
### 4. casual RL/counterfactual RL/casual reinforcement learning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL](http://arxiv.org/abs/2502.12436v3)** | 2025-02-18 |
| **2** | **[Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation](http://arxiv.org/abs/2012.09092v1)** | 2020-12-16 |
### 5. causal inference/causal discovery/counterfactual reasoning
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Computationally and statistically efficient estimation of time-smoothed counterfactual curves](http://arxiv.org/abs/2509.26554v1)** | 2025-09-30 |
| **2** | **[An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes](http://arxiv.org/abs/2509.26429v1)** | 2025-09-30 |
| **3** | **[MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval](http://arxiv.org/abs/2509.26378v1)** | 2025-09-30 |
| **4** | **[Staged Event Trees for Transparent Treatment Effect Estimation](http://arxiv.org/abs/2509.26265v1)** | 2025-09-30 |
| **5** | **[Characterization and Learning of Causal Graphs with Latent Confounders and Post-treatment Selection from Interventional Data](http://arxiv.org/abs/2509.25800v1)** | 2025-09-30 |
| **6** | **[MuPlon: Multi-Path Causal Optimization for Claim Verification through Controlling Confounding](http://arxiv.org/abs/2509.25715v1)** | 2025-09-30 |
| **7** | **[TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models](http://arxiv.org/abs/2509.24803v1)** | 2025-09-29 |
| **8** | **[Guide: Generalized-Prior and Data Encoders for DAG Estimation](http://arxiv.org/abs/2509.23992v1)** | 2025-09-28 |
| **9** | **[Diagnosing Failure Root Causes in Platform-Orchestrated Agentic Systems: Dataset, Taxonomy, and Benchmark](http://arxiv.org/abs/2509.23735v1)** | 2025-09-28 |
| **10** | **[Improving constraint-based discovery with robust propagation and reliable LLM priors](http://arxiv.org/abs/2509.23570v1)** | 2025-09-28 |
| **11** | **[One-Shot Multi-Label Causal Discovery in High-Dimensional Event Sequences](http://arxiv.org/abs/2509.23213v1)** | 2025-09-27 |
| **12** | **[Efficient Ensemble Conditional Independence Test Framework for Causal Discovery](http://arxiv.org/abs/2509.21021v1)** | 2025-09-25 |
| **13** | **[DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning](http://arxiv.org/abs/2509.20912v1)** | 2025-09-25 |
| **14** | **[A Counterfactual Reasoning Framework for Fault Diagnosis in Robot Perception Systems](http://arxiv.org/abs/2509.18460v1)** | 2025-09-22 |
| **15** | **[Causal-Counterfactual RAG: The Integration of Causal-Counterfactual Reasoning into RAG](http://arxiv.org/abs/2509.14435v2)** | 2025-09-17 |
| **16** | **[Causality-guided Prompt Learning for Vision-language Models via Visual Granulation](http://arxiv.org/abs/2509.03803v3)** | 2025-09-04 |
| **17** | **[Mapping beyond diseases: Controlled variable selection for secondary phenotypes using tilted knockoffs](http://arxiv.org/abs/2508.18548v2)** | 2025-08-25 |
| **18** | **[Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact](http://arxiv.org/abs/2507.02912v2)** | 2025-06-25 |
| **19** | **[EgoVIS@CVPR: What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning](http://arxiv.org/abs/2506.00101v2)** | 2025-05-30 |
| **20** | **[No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery](http://arxiv.org/abs/2505.16288v2)** | 2025-05-22 |
| **21** | **[A Review on Riemannian Metric Learning: Closer to You than You Imagine](http://arxiv.org/abs/2503.05321v2)** | 2025-03-07 |
| **22** | **[Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms](http://arxiv.org/abs/2502.20115v3)** | 2025-02-27 |
| **23** | **[Can LLMs Explain Themselves Counterfactually?](http://arxiv.org/abs/2502.18156v2)** | 2025-02-25 |
### 6. video super resolution
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Continuous Space-Time Video Super-Resolution with 3D Fourier Fields](http://arxiv.org/abs/2509.26325v1)** | 2025-09-30 |
| **2** | **[PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution](http://arxiv.org/abs/2509.26025v1)** | 2025-09-30 |
| **3** | **[Asymmetric VAE for One-Step Video Super-Resolution Acceleration](http://arxiv.org/abs/2509.24142v1)** | 2025-09-29 |
| **4** | **[Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution](http://arxiv.org/abs/2509.23980v1)** | 2025-09-28 |
| **5** | **[VividFace: High-Quality and Efficient One-Step Diffusion For Video Face Enhancement](http://arxiv.org/abs/2509.23584v1)** | 2025-09-28 |
| **6** | **[MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation](http://arxiv.org/abs/2509.21265v1)** | 2025-09-25 |
| **7** | **[OS-DiffVSR: Towards One-step Latent Diffusion Model for High-detailed Real-world Video Super-Resolution](http://arxiv.org/abs/2509.16507v1)** | 2025-09-20 |
| **8** | **[SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution](http://arxiv.org/abs/2506.19838v3)** | 2025-06-24 |
### 7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[TAP: Two-Stage Adaptive Personalization of Multi-task and Multi-Modal Foundation Models in Federated Learning](http://arxiv.org/abs/2509.26524v1)** | 2025-09-30 |
| **2** | **[Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation](http://arxiv.org/abs/2509.26497v1)** | 2025-09-30 |
| **3** | **[Combining Knowledge Graphs and NLP to Analyze Instant Messaging Data in Criminal Investigations](http://dx.doi.org/10.1007/978-981-96-0567-5_30)** | 2025-09-30 |
| **4** | **[OntoAligner Meets Knowledge Graph Embedding Aligners](http://arxiv.org/abs/2509.26417v1)** | 2025-09-30 |
| **5** | **[Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning](http://arxiv.org/abs/2509.26383v1)** | 2025-09-30 |
| **6** | **[Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document](http://arxiv.org/abs/2509.26235v1)** | 2025-09-30 |
| **7** | **[Type-Less yet Type-Aware Inductive Link Prediction with Pretrained Language Models](http://arxiv.org/abs/2509.26224v1)** | 2025-09-30 |
| **8** | **[MEDAKA: Construction of Biomedical Knowledge Graphs Using Large Language Models](http://arxiv.org/abs/2509.26128v1)** | 2025-09-30 |
| **9** | **[Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations](http://arxiv.org/abs/2509.26107v1)** | 2025-09-30 |
| **10** | **[CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models](http://arxiv.org/abs/2509.25996v1)** | 2025-09-30 |
| **11** | **[Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning](http://arxiv.org/abs/2509.25977v1)** | 2025-09-30 |
| **12** | **[Autonomy-Aware Clustering: When Local Decisions Supersede Global Prescriptions](http://arxiv.org/abs/2509.25775v1)** | 2025-09-30 |
| **13** | **[How Does Preconditioning Guide Feature Learning in Deep Neural Networks?](http://arxiv.org/abs/2509.25637v1)** | 2025-09-30 |
| **14** | **[DAM: Dual Active Learning with Multimodal Foundation Model for Source-Free Domain Adaptation](http://arxiv.org/abs/2509.24896v1)** | 2025-09-29 |
| **15** | **[Patient-specific Biomolecular Instruction Tuning](http://arxiv.org/abs/2509.22853v1)** | 2025-09-26 |
| **16** | **[Advancing Natural Language Formalization to First Order Logic with Fine-tuned LLMs](http://arxiv.org/abs/2509.22338v1)** | 2025-09-26 |
| **17** | **[Frustratingly Easy Zero-Day Audio DeepFake Detection via Retrieval Augmentation and Profile Matching](http://arxiv.org/abs/2509.21728v1)** | 2025-09-26 |
| **18** | **[One Filters All: A Generalist Filter for State Estimation](http://arxiv.org/abs/2509.20051v1)** | 2025-09-24 |
| **19** | **[Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification](http://arxiv.org/abs/2509.17747v1)** | 2025-09-22 |
| **20** | **[OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation](http://arxiv.org/abs/2509.17321v2)** | 2025-09-22 |
| **21** | **[K-DeCore: Facilitating Knowledge Transfer in Continual Structured Knowledge Reasoning via Knowledge Decoupling](http://arxiv.org/abs/2509.16929v4)** | 2025-09-21 |
| **22** | **[Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model](http://arxiv.org/abs/2509.16054v1)** | 2025-09-19 |
| **23** | **[Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets](http://arxiv.org/abs/2509.04516v2)** | 2025-09-03 |
| **24** | **[Semantic Discrepancy-aware Detector for Image Forgery Identification](http://arxiv.org/abs/2508.12341v3)** | 2025-08-17 |
| **25** | **[Learning Unified User Quantized Tokenizers for User Representation](http://arxiv.org/abs/2508.00956v2)** | 2025-08-01 |
| **26** | **[Static Word Embeddings for Sentence Semantic Representation](http://arxiv.org/abs/2506.04624v2)** | 2025-06-05 |
| **27** | **[Personalized Subgraph Federated Learning with Differentiable Auxiliary Projections](http://arxiv.org/abs/2505.23864v2)** | 2025-05-29 |
| **28** | **[Multilingual Prompting for Improving LLM Generation Diversity](http://arxiv.org/abs/2505.15229v2)** | 2025-05-21 |
| **29** | **[Language-Specific Latent Process Hinders Cross-Lingual Performance](http://arxiv.org/abs/2505.13141v3)** | 2025-05-19 |
| **30** | **[Simple yet Effective Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization](http://arxiv.org/abs/2505.07675v2)** | 2025-05-12 |
| **31** | **[KDC-Diff: A Latent-Aware Diffusion Model with Knowledge Retention for Memory-Efficient Image Generation](http://arxiv.org/abs/2505.06995v2)** | 2025-05-11 |
| **32** | **[Using Knowledge Graphs to harvest datasets for efficient CLIP model training](http://arxiv.org/abs/2505.02746v3)** | 2025-05-05 |
| **33** | **[A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models](http://arxiv.org/abs/2501.13958v3)** | 2025-01-21 |
| **34** | **[Efficient Dynamic Ensembling for Multiple LLM Experts](http://arxiv.org/abs/2412.07448v2)** | 2024-12-10 |
| **35** | **[CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning](http://dx.doi.org/10.1109/IROS58592.2024.10801817)** | 2024-10-21 |
| **36** | **[On the Integration of Spatial-Temporal Knowledge: A Lightweight Approach to Atmospheric Time Series Forecasting](http://arxiv.org/abs/2408.09695v2)** | 2024-08-19 |
| **37** | **[Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection](http://dx.doi.org/10.1109/TNNLS.2025.3605710)** | 2024-07-02 |
| **38** | **[Representing Knowledge and Querying Data using Double-Functorial Semantics](http://dx.doi.org/10.4204/EPTCS.429.9)** | 2024-03-28 |
| **39** | **[Semantic Data Representation for Explainable Windows Malware Detection Models](http://arxiv.org/abs/2403.11669v2)** | 2024-03-18 |
### 8. combinatorial game theory/xiangqi/chinese chess
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Various Diamond Properties in Combinatorial Game Theory](http://arxiv.org/abs/2509.21744v1)** | 2025-09-26 |
| **2** | **[Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning](http://arxiv.org/abs/2507.12215v1)** | 2025-07-16 |
| **3** | **[On 3-terminal positions in Hex](http://arxiv.org/abs/2507.08247v2)** | 2025-07-11 |
| **4** | **[A number game reconciliation](http://arxiv.org/abs/2507.04717v1)** | 2025-07-07 |
| **5** | **[Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search](http://arxiv.org/abs/2506.15880v1)** | 2025-06-18 |
| **6** | **[Circular Game Coloring of Signed Graphs](http://arxiv.org/abs/2505.21586v1)** | 2025-05-27 |
| **7** | **[Computational and Algebraic Structure of Board Games](http://arxiv.org/abs/2503.01850v1)** | 2025-02-18 |
| **8** | **[RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community](http://dx.doi.org/10.1145/3706598.3714236)** | 2025-02-17 |
| **9** | **[Temperatures of Robin Hood](http://arxiv.org/abs/2501.07239v1)** | 2025-01-13 |
| **10** | **[On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory](http://arxiv.org/abs/2501.04412v2)** | 2025-01-08 |
| **11** | **[Complete Implementation of WXF Chinese Chess Rules](http://arxiv.org/abs/2412.17334v1)** | 2024-12-23 |
| **12** | **[Maker-Breaker on Galton-Watson trees](http://arxiv.org/abs/2412.08334v2)** | 2024-12-11 |
| **13** | **[Relationship between misère NIM and two-player GOISHI HIROI](http://arxiv.org/abs/2412.03996v1)** | 2024-12-05 |
| **14** | **[Mastering Chinese Chess AI (Xiangqi) Without Search](http://arxiv.org/abs/2410.04865v1)** | 2024-10-07 |
| **15** | **[XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi](http://arxiv.org/abs/2407.04678v1)** | 2024-07-05 |
| **16** | **[Shogi and Frieze group](http://arxiv.org/abs/2401.08591v2)** | 2023-11-15 |
| **17** | **[JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games](http://arxiv.org/abs/2308.04719v1)** | 2023-08-09 |
| **18** | **[Niel's Chess -- Rules for Xiangqi](http://arxiv.org/abs/2311.12181v2)** | 2023-06-27 |
| **19** | **[On the complexity of Dark Chinese Chess](http://arxiv.org/abs/2112.02989v1)** | 2021-12-06 |
### 9. code llm
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm](http://arxiv.org/abs/2509.24637v1)** | 2025-09-29 |
| **2** | **[Verification Limits Code LLM Training](http://arxiv.org/abs/2509.20837v1)** | 2025-09-25 |
| **3** | **[Do Code Semantics Help? A Comprehensive Study on Execution Trace-Based Information for Code Large Language Models](http://arxiv.org/abs/2509.11686v3)** | 2025-09-15 |
| **4** | **[CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design](http://arxiv.org/abs/2507.09792v2)** | 2025-07-13 |
| **5** | **[Can Code Language Models Learn Clarification-Seeking Behaviors?](http://arxiv.org/abs/2504.16331v2)** | 2025-04-23 |
| **6** | **[A Preliminary Study on the Robustness of Code Generation by Large Language Models](http://arxiv.org/abs/2503.20197v4)** | 2025-03-26 |
| **7** | **[ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation](http://arxiv.org/abs/2501.18460v3)** | 2025-01-30 |
| **8** | **[GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding](http://arxiv.org/abs/2409.04183v4)** | 2024-09-06 |
### 10. speech recognition
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[IR-UWB Radar-Based Contactless Silent Speech Recognition with Attention-Enhanced Temporal Convolutional Networks](http://arxiv.org/abs/2509.26409v1)** | 2025-09-30 |
| **2** | **[ASR Under Noise: Exploring Robustness for Sundanese and Javanese](http://arxiv.org/abs/2509.25878v1)** | 2025-09-30 |
| **3** | **[Beyond WER: Probing Whisper's Sub-token Decoder Across Diverse Language Resource Levels](http://arxiv.org/abs/2509.25516v1)** | 2025-09-29 |
| **4** | **[Confidence-Guided Error Correction for Disordered Speech Recognition](http://arxiv.org/abs/2509.25048v1)** | 2025-09-29 |
| **5** | **[MeanFlowSE: One-Step Generative Speech Enhancement via MeanFlow](http://arxiv.org/abs/2509.23299v2)** | 2025-09-27 |
| **6** | **[Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling](http://arxiv.org/abs/2509.08753v2)** | 2025-09-10 |
| **7** | **[Regularizing Learnable Feature Extraction for Automatic Speech Recognition](http://dx.doi.org/10.21437/Interspeech.2025-694)** | 2025-06-11 |
| **8** | **[Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages](http://arxiv.org/abs/2409.08872v2)** | 2024-09-13 |
### 11. zero shot tracking/few shot tracking/pose tracking/pose estimation
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[TTT3R: 3D Reconstruction as Test-Time Training](http://arxiv.org/abs/2509.26645v1)** | 2025-09-30 |
| **2** | **[A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging Environments](http://arxiv.org/abs/2509.25969v1)** | 2025-09-30 |
| **3** | **[User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality](http://arxiv.org/abs/2509.25905v1)** | 2025-09-30 |
| **4** | **[Physics-Informed Learning for Human Whole-Body Kinematics Prediction via Sparse IMUs](http://arxiv.org/abs/2509.25704v1)** | 2025-09-30 |
| **5** | **[Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity](http://dx.doi.org/10.1109/LRA.2025.3614045)** | 2025-09-29 |
| **6** | **[VGGT-X: When VGGT Meets Dense Novel View Synthesis](http://arxiv.org/abs/2509.25191v1)** | 2025-09-29 |
| **7** | **[PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos](http://arxiv.org/abs/2509.25183v1)** | 2025-09-29 |
| **8** | **[SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation](http://arxiv.org/abs/2509.24980v1)** | 2025-09-29 |
| **9** | **[Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM](http://arxiv.org/abs/2509.22910v1)** | 2025-09-26 |
| **10** | **[MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training](http://arxiv.org/abs/2509.22199v2)** | 2025-09-26 |
| **11** | **[MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM](http://arxiv.org/abs/2509.20757v2)** | 2025-09-25 |
| **12** | **[UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation](http://arxiv.org/abs/2509.15934v1)** | 2025-09-19 |
| **13** | **[Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization](http://arxiv.org/abs/2509.11772v1)** | 2025-09-15 |
| **14** | **[IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects](http://arxiv.org/abs/2509.11680v1)** | 2025-09-15 |
| **15** | **[Hierarchical Reactive Grasping via Task-Space Velocity Fields and Joint-Space Quadratic Programming](http://arxiv.org/abs/2509.01044v2)** | 2025-09-01 |
| **16** | **[PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking](http://arxiv.org/abs/2504.20359v3)** | 2025-04-29 |
| **17** | **[BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation](http://arxiv.org/abs/2504.07955v2)** | 2025-04-10 |
| **18** | **[Faster Model Predictive Control via Self-Supervised Initialization Learning](http://arxiv.org/abs/2408.03394v2)** | 2024-08-06 |
| **19** | **[Matching Anything by Segmenting Anything](http://arxiv.org/abs/2406.04221v1)** | 2024-06-06 |
| **20** | **[Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection](http://arxiv.org/abs/2308.04789v2)** | 2023-08-09 |
| **21** | **[Zero-Shot Anomaly Detection with Pre-trained Segmentation Models](http://arxiv.org/abs/2306.09269v1)** | 2023-06-15 |
| **22** | **[APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD](http://arxiv.org/abs/2305.17382v3)** | 2023-05-27 |
| **23** | **[Unifying Tracking and Image-Video Object Detection](http://arxiv.org/abs/2211.11077v2)** | 2022-11-20 |
| **24** | **[Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations](http://arxiv.org/abs/2206.10695v1)** | 2022-06-21 |
| **25** | **[The Multi-speaker Multi-style Voice Cloning Challenge 2021](http://arxiv.org/abs/2104.01818v1)** | 2021-04-05 |
### 12. text to 3d/image to 3d/text to texture
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos](http://arxiv.org/abs/2509.25183v1)** | 2025-09-29 |
| **2** | **[Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes](http://dx.doi.org/10.1145/3757377.3763835)** | 2025-09-29 |
| **3** | **[UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections](http://arxiv.org/abs/2509.24817v1)** | 2025-09-29 |
| **4** | **[Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric](http://arxiv.org/abs/2509.23841v1)** | 2025-09-28 |
| **5** | **[ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing](http://arxiv.org/abs/2509.23607v1)** | 2025-09-28 |
| **6** | **[Drag4D: Align Your Motion with Text-Driven 3D Scene Generation](http://arxiv.org/abs/2509.21888v1)** | 2025-09-26 |
| **7** | **[Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation](http://arxiv.org/abs/2509.15772v1)** | 2025-09-19 |
| **8** | **[AToken: A Unified Tokenizer for Vision](http://arxiv.org/abs/2509.14476v2)** | 2025-09-17 |
| **9** | **[T2Bs: Text-to-Character Blendshapes via Video Generation](http://arxiv.org/abs/2509.10678v2)** | 2025-09-12 |
| **10** | **[One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation](http://arxiv.org/abs/2509.07978v1)** | 2025-09-09 |
| **11** | **[A Scalable Attention-Based Approach for Image-to-3D Texture Mapping](http://arxiv.org/abs/2509.05131v1)** | 2025-09-05 |
| **12** | **[TexTailor: Customized Text-aligned Texturing via Effective Resampling](http://arxiv.org/abs/2506.10612v1)** | 2025-06-12 |
| **13** | **[CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx](http://arxiv.org/abs/2506.04931v1)** | 2025-06-05 |
| **14** | **[ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction](http://arxiv.org/abs/2505.20431v3)** | 2025-05-26 |
| **15** | **[Making Physical Objects with Generative AI and Robotic Assembly: Considering Fabrication Constraints, Sustainability, Time, Functionality, and Accessibility](http://arxiv.org/abs/2504.19131v3)** | 2025-04-27 |
| **16** | **[CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading](http://arxiv.org/abs/2504.06856v1)** | 2025-04-09 |
| **17** | **[ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models](http://arxiv.org/abs/2501.17895v1)** | 2025-01-28 |
| **18** | **[FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction](http://arxiv.org/abs/2412.09573v2)** | 2024-12-12 |
| **19** | **[MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D](http://arxiv.org/abs/2411.02336v1)** | 2024-11-04 |
| **20** | **[3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation](http://arxiv.org/abs/2410.18974v2)** | 2024-10-24 |
| **21** | **[Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control](http://arxiv.org/abs/2410.06985v1)** | 2024-10-09 |
| **22** | **[FlashTex: Fast Relightable Mesh Texturing with LightControlNet](http://arxiv.org/abs/2402.13251v3)** | 2024-02-20 |
| **23** | **[Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints](http://arxiv.org/abs/2310.03602v5)** | 2023-10-05 |
### 13. automated theorem proving/interactive theorem proving/formal verification
| **序号** | **标题** | **日期** |
| --- | --- | --- |
| **1** | **[Towards Verified Code Reasoning by LLMs](http://arxiv.org/abs/2509.26546v1)** | 2025-09-30 |
| **2** | **[Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization](http://arxiv.org/abs/2509.23961v1)** | 2025-09-28 |
| **3** | **[GPM: The Gaussian Pancake Mechanism for Planting Undetectable Backdoors in Differential Privacy](http://arxiv.org/abs/2509.23834v1)** | 2025-09-28 |
| **4** | **[PAT-Agent: Autoformalization for Model Checking](http://arxiv.org/abs/2509.23675v1)** | 2025-09-28 |
| **5** | **[L-Mosaics and Bounded Join-Semilattices in Isabelle/HOL](http://arxiv.org/abs/2509.19854v1)** | 2025-09-24 |
| **6** | **[EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving](http://arxiv.org/abs/2509.12603v1)** | 2025-09-16 |
| **7** | **[Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem](http://arxiv.org/abs/2509.06809v1)** | 2025-09-08 |
| **8** | **[Contradictions](http://arxiv.org/abs/2509.07026v1)** | 2025-09-07 |
| **9** | **[Formal Modeling and Verification of the Algorand Consensus Protocol in CADP](http://arxiv.org/abs/2508.19452v5)** | 2025-08-26 |
| **10** | **[An ACL2s Interface to Z3](http://dx.doi.org/10.4204/EPTCS.423.10)** | 2025-07-25 |
| **11** | **[Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization](http://arxiv.org/abs/2507.07399v2)** | 2025-07-10 |
| **12** | **[Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance](http://arxiv.org/abs/2507.07052v1)** | 2025-07-02 |
| **13** | **[Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs](http://arxiv.org/abs/2506.19923v3)** | 2025-06-24 |
| **14** | **[Logic Gate Neural Networks are Good for Verification](http://arxiv.org/abs/2505.19932v2)** | 2025-05-26 |
| **15** | **[A Formal Proof of Complexity Bounds on Diophantine Equations](http://dx.doi.org/10.4230/LIPIcs.ITP.2025.3)** | 2025-05-22 |
| **16** | **[Generalizable Process Reward Models via Formally Verified Training Data](http://arxiv.org/abs/2505.15960v2)** | 2025-05-21 |
| **17** | **[Canonical for Automated Theorem Proving in Lean](http://dx.doi.org/10.4230/LIPIcs.ITP.2025.14)** | 2025-04-08 |
| **18** | **[Automated Discovery of Tactic Libraries for Interactive Theorem Proving](http://arxiv.org/abs/2503.24036v2)** | 2025-03-31 |
| **19** | **[LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction](http://arxiv.org/abs/2502.17925v2)** | 2025-02-25 |
| **20** | **[Proving the Coding Interview: A Benchmark for Formally Verified Code Generation](http://arxiv.org/abs/2502.05714v1)** | 2025-02-08 |
| **21** | **[A Certified Proof Checker for Deep Neural Network Verification in Imandra](http://arxiv.org/abs/2405.10611v2)** | 2024-05-17 |
| **22** | **[Consensus-Free Spreadsheet Integration](http://arxiv.org/abs/2209.14457v2)** | 2022-09-28 |


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

每月论文更新 - 2025年10月02日 #27

最后更新：2025-10-02 00:09

论文汇总（202篇）

1. efficient RL

2. partial observable markov decision process/pomdp

3. sparse reward reinforcement learning

4. casual RL/counterfactual RL/casual reinforcement learning

5. causal inference/causal discovery/counterfactual reasoning

6. video super resolution

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

8. combinatorial game theory/xiangqi/chinese chess

9. code llm

10. speech recognition

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

12. text to 3d/image to 3d/text to texture

13. automated theorem proving/interactive theorem proving/formal verification

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

序号	标题	日期
1	RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation	2025-09-19
2	TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference	2025-09-18
3	Gradient Free Deep Reinforcement Learning With TabPFN	2025-09-14
4	SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning	2025-09-11
5	RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use	2025-08-31
6	rStar2-Agent: Agentic Reasoning Technical Report	2025-08-28
7	Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals	2025-06-02
8	Efficient RL Training for Reasoning Models via Length-Aware Optimization	2025-05-18

序号	标题	日期
1	Accelerating Transformers in Online RL	2025-09-30
2	Model-Based Reinforcement Learning under Random Observation Delays	2025-09-25
3	Assistive Decision-Making for Right of Way Navigation at Uncontrolled Intersections	2025-09-22
4	Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling	2025-08-23
5	Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling	2025-08-06
6	PIGDreamer: Privileged Information Guided World Models for Safe Partially Observable Reinforcement Learning	2025-08-04
7	Mixing Any Cocktail with Limited Ingredients: On the Structure of Payoff Sets in Multi-Objective POMDPs and its Impact on Randomised Strategies	2025-02-25
8	Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning	2024-08-13
9	Contributions on complexity bounds for Deterministic Partially Observed Markov Decision Process	2023-01-20

序号	标题	日期
1	What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?	2025-09-04
2	LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning	2025-08-25
3	SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning	2025-06-01
4	DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning	2025-05-26
5	STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs	2025-05-21
6	Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model	2025-03-14
7	Hedging with Sparse Reward Reinforcement Learning	2025-03-06
8	Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations	2024-12-02

序号	标题	日期
1	Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL	2025-02-18
2	Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation	2020-12-16

序号	标题	日期
1	Computationally and statistically efficient estimation of time-smoothed counterfactual curves	2025-09-30
2	An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes	2025-09-30
3	MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval	2025-09-30
4	Staged Event Trees for Transparent Treatment Effect Estimation	2025-09-30
5	Characterization and Learning of Causal Graphs with Latent Confounders and Post-treatment Selection from Interventional Data	2025-09-30
6	MuPlon: Multi-Path Causal Optimization for Claim Verification through Controlling Confounding	2025-09-30
7	TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models	2025-09-29
8	Guide: Generalized-Prior and Data Encoders for DAG Estimation	2025-09-28
9	Diagnosing Failure Root Causes in Platform-Orchestrated Agentic Systems: Dataset, Taxonomy, and Benchmark	2025-09-28
10	Improving constraint-based discovery with robust propagation and reliable LLM priors	2025-09-28
11	One-Shot Multi-Label Causal Discovery in High-Dimensional Event Sequences	2025-09-27
12	Efficient Ensemble Conditional Independence Test Framework for Causal Discovery	2025-09-25
13	DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning	2025-09-25
14	A Counterfactual Reasoning Framework for Fault Diagnosis in Robot Perception Systems	2025-09-22
15	Causal-Counterfactual RAG: The Integration of Causal-Counterfactual Reasoning into RAG	2025-09-17
16	Causality-guided Prompt Learning for Vision-language Models via Visual Granulation	2025-09-04
17	Mapping beyond diseases: Controlled variable selection for secondary phenotypes using tilted knockoffs	2025-08-25
18	Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact	2025-06-25
19	EgoVIS@CVPR: What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning	2025-05-30
20	No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery	2025-05-22
21	A Review on Riemannian Metric Learning: Closer to You than You Imagine	2025-03-07
22	Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms	2025-02-27
23	Can LLMs Explain Themselves Counterfactually?	2025-02-25

序号	标题	日期
1	Continuous Space-Time Video Super-Resolution with 3D Fourier Fields	2025-09-30
2	PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution	2025-09-30
3	Asymmetric VAE for One-Step Video Super-Resolution Acceleration	2025-09-29
4	Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution	2025-09-28
5	VividFace: High-Quality and Efficient One-Step Diffusion For Video Face Enhancement	2025-09-28
6	MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation	2025-09-25
7	OS-DiffVSR: Towards One-step Latent Diffusion Model for High-detailed Real-world Video Super-Resolution	2025-09-20
8	SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution	2025-06-24

序号	标题	日期
1	TAP: Two-Stage Adaptive Personalization of Multi-task and Multi-Modal Foundation Models in Federated Learning	2025-09-30
2	Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation	2025-09-30
3	Combining Knowledge Graphs and NLP to Analyze Instant Messaging Data in Criminal Investigations	2025-09-30
4	OntoAligner Meets Knowledge Graph Embedding Aligners	2025-09-30
5	Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning	2025-09-30
6	Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document	2025-09-30
7	Type-Less yet Type-Aware Inductive Link Prediction with Pretrained Language Models	2025-09-30
8	MEDAKA: Construction of Biomedical Knowledge Graphs Using Large Language Models	2025-09-30
9	Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations	2025-09-30
10	CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models	2025-09-30
11	Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning	2025-09-30
12	Autonomy-Aware Clustering: When Local Decisions Supersede Global Prescriptions	2025-09-30
13	How Does Preconditioning Guide Feature Learning in Deep Neural Networks?	2025-09-30
14	DAM: Dual Active Learning with Multimodal Foundation Model for Source-Free Domain Adaptation	2025-09-29
15	Patient-specific Biomolecular Instruction Tuning	2025-09-26
16	Advancing Natural Language Formalization to First Order Logic with Fine-tuned LLMs	2025-09-26
17	Frustratingly Easy Zero-Day Audio DeepFake Detection via Retrieval Augmentation and Profile Matching	2025-09-26
18	One Filters All: A Generalist Filter for State Estimation	2025-09-24
19	Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification	2025-09-22
20	OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation	2025-09-22
21	K-DeCore: Facilitating Knowledge Transfer in Continual Structured Knowledge Reasoning via Knowledge Decoupling	2025-09-21
22	Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model	2025-09-19
23	Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets	2025-09-03
24	Semantic Discrepancy-aware Detector for Image Forgery Identification	2025-08-17
25	Learning Unified User Quantized Tokenizers for User Representation	2025-08-01
26	Static Word Embeddings for Sentence Semantic Representation	2025-06-05
27	Personalized Subgraph Federated Learning with Differentiable Auxiliary Projections	2025-05-29
28	Multilingual Prompting for Improving LLM Generation Diversity	2025-05-21
29	Language-Specific Latent Process Hinders Cross-Lingual Performance	2025-05-19
30	Simple yet Effective Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization	2025-05-12
31	KDC-Diff: A Latent-Aware Diffusion Model with Knowledge Retention for Memory-Efficient Image Generation	2025-05-11
32	Using Knowledge Graphs to harvest datasets for efficient CLIP model training	2025-05-05
33	A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models	2025-01-21
34	Efficient Dynamic Ensembling for Multiple LLM Experts	2024-12-10
35	CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning	2024-10-21
36	On the Integration of Spatial-Temporal Knowledge: A Lightweight Approach to Atmospheric Time Series Forecasting	2024-08-19
37	Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection	2024-07-02
38	Representing Knowledge and Querying Data using Double-Functorial Semantics	2024-03-28
39	Semantic Data Representation for Explainable Windows Malware Detection Models	2024-03-18

序号	标题	日期
1	Various Diamond Properties in Combinatorial Game Theory	2025-09-26
2	Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning	2025-07-16
3	On 3-terminal positions in Hex	2025-07-11
4	A number game reconciliation	2025-07-07
5	Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search	2025-06-18
6	Circular Game Coloring of Signed Graphs	2025-05-27
7	Computational and Algebraic Structure of Board Games	2025-02-18
8	RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community	2025-02-17
9	Temperatures of Robin Hood	2025-01-13
10	On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory	2025-01-08
11	Complete Implementation of WXF Chinese Chess Rules	2024-12-23
12	Maker-Breaker on Galton-Watson trees	2024-12-11
13	Relationship between misère NIM and two-player GOISHI HIROI	2024-12-05
14	Mastering Chinese Chess AI (Xiangqi) Without Search	2024-10-07
15	XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi	2024-07-05
16	Shogi and Frieze group	2023-11-15
17	JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games	2023-08-09
18	Niel's Chess -- Rules for Xiangqi	2023-06-27
19	On the complexity of Dark Chinese Chess	2021-12-06

序号	标题	日期
1	Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm	2025-09-29
2	Verification Limits Code LLM Training	2025-09-25
3	Do Code Semantics Help? A Comprehensive Study on Execution Trace-Based Information for Code Large Language Models	2025-09-15
4	CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design	2025-07-13
5	Can Code Language Models Learn Clarification-Seeking Behaviors?	2025-04-23
6	A Preliminary Study on the Robustness of Code Generation by Large Language Models	2025-03-26
7	ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation	2025-01-30
8	GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding	2024-09-06

序号	标题	日期
1	IR-UWB Radar-Based Contactless Silent Speech Recognition with Attention-Enhanced Temporal Convolutional Networks	2025-09-30
2	ASR Under Noise: Exploring Robustness for Sundanese and Javanese	2025-09-30
3	Beyond WER: Probing Whisper's Sub-token Decoder Across Diverse Language Resource Levels	2025-09-29
4	Confidence-Guided Error Correction for Disordered Speech Recognition	2025-09-29
5	MeanFlowSE: One-Step Generative Speech Enhancement via MeanFlow	2025-09-27
6	Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling	2025-09-10
7	Regularizing Learnable Feature Extraction for Automatic Speech Recognition	2025-06-11
8	Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages	2024-09-13

序号	标题	日期
1	TTT3R: 3D Reconstruction as Test-Time Training	2025-09-30
2	A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging Environments	2025-09-30
3	User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality	2025-09-30
4	Physics-Informed Learning for Human Whole-Body Kinematics Prediction via Sparse IMUs	2025-09-30
5	Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity	2025-09-29
6	VGGT-X: When VGGT Meets Dense Novel View Synthesis	2025-09-29
7	PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos	2025-09-29
8	SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation	2025-09-29
9	Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM	2025-09-26
10	MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training	2025-09-26
11	MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM	2025-09-25
12	UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation	2025-09-19
13	Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization	2025-09-15
14	IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects	2025-09-15
15	Hierarchical Reactive Grasping via Task-Space Velocity Fields and Joint-Space Quadratic Programming	2025-09-01
16	PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking	2025-04-29
17	BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation	2025-04-10
18	Faster Model Predictive Control via Self-Supervised Initialization Learning	2024-08-06
19	Matching Anything by Segmenting Anything	2024-06-06
20	Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection	2023-08-09
21	Zero-Shot Anomaly Detection with Pre-trained Segmentation Models	2023-06-15
22	APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD	2023-05-27
23	Unifying Tracking and Image-Video Object Detection	2022-11-20
24	Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations	2022-06-21
25	The Multi-speaker Multi-style Voice Cloning Challenge 2021	2021-04-05

序号	标题	日期
1	PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos	2025-09-29
2	Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes	2025-09-29
3	UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections	2025-09-29
4	Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric	2025-09-28
5	ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing	2025-09-28
6	Drag4D: Align Your Motion with Text-Driven 3D Scene Generation	2025-09-26
7	Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation	2025-09-19
8	AToken: A Unified Tokenizer for Vision	2025-09-17
9	T2Bs: Text-to-Character Blendshapes via Video Generation	2025-09-12
10	One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation	2025-09-09
11	A Scalable Attention-Based Approach for Image-to-3D Texture Mapping	2025-09-05
12	TexTailor: Customized Text-aligned Texturing via Effective Resampling	2025-06-12
13	CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx	2025-06-05
14	ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction	2025-05-26
15	Making Physical Objects with Generative AI and Robotic Assembly: Considering Fabrication Constraints, Sustainability, Time, Functionality, and Accessibility	2025-04-27
16	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	2025-04-09
17	ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models	2025-01-28
18	FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction	2024-12-12
19	MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D	2024-11-04
20	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	2024-10-24
21	Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control	2024-10-09
22	FlashTex: Fast Relightable Mesh Texturing with LightControlNet	2024-02-20
23	Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints	2023-10-05

序号	标题	日期
1	Towards Verified Code Reasoning by LLMs	2025-09-30
2	Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization	2025-09-28
3	GPM: The Gaussian Pancake Mechanism for Planting Undetectable Backdoors in Differential Privacy	2025-09-28
4	PAT-Agent: Autoformalization for Model Checking	2025-09-28
5	L-Mosaics and Bounded Join-Semilattices in Isabelle/HOL	2025-09-24
6	EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving	2025-09-16
7	Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem	2025-09-08
8	Contradictions	2025-09-07
9	Formal Modeling and Verification of the Algorand Consensus Protocol in CADP	2025-08-26
10	An ACL2s Interface to Z3	2025-07-25
11	Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization	2025-07-10
12	Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance	2025-07-02
13	Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs	2025-06-24
14	Logic Gate Neural Networks are Good for Verification	2025-05-26
15	A Formal Proof of Complexity Bounds on Diophantine Equations	2025-05-22
16	Generalizable Process Reward Models via Formally Verified Training Data	2025-05-21
17	Canonical for Automated Theorem Proving in Lean	2025-04-08
18	Automated Discovery of Tactic Libraries for Interactive Theorem Proving	2025-03-31
19	LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction	2025-02-25
20	Proving the Coding Interview: A Benchmark for Formally Verified Code Generation	2025-02-08
21	A Certified Proof Checker for Deep Neural Network Verification in Imandra	2024-05-17
22	Consensus-Free Spreadsheet Integration	2022-09-28

每月论文更新 - 2025年10月02日 #27

Description

最后更新：2025-10-02 00:09

论文汇总（202篇）

1. efficient RL

2. partial observable markov decision process/pomdp

3. sparse reward reinforcement learning

4. casual RL/counterfactual RL/casual reinforcement learning

5. causal inference/causal discovery/counterfactual reasoning

6. video super resolution

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

8. combinatorial game theory/xiangqi/chinese chess

9. code llm

10. speech recognition

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

12. text to 3d/image to 3d/text to texture

13. automated theorem proving/interactive theorem proving/formal verification

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions