Skip to content

每月论文更新 - 2025年09月02日 #26

@github-actions

Description

@github-actions

最后更新:2025-09-02 00:09

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(201篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 rStar2-Agent: Agentic Reasoning Technical Report 2025-08-28
2 M2IO-R1: An Efficient RL-Enhanced Reasoning Framework for Multimodal Retrieval Augmented Multimodal Generation 2025-08-08
3 Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle 2025-08-07
4 MindSpeed RL: Distributed Dataflow for Scalable and Efficient RL Training on Ascend NPU Cluster 2025-07-25
5 Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models 2025-07-23
6 Efficient RL for optimizing conversation level outcomes with an LLM-based tutor 2025-07-22
7 Efficient RL Training for Reasoning Models via Length-Aware Optimization 2025-05-18
8 RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$ 2023-06-28

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Convergence of regularized agent-state-based Q-learning in POMDPs 2025-08-29
2 Uncertainty-Resilient Active Intention Recognition for Robotic Assistants 2025-08-26
3 A coalgebraic perspective on predictive processing 2025-08-23
4 Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling 2025-08-23
5 Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction 2025-08-20
6 Towards Agent-based Test Support Systems: An Unsupervised Environment Design Approach 2025-08-19
7 Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation 2025-08-15
8 Sensitivity of Filter Kernels and Robustness to Incorrect Transition and Measurement Kernel Perturbations in Partially Observable Stochastic Control 2025-08-14
9 Learning-Enabled Adaptive Power Capping Scheme for Cloud Data Centers 2025-08-09
10 Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs 2025-05-14
11 Hierarchical Object-Oriented POMDP Planning for Object Rearrangement 2024-12-02
12 Maintenance Optimization for Asset Networks with Unknown Degradation Parameters 2024-10-23
13 Pessimistic Iterative Planning with RNNs for Robust POMDPs 2024-08-16

3. sparse reward reinforcement learning

序号 标题 日期
1 LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning 2025-08-25
2 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
3 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
4 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
5 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
6 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
7 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
8 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions 2025-08-29
2 Treatment effects at the margin: Everyone is marginal 2025-08-29
3 ORCA: ORchestrating Causal Agent 2025-08-29
4 ChainReaction! Structured Approach with Causal Chains as Intermediate Representations for Improved and Explainable Causal Video Question Answering 2025-08-28
5 Understanding and evaluating computer vision models through the lens of counterfactuals 2025-08-28
6 When Is Causal Inference Possible? A Statistical Test for Unmeasured Confounding 2025-08-28
7 Stochastic Gradients under Nuisances 2025-08-28
8 MOCHA: Discovering Multi-Order Dynamic Causality in Temporal Point Processes 2025-08-26
9 Explainable Counterfactual Reasoning in Depression Medication Selection at Multi-Levels (Personalized and Population) 2025-08-24
10 Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management 2025-08-22
11 A Logic of Stability: Formalizing Similarity in Counterfactual Reasoning 2025-08-17
12 ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks 2025-08-14
13 Inference on Nonlinear Counterfactual Functionals under a Multiplicative IV Model 2025-07-21
14 Boosting Temporal Sentence Grounding via Causal Inference 2025-07-07
15 Causal Feedback Discovery using Convergence Cross Mapping from Sea Ice Data 2025-05-13
16 What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning 2025-03-27
17 Causal resilience curves: A data-driven framework for quantifying the spatiotemporal impacts of metro service disruptions 2023-10-11
18 Sophisticated Learning: A novel algorithm for active learning during model-based planning 2023-08-15
19 Robust Universal Inference For Misspecified Models 2023-07-08
20 Integrating Large Language Model for Improved Causal Discovery 2023-06-29
21 A Survey on Causal Discovery: Theory and Practice 2023-05-17
22 Identifiability of causal graphs under nonadditive conditionally parametric causal models 2023-03-27

6. video super resolution

序号 标题 日期
1 Structural Damage Detection Using AI Super Resolution and Visual Language Model 2025-08-23
2 Trajectory-aware Shifted State Space Models for Online Video Super-Resolution 2025-08-14
3 QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution 2025-08-06
4 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework 2025-08-06
5 Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution 2025-08-01
6 RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution 2025-07-25
7 UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space 2025-05-26
8 Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution 2024-10-15

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Improving Biomedical Knowledge Graph Quality: A Community Approach 2025-08-29
2 Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering 2025-08-29
3 Geospatial Question Answering on Historical Maps Using Spatio-Temporal Knowledge Graphs and Large Language Models 2025-08-29
4 A Knowledge Distillation-empowered Adaptive Federated Reinforcement Learning Framework for Multi-Domain IoT Applications Scheduling 2025-08-29
5 MyGO: Memory Yielding Generative Offline-consolidation for Lifelong Learning Systems 2025-08-29
6 Addressing accuracy and hallucination of LLMs in Alzheimer's disease research through knowledge graphs 2025-08-28
7 Efficient Large-Scale Cross-Domain Sequential Recommendation with Dynamic State Representations 2025-08-28
8 Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision 2025-08-28
9 Unified Multi-task Learning for Voice-Based Detection of Diverse Clinical Conditions 2025-08-28
10 MobileCLIP2: Improving Multi-Modal Reinforced Training 2025-08-28
11 Enhancing Semantic Document Retrieval- Employing Group Steiner Tree Algorithm with Domain Knowledge Enrichment 2025-08-28
12 Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification 2025-08-28
13 KG-CQR: Leveraging Structured Relation Representations in Knowledge Graphs for Contextual Query Retrieval 2025-08-28
14 ATMS-KD: Adaptive Temperature and Mixed Sample Knowledge Distillation for a Lightweight Residual CNN in Agricultural Embedded Systems 2025-08-27
15 Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities 2025-08-27
16 Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions 2025-08-26
17 Pandora: Leveraging Code-driven Knowledge Transfer for Unified Structured Knowledge Reasoning 2025-08-25
18 CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation 2025-08-24
19 Information Ecosystem Reengineering via Public Sector Knowledge Representation 2025-08-21
20 Transplant Then Regenerate: A New Paradigm for Text Data Augmentation 2025-08-20
21 Statistical Comparative Analysis of Semantic Similarities and Model Transferability Across Datasets for Short Answer Grading 2025-08-19
22 Semantic Discrepancy-aware Detector for Image Forgery Identification 2025-08-17
23 Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics 2025-08-14
24 Physical Autoregressive Model for Robotic Manipulation without Action Pretraining 2025-08-13
25 What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge 2025-08-11
26 Pr$^2$R: Information-Fused and Style-Aware Privacy-Preserving Replay for Lifelong Person Re-Identification 2025-08-03
27 Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess 2025-07-01
28 On the Fundamental Impossibility of Hallucination Control in Large Language Models 2025-06-04
29 Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning 2025-05-23
30 FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization 2025-05-05
31 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness 2025-04-07
32 VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models 2025-03-25
33 Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning 2025-03-11
34 Retrieval-Augmented Machine Translation with Unstructured Knowledge 2024-12-05
35 Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models 2024-11-12
36 SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models 2024-11-01
37 Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off 2024-02-22

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning 2025-07-16
2 On 3-terminal positions in Hex 2025-07-11
3 A number game reconciliation 2025-07-07
4 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
5 Circular Game Coloring of Signed Graphs 2025-05-27
6 Computational and Algebraic Structure of Board Games 2025-02-18
7 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
8 Temperatures of Robin Hood 2025-01-13
9 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
10 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
11 Maker-Breaker on Galton-Watson trees 2024-12-11
12 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
13 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
14 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
15 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
16 Shogi and Frieze group 2023-11-15
17 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
18 Niel's Chess -- Rules for Xiangqi 2023-06-27
19 On the complexity of Dark Chinese Chess 2021-12-06

9. code llm

序号 标题 日期
1 RepoMark: A Code Usage Auditing Framework for Code Large Language Models 2025-08-29
2 The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion 2025-08-22
3 Hallucination in LLM-Based Code Generation: An Automotive Case Study 2025-08-15
4 VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models 2025-08-13
5 A Taxonomy of Inefficiencies in LLM-Generated Python Code 2025-03-08
6 RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation 2025-02-13
7 HAFix: History-Augmented Large Language Models for Bug Fixing 2025-01-15
8 Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs 2024-05-30

10. speech recognition

序号 标题 日期
1 Towards Improved Speech Recognition through Optimized Synthetic Data Generation 2025-08-29
2 NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration 2025-08-29
3 Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech? 2025-08-28
4 Benchmarking Large Pretrained Multilingual Models on Québec French Speech Recognition 2025-08-28
5 OLMoASR: Open Models and Data for Training Robust Speech Recognition Models 2025-08-28
6 Generative Annotation for ASR Named Entity Correction 2025-08-28
7 MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening 2025-08-28
8 OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset 2023-01-16

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning 2025-08-29
2 PHD: Personalized 3D Human Body Fitting with Point Diffusion 2025-08-28
3 COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans 2025-08-28
4 Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation 2025-08-28
5 ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments 2025-08-27
6 WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization 2025-08-27
7 PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation 2025-08-24
8 6-DoF Object Tracking with Event-based Optical Flow and Frames 2025-08-20
9 DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects 2025-08-16
10 Visuomotor Grasping with World Models for Surgical Robots 2025-08-15
11 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM 2025-04-07
12 PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation 2025-04-03
13 Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation 2025-03-14
14 Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator 2024-12-04
15 OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB 2024-10-09
16 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
17 Matching Anything by Segmenting Anything 2024-06-06
18 Input-Output Extension of Underactuated Nonlinear Systems 2024-03-05
19 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
20 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
21 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
22 Unifying Tracking and Image-Video Object Detection 2022-11-20
23 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
24 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View 2025-08-27
2 Structural Energy-Guided Sampling for View-Consistent Text-to-3D 2025-08-23
3 MV-RAG: Retrieval Augmented Multiview Diffusion 2025-08-22
4 Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality 2025-08-17
5 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion 2025-08-15
6 Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors 2025-08-13
7 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
8 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
9 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection 2025-05-07
10 SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models 2025-04-25
11 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
12 Text-to-3D Generation using Jensen-Shannon Score Distillation 2025-03-08
13 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
14 Improving Viewpoint Consistency in 3D Generation via Structure Feature and CLIP Guidance 2024-12-03
15 Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation 2024-11-25
16 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
17 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
18 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
19 Localized Gaussian Splatting Editing with Contextual Awareness 2024-07-31
20 REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment 2024-05-28
21 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 Verifying Probabilistic Regions of Attraction with Neural Lyapunov Functions for Stochastic Systems 2025-08-28
2 Formal Modeling and Verification of the Algorand Consensus Protocol in CADP 2025-08-26
3 Formal Verification of Physical Layer Security Protocols for Next-Generation Communication Networks (extended version) 2025-08-26
4 MoveScanner: Analysis of Security Risks of Move Smart Contracts 2025-08-25
5 Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs 2025-08-21
6 Repairing General Game Descriptions (extended version) 2025-08-14
7 TPTP World Infrastructure for Non-classical Logics 2025-08-12
8 Policy Design in Zero-Trust Distributed Networks: Challenges and Solutions 2025-08-06
9 Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction 2025-08-05
10 StepFun-Prover Preview: Let's Think and Verify Step by Step 2025-07-27
11 An ACL2s Interface to Z3 2025-07-25
12 The AlphaPhysics Term Rewriting System for Marking Algebraic Expressions in Physics Exams 2025-07-24
13 Leveraging LLMs for Formal Software Requirements -- Challenges and Prospects 2025-07-18
14 Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization 2025-07-10
15 Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance 2025-07-02
16 Software is infrastructure: failures, successes, costs, and the case for formal verification 2025-06-15
17 APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning 2025-05-09
18 TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving 2025-04-22
19 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
20 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
21 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
22 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
23 A Certified Proof Checker for Deep Neural Network Verification in Imandra 2024-05-17

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions