Skip to content

每月论文更新 - 2025年10月02日 #27

@github-actions

Description

@github-actions

最后更新:2025-10-02 00:09

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(202篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation 2025-09-19
2 TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference 2025-09-18
3 Gradient Free Deep Reinforcement Learning With TabPFN 2025-09-14
4 SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning 2025-09-11
5 RLFactory: A Plug-and-Play Reinforcement Learning Post-Training Framework for LLM Multi-Turn Tool-Use 2025-08-31
6 rStar2-Agent: Agentic Reasoning Technical Report 2025-08-28
7 Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals 2025-06-02
8 Efficient RL Training for Reasoning Models via Length-Aware Optimization 2025-05-18

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Accelerating Transformers in Online RL 2025-09-30
2 Model-Based Reinforcement Learning under Random Observation Delays 2025-09-25
3 Assistive Decision-Making for Right of Way Navigation at Uncontrolled Intersections 2025-09-22
4 Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling 2025-08-23
5 Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling 2025-08-06
6 PIGDreamer: Privileged Information Guided World Models for Safe Partially Observable Reinforcement Learning 2025-08-04
7 Mixing Any Cocktail with Limited Ingredients: On the Structure of Payoff Sets in Multi-Objective POMDPs and its Impact on Randomised Strategies 2025-02-25
8 Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning 2024-08-13
9 Contributions on complexity bounds for Deterministic Partially Observed Markov Decision Process 2023-01-20

3. sparse reward reinforcement learning

序号 标题 日期
1 What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning? 2025-09-04
2 LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning 2025-08-25
3 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
4 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
5 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
6 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
7 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
8 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 Computationally and statistically efficient estimation of time-smoothed counterfactual curves 2025-09-30
2 An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes 2025-09-30
3 MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval 2025-09-30
4 Staged Event Trees for Transparent Treatment Effect Estimation 2025-09-30
5 Characterization and Learning of Causal Graphs with Latent Confounders and Post-treatment Selection from Interventional Data 2025-09-30
6 MuPlon: Multi-Path Causal Optimization for Claim Verification through Controlling Confounding 2025-09-30
7 TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models 2025-09-29
8 Guide: Generalized-Prior and Data Encoders for DAG Estimation 2025-09-28
9 Diagnosing Failure Root Causes in Platform-Orchestrated Agentic Systems: Dataset, Taxonomy, and Benchmark 2025-09-28
10 Improving constraint-based discovery with robust propagation and reliable LLM priors 2025-09-28
11 One-Shot Multi-Label Causal Discovery in High-Dimensional Event Sequences 2025-09-27
12 Efficient Ensemble Conditional Independence Test Framework for Causal Discovery 2025-09-25
13 DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning 2025-09-25
14 A Counterfactual Reasoning Framework for Fault Diagnosis in Robot Perception Systems 2025-09-22
15 Causal-Counterfactual RAG: The Integration of Causal-Counterfactual Reasoning into RAG 2025-09-17
16 Causality-guided Prompt Learning for Vision-language Models via Visual Granulation 2025-09-04
17 Mapping beyond diseases: Controlled variable selection for secondary phenotypes using tilted knockoffs 2025-08-25
18 Deep Graph Learning for Industrial Carbon Emission Analysis and Policy Impact 2025-06-25
19 EgoVIS@CVPR: What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning 2025-05-30
20 No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery 2025-05-22
21 A Review on Riemannian Metric Learning: Closer to You than You Imagine 2025-03-07
22 Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms 2025-02-27
23 Can LLMs Explain Themselves Counterfactually? 2025-02-25

6. video super resolution

序号 标题 日期
1 Continuous Space-Time Video Super-Resolution with 3D Fourier Fields 2025-09-30
2 PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution 2025-09-30
3 Asymmetric VAE for One-Step Video Super-Resolution Acceleration 2025-09-29
4 Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution 2025-09-28
5 VividFace: High-Quality and Efficient One-Step Diffusion For Video Face Enhancement 2025-09-28
6 MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation 2025-09-25
7 OS-DiffVSR: Towards One-step Latent Diffusion Model for High-detailed Real-world Video Super-Resolution 2025-09-20
8 SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution 2025-06-24

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 TAP: Two-Stage Adaptive Personalization of Multi-task and Multi-Modal Foundation Models in Federated Learning 2025-09-30
2 Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation 2025-09-30
3 Combining Knowledge Graphs and NLP to Analyze Instant Messaging Data in Criminal Investigations 2025-09-30
4 OntoAligner Meets Knowledge Graph Embedding Aligners 2025-09-30
5 Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning 2025-09-30
6 Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document 2025-09-30
7 Type-Less yet Type-Aware Inductive Link Prediction with Pretrained Language Models 2025-09-30
8 MEDAKA: Construction of Biomedical Knowledge Graphs Using Large Language Models 2025-09-30
9 Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations 2025-09-30
10 CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models 2025-09-30
11 Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning 2025-09-30
12 Autonomy-Aware Clustering: When Local Decisions Supersede Global Prescriptions 2025-09-30
13 How Does Preconditioning Guide Feature Learning in Deep Neural Networks? 2025-09-30
14 DAM: Dual Active Learning with Multimodal Foundation Model for Source-Free Domain Adaptation 2025-09-29
15 Patient-specific Biomolecular Instruction Tuning 2025-09-26
16 Advancing Natural Language Formalization to First Order Logic with Fine-tuned LLMs 2025-09-26
17 Frustratingly Easy Zero-Day Audio DeepFake Detection via Retrieval Augmentation and Profile Matching 2025-09-26
18 One Filters All: A Generalist Filter for State Estimation 2025-09-24
19 Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification 2025-09-22
20 OpenGVL -- Benchmarking Visual Temporal Progress for Data Curation 2025-09-22
21 K-DeCore: Facilitating Knowledge Transfer in Continual Structured Knowledge Reasoning via Knowledge Decoupling 2025-09-21
22 Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model 2025-09-19
23 Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets 2025-09-03
24 Semantic Discrepancy-aware Detector for Image Forgery Identification 2025-08-17
25 Learning Unified User Quantized Tokenizers for User Representation 2025-08-01
26 Static Word Embeddings for Sentence Semantic Representation 2025-06-05
27 Personalized Subgraph Federated Learning with Differentiable Auxiliary Projections 2025-05-29
28 Multilingual Prompting for Improving LLM Generation Diversity 2025-05-21
29 Language-Specific Latent Process Hinders Cross-Lingual Performance 2025-05-19
30 Simple yet Effective Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization 2025-05-12
31 KDC-Diff: A Latent-Aware Diffusion Model with Knowledge Retention for Memory-Efficient Image Generation 2025-05-11
32 Using Knowledge Graphs to harvest datasets for efficient CLIP model training 2025-05-05
33 A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models 2025-01-21
34 Efficient Dynamic Ensembling for Multiple LLM Experts 2024-12-10
35 CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning 2024-10-21
36 On the Integration of Spatial-Temporal Knowledge: A Lightweight Approach to Atmospheric Time Series Forecasting 2024-08-19
37 Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection 2024-07-02
38 Representing Knowledge and Querying Data using Double-Functorial Semantics 2024-03-28
39 Semantic Data Representation for Explainable Windows Malware Detection Models 2024-03-18

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Various Diamond Properties in Combinatorial Game Theory 2025-09-26
2 Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning 2025-07-16
3 On 3-terminal positions in Hex 2025-07-11
4 A number game reconciliation 2025-07-07
5 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
6 Circular Game Coloring of Signed Graphs 2025-05-27
7 Computational and Algebraic Structure of Board Games 2025-02-18
8 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
9 Temperatures of Robin Hood 2025-01-13
10 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
11 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
12 Maker-Breaker on Galton-Watson trees 2024-12-11
13 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
14 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
15 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
16 Shogi and Frieze group 2023-11-15
17 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
18 Niel's Chess -- Rules for Xiangqi 2023-06-27
19 On the complexity of Dark Chinese Chess 2021-12-06

9. code llm

序号 标题 日期
1 Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm 2025-09-29
2 Verification Limits Code LLM Training 2025-09-25
3 Do Code Semantics Help? A Comprehensive Study on Execution Trace-Based Information for Code Large Language Models 2025-09-15
4 CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design 2025-07-13
5 Can Code Language Models Learn Clarification-Seeking Behaviors? 2025-04-23
6 A Preliminary Study on the Robustness of Code Generation by Large Language Models 2025-03-26
7 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation 2025-01-30
8 GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding 2024-09-06

10. speech recognition

序号 标题 日期
1 IR-UWB Radar-Based Contactless Silent Speech Recognition with Attention-Enhanced Temporal Convolutional Networks 2025-09-30
2 ASR Under Noise: Exploring Robustness for Sundanese and Javanese 2025-09-30
3 Beyond WER: Probing Whisper's Sub-token Decoder Across Diverse Language Resource Levels 2025-09-29
4 Confidence-Guided Error Correction for Disordered Speech Recognition 2025-09-29
5 MeanFlowSE: One-Step Generative Speech Enhancement via MeanFlow 2025-09-27
6 Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling 2025-09-10
7 Regularizing Learnable Feature Extraction for Automatic Speech Recognition 2025-06-11
8 Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages 2024-09-13

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 TTT3R: 3D Reconstruction as Test-Time Training 2025-09-30
2 A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging Environments 2025-09-30
3 User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality 2025-09-30
4 Physics-Informed Learning for Human Whole-Body Kinematics Prediction via Sparse IMUs 2025-09-30
5 Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity 2025-09-29
6 VGGT-X: When VGGT Meets Dense Novel View Synthesis 2025-09-29
7 PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos 2025-09-29
8 SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation 2025-09-29
9 Good Weights: Proactive, Adaptive Dead Reckoning Fusion for Continuous and Robust Visual SLAM 2025-09-26
10 MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training 2025-09-26
11 MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM 2025-09-25
12 UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation 2025-09-19
13 Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization 2025-09-15
14 IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects 2025-09-15
15 Hierarchical Reactive Grasping via Task-Space Velocity Fields and Joint-Space Quadratic Programming 2025-09-01
16 PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking 2025-04-29
17 BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation 2025-04-10
18 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
19 Matching Anything by Segmenting Anything 2024-06-06
20 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
21 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
22 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
23 Unifying Tracking and Image-Video Object Detection 2022-11-20
24 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
25 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos 2025-09-29
2 Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes 2025-09-29
3 UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections 2025-09-29
4 Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric 2025-09-28
5 ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing 2025-09-28
6 Drag4D: Align Your Motion with Text-Driven 3D Scene Generation 2025-09-26
7 Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation 2025-09-19
8 AToken: A Unified Tokenizer for Vision 2025-09-17
9 T2Bs: Text-to-Character Blendshapes via Video Generation 2025-09-12
10 One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation 2025-09-09
11 A Scalable Attention-Based Approach for Image-to-3D Texture Mapping 2025-09-05
12 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
13 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
14 ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction 2025-05-26
15 Making Physical Objects with Generative AI and Robotic Assembly: Considering Fabrication Constraints, Sustainability, Time, Functionality, and Accessibility 2025-04-27
16 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
17 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
18 FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction 2024-12-12
19 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
20 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
21 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
22 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20
23 Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints 2023-10-05

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 Towards Verified Code Reasoning by LLMs 2025-09-30
2 Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization 2025-09-28
3 GPM: The Gaussian Pancake Mechanism for Planting Undetectable Backdoors in Differential Privacy 2025-09-28
4 PAT-Agent: Autoformalization for Model Checking 2025-09-28
5 L-Mosaics and Bounded Join-Semilattices in Isabelle/HOL 2025-09-24
6 EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving 2025-09-16
7 Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem 2025-09-08
8 Contradictions 2025-09-07
9 Formal Modeling and Verification of the Algorand Consensus Protocol in CADP 2025-08-26
10 An ACL2s Interface to Z3 2025-07-25
11 Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization 2025-07-10
12 Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance 2025-07-02
13 Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs 2025-06-24
14 Logic Gate Neural Networks are Good for Verification 2025-05-26
15 A Formal Proof of Complexity Bounds on Diophantine Equations 2025-05-22
16 Generalizable Process Reward Models via Formally Verified Training Data 2025-05-21
17 Canonical for Automated Theorem Proving in Lean 2025-04-08
18 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
19 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
20 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
21 A Certified Proof Checker for Deep Neural Network Verification in Imandra 2024-05-17
22 Consensus-Free Spreadsheet Integration 2022-09-28

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions