Monthly Paper Update - 2025-11-02 #28

@github-actions

Last updated: 2025-11-02 00:08

Command executed for this update

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

Parameter details

  • Keywords: efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • Excluded keywords: multi-agent, multiagent
  • Max results per keyword: 8
  • Target domains: cs, stat
  • Retries per keyword: 3
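The parameters above can be read as a small grammar: commas separate keyword groups (one numbered section each), and slashes separate alternative search terms within a group. The following Python sketch illustrates that reading. It is not the tool's actual implementation (the executable is a compiled binary whose source is not shown here); the names `KeywordGroup`, `parse_keywords`, `is_excluded`, and `max_results` are hypothetical, and matching excluded keywords against the title only is an assumption.

```python
from dataclasses import dataclass


@dataclass
class KeywordGroup:
    """One comma-separated entry from --keywords (hypothetical structure)."""
    heading: str             # the entry as it appears in a section heading
    alternatives: list[str]  # the '/'-separated search terms inside the entry


def parse_keywords(raw: str) -> list[KeywordGroup]:
    """Split the raw --keywords value into groups and their alternatives."""
    groups = []
    for entry in raw.split(","):
        # Collapse the line breaks and indentation used in the command.
        heading = " ".join(entry.split())
        if not heading:
            continue
        alternatives = [alt.strip() for alt in heading.split("/") if alt.strip()]
        groups.append(KeywordGroup(heading, alternatives))
    return groups


def is_excluded(title: str, exclude_keywords: list[str]) -> bool:
    """Assumption: an excluded keyword is a case-insensitive substring match on the title."""
    lowered = title.lower()
    return any(word.lower() in lowered for word in exclude_keywords)


def max_results(group: KeywordGroup, per_keyword_max: int) -> int:
    """Upper bound on a section's size if each alternative is queried separately."""
    return len(group.alternatives) * per_keyword_max


raw = """
    efficient RL,
    partial observable markov decision process/pomdp,sparse reward reinforcement learning,
    code llm
"""
groups = parse_keywords(raw)
for group in groups:
    print(group.heading, "->", max_results(group, 8))
```

Under this reading, a group with several alternatives can list more than 8 papers, which is consistent with the section sizes in the summary (for example, 39 entries for the five-term knowledge group, below the bound of 40).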

Paper summary (198 papers)

For a better reading experience, visit the GitHub page

1. efficient RL

No. Title Date
1 ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems 2025-10-30
2 $π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models 2025-10-29
3 RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMs 2025-10-22
4 Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models 2025-10-13
5 Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle 2025-08-07
6 Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning 2025-07-09
7 When Can Model-Free Reinforcement Learning be Enough for Thinking? 2025-06-20
8 Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs 2025-05-24

2. partial observable markov decision process/pomdp

No. Title Date
1 Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability 2025-10-27
2 ESCORT: Efficient Stein-variational and Sliced Consistency-Optimized Temporal Belief Representation for POMDPs 2025-10-24
3 AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN 2025-10-23
4 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey 2025-09-02
5 Trust-Aware Assistance Seeking in Human-Supervised Autonomy 2024-10-27
6 Online POMDP Planning with Anytime Deterministic Optimality Guarantees 2023-10-03
7 Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders 2023-02-01

3. sparse reward reinforcement learning

No. Title Date
1 What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning? 2025-09-04
2 LLM-Driven Intrinsic Motivation for Sparse Reward Reinforcement Learning 2025-08-25
3 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
4 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
5 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
6 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
7 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
8 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02

4. casual RL/counterfactual RL/casual reinforcement learning

No. Title Date
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

No. Title Date
1 A Unified Theory for Causal Inference: Direct Debiased Machine Learning via Bregman-Riesz Regression 2025-10-30
2 Discovering Causal Relationships Between Time Series With Spatial Structure 2025-10-30
3 Representation-Level Counterfactual Calibration for Debiased Zero-Shot Recognition 2025-10-30
4 Linear Causal Discovery with Interventional Constraints 2025-10-30
5 Causal Inference with Groupwise Matching 2025-10-30
6 Bias-Corrected Data Synthesis for Imbalanced Learning 2025-10-30
7 Graph Distance Based on Cause-Effect Estimands with Latents 2025-10-28
8 Causal Ordering for Structure Learning From Time Series 2025-10-28
9 Decentralized Causal Discovery using Judo Calculus 2025-10-27
10 Group Interventions on Deep Networks for Causal Discovery in Subsystems 2025-10-27
11 Beyond Prompt Engineering: Neuro-Symbolic-Causal Architecture for Robust Multi-Objective AI Agents 2025-10-27
12 A Hybrid Enumeration Framework for Optimal Counterfactual Generation in Post-Acute COVID-19 Heart Failure 2025-10-21
13 Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models 2025-10-21
14 CausalTrace: A Neurosymbolic Causal Analysis Agent for Smart Manufacturing 2025-10-14
15 GCVAMD: A Modified CausalVAE Model for Causal Age-related Macular Degeneration Risk Factor Detection and Prediction 2025-10-03
16 Chiseling: Powerful and Valid Subgroup Selection via Interactive Machine Learning 2025-09-23
17 Revealing Multimodal Causality with Large Language Models 2025-09-22
18 Predictive Causal Inference via Spatio-Temporal Modeling and Penalized Empirical Likelihood 2025-07-11
19 Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning 2025-06-15
20 Counterfactual reasoning: an analysis of in-context emergence 2025-06-05
21 CAUSAL3D: A Comprehensive Benchmark for Causal Learning from Visual Data 2025-03-06
22 Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect 2020-09-28

6. video super resolution

No. Title Date
1 BasicAVSR: Arbitrary-Scale Video Super-Resolution via Image Priors and Enhanced Motion Compensation 2025-10-30
2 FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution 2025-10-14
3 Time-Correlated Video Bridge Matching 2025-10-14
4 UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution 2025-10-09
5 One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution 2025-06-18
6 DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution 2025-05-22
7 Blind Video Super-Resolution based on Implicit Kernels 2025-03-10
8 FCVSR: A Frequency-aware Method for Compressed Video Super-Resolution 2025-02-10

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

No. Title Date
1 Knowledge Distillation of Noisy Force Labels for Improved Coarse-Grained Force Fields 2025-10-30
2 Inside CORE-KG: Evaluating Structured Prompting and Coreference Resolution for Knowledge Graphs 2025-10-30
3 LINK-KG: LLM-Driven Coreference-Resolved Knowledge Graphs for Human Smuggling Networks 2025-10-30
4 ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems 2025-10-30
5 Personalized Treatment Outcome Prediction from Scarce Data via Dual-Channel Knowledge Distillation and Adaptive Fusion 2025-10-30
6 From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning 2025-10-30
7 Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual 2025-10-30
8 Do Students Debias Like Teachers? On the Distillability of Bias Mitigation Methods 2025-10-30
9 Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs 2025-10-29
10 Robust GNN Watermarking via Implicit Perception of Topological Invariants 2025-10-29
11 BambooKG: A Neurobiologically-inspired Frequency-Weight Knowledge Graph 2025-10-29
12 Cross Learning between Electronic Structure Theories for Unifying Molecular, Surface, and Inorganic Crystal Foundation Force Fields 2025-10-29
13 Parameter Averaging in Link Prediction 2025-10-29
14 A Privacy-Preserving Ecosystem for Developing Machine Learning Algorithms Using Patient Data: Insights from the TUM.ai Makeathon 2025-10-29
15 Model-Document Protocol for AI Search 2025-10-29
16 A word association network methodology for evaluating implicit biases in LLMs compared to humans 2025-10-28
17 Adaptive Knowledge Transferring with Switching Dual-Student Framework for Semi-Supervised Medical Image Segmentation 2025-10-28
18 Beyond Neural Incompatibility: Easing Cross-Scale Knowledge Transfer in Large Language Models through Latent Semantic Alignment 2025-10-28
19 PULSE: Privileged Knowledge Transfer from Electrodermal Activity to Low-Cost Sensors for Stress Monitoring 2025-10-28
20 Mitigating Negative Transfer via Reducing Environmental Disagreement 2025-10-28
21 VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree 2025-10-26
22 Accelerating Materials Design via LLM-Guided Evolutionary Search 2025-10-26
23 On the accuracy of implicit neural representations for cardiovascular anatomies and hemodynamic fields 2025-10-23
24 IKnow: Instruction-Knowledge-Aware Continual Pretraining for Effective Domain Adaptation 2025-10-23
25 LLM-empowered knowledge graph construction: A survey 2025-10-23
26 Collateral Damage Assessment Model for AI System Target Engagement in Military Operations 2025-10-23
27 Code Digital Twin: Empowering LLMs with Tacit Knowledge for Complex Software Development 2025-10-18
28 Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models 2025-10-18
29 Tunable-Generalization Diffusion Powered by Self-Supervised Contextual Sub-Data for Low-Dose CT Reconstruction 2025-09-28
30 Towards a Common Framework for Autoformalization 2025-09-11
31 DBLPLink 2.0 -- An Entity Linker for the DBLP Scholarly Knowledge Graph 2025-07-30
32 Exploring the In-Context Learning Capabilities of LLMs for Money Laundering Detection in Financial Graphs 2025-07-20
33 Qualitative Analysis of the Teacher and Student Roles in Pair Programming 2025-07-14
34 MTL-KD: Multi-Task Learning Via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver 2025-06-03
35 Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2) 2025-05-22
36 Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection 2025-05-06
37 Empowering Agentic Video Analytics Systems with Video Language Models 2025-05-01
38 Code Digital Twin: Empowering LLMs with Tacit Knowledge for Complex Software Development 2025-03-11
39 Minimalist Market Design: A Framework for Economists with Policy Aspirations 2023-12-30

8. combinatorial game theory/xiangqi/chinese chess

No. Title Date
1 Various Diamond Properties in Combinatorial Game Theory 2025-09-26
2 Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning 2025-07-16
3 On 3-terminal positions in Hex 2025-07-11
4 A number game reconciliation 2025-07-07
5 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
6 Circular Game Coloring of Signed Graphs 2025-05-27
7 Computational and Algebraic Structure of Board Games 2025-02-18
8 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
9 Temperatures of Robin Hood 2025-01-13
10 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
11 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
12 Maker-Breaker on Galton-Watson trees 2024-12-11
13 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
14 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
15 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
16 Shogi and Frieze group 2023-11-15
17 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
18 Niel's Chess -- Rules for Xiangqi 2023-06-27
19 On the complexity of Dark Chinese Chess 2021-12-06

9. code llm

No. Title Date
1 Gistify! Codebase-Level Understanding via Runtime Execution 2025-10-30
2 Wisdom and Delusion of LLM Ensembles for Code Generation and Repair 2025-10-24
3 Review of Tools for Zero-Code LLM Based Application Development 2025-10-22
4 LLavaCode: Compressed Code Representations for Retrieval-Augmented Code Generation 2025-10-22
5 TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework 2025-10-20
6 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar 2025-10-16
7 LongCodeBench: Evaluating Coding LLMs at 1M Context Windows 2025-05-12

10. speech recognition

No. Title Date
1 HMM for short independent sequences: Multiple sequence Baum-Welch application 2025-10-30
2 POWSM: A Phonetic Open Whisper-Style Speech Foundation Model 2025-10-28
3 BEST-RQ-Based Self-Supervised Learning for Whisper Domain Adaptation 2025-10-28
4 Audio Signal Processing Using Time Domain Mel-Frequency Wavelet Coefficient 2025-10-28
5 Are ASR foundation models generalized enough to capture features of regional dialects for low-resource languages? 2025-10-27
6 SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation 2025-09-01
7 Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task 2025-07-23
8 Speak & Spell: LLM-Driven Controllable Phonetic Error Augmentation for Robust Dialogue State Tracking 2024-09-10

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

No. Title Date
1 Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction 2025-10-30
2 JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting 2025-10-30
3 STITCH 2.0: Extending Augmented Suturing with EKF Needle Estimation and Thread Management 2025-10-29
4 GeVI-SLAM: Gravity-Enhanced Stereo Visual Inertial SLAM for Underwater Robots 2025-10-28
5 Capturing Head Avatar with Hand Contacts from a Monocular Video 2025-10-20
6 SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation 2025-10-18
7 Autonomous Legged Mobile Manipulation for Lunar Surface Operations via Constrained Reinforcement Learning 2025-10-14
8 Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting 2025-10-11
9 Wrist2Finger: Sensing Fingertip Force for Force-Aware Hand Interaction with a Ring-Watch Wearable 2025-10-05
10 When Tracking Fails: Analyzing Failure Modes of SAM2 for Point-Based Tracking in Surgical Videos 2025-10-02
11 Enabling High-Frequency Cross-Modality Visual Positioning Service for Accurate Drone Landing 2025-10-01
12 User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality 2025-09-30
13 MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training 2025-09-26
14 MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM 2025-09-25
15 Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization 2025-09-15
16 HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval 2025-05-26
17 Acoustic Neural 3D Reconstruction Under Pose Drift 2025-03-11
18 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
19 Matching Anything by Segmenting Anything 2024-06-06
20 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
21 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
22 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
23 Unifying Tracking and Image-Video Object Detection 2022-11-20
24 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
25 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

No. Title Date
1 4-Doodle: Text to 3D Sketches that Move! 2025-10-29
2 TurboPortrait3D: Single-step diffusion-based fast portrait novel-view synthesis 2025-10-27
3 TRELLISWorld: Training-Free World Generation from Object Generators 2025-10-27
4 VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator 2025-10-15
5 Generating Surface for Text-to-3D using 2D Gaussian Splatting 2025-10-08
6 OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects 2025-10-08
7 Towards Scalable and Consistent 3D Editing 2025-10-03
8 PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos 2025-09-29
9 ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports 2025-07-29
10 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
11 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
12 Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction 2025-05-27
13 Constructing a 3D Scene from a Single Image 2025-05-21
14 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
15 Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes 2025-03-19
16 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Part-based Procedural Models 2025-01-28
17 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction 2024-11-21
18 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
19 3D Audio-Visual Segmentation 2024-11-04
20 SceneComplete: Open-World 3D Scene Completion in Cluttered Real World Environments for Robot Manipulation 2024-10-31
21 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
22 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
23 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20

13. automated theorem proving/interactive theorem proving/formal verification

No. Title Date
1 Dissect-and-Restore: AI-based Code Verification with Transient Refactoring 2025-10-29
2 Adaptive Proof Refinement with LLM-Guided Strategy Selection 2025-10-29
3 VeriStruct: AI-assisted Automated Verification of Data-Structure Modules in Verus 2025-10-28
4 A Hamilton-Jacobi Reachability Framework with Soft Constraints for Safety-Critical Systems 2025-10-28
5 Formal Verification of a Token Sale Launchpad: A Compositional Approach in Dafny 2025-10-27
6 A Theorem-Proving-Based Evaluation of Neural Semantic Parsing 2025-10-13
7 Proof Strategy Extraction from LLMs for Enhancing Symbolic Provers 2025-10-11
8 L-Mosaics and Bounded Join-Semilattices in Isabelle/HOL 2025-09-24
9 An ACL2s Interface to Z3 2025-07-25
10 Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance 2025-07-02
11 Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs 2025-06-24
12 Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities 2025-05-19
13 APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning 2025-05-09
14 Efficient Formal Verification of Quantum Error Correcting Programs 2025-04-10
15 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
16 A Natural Homomorphism between the Model Constructions of the Completeness and Compactness Theorems 2025-03-19
17 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
18 Steering LLMs for Formal Theorem Proving 2025-02-21
19 InternLM2.5-StepProver: Advancing Automated Theorem Proving via Critic-Guided Search 2024-10-21
20 Galapagos: Automated N-Version Programming with LLMs 2024-08-18
21 A Certified Proof Checker for Deep Neural Network Verification in Imandra 2024-05-17
22 VerifIoU -- Robustness of Object Detection to Perturbations 2024-01-30
