Skip to content

每月论文更新 - 2025年06月02日 #23

@github-actions

Description

@github-actions

最后更新:2025-06-02 00:09

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(202篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs 2025-05-24
2 Efficient RL Training for Reasoning Models via Length-Aware Optimization 2025-05-18
3 Synthetic Data RL: Task Definition Is All You Need 2025-05-18
4 Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation 2025-05-16
5 RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles 2025-05-08
6 Toward Efficient Exploration by Large Language Model Agents 2025-04-29
7 Token-Efficient RL for LLM Reasoning 2025-04-29
8 On-Robot Reinforcement Learning with Goal-Contrastive Rewards 2024-10-25

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Sequential Monte Carlo for Policy Optimization in Continuous POMDPs 2025-05-22
2 Learning POMDPs with Linear Function Approximation and Finite Memory 2025-05-20
3 Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses 2025-05-19
4 Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents 2025-05-16
5 Model Identification Adaptive Control with $ρ$-POMDP Planning 2025-05-14
6 Value Gradients with Action Adaptive Search Trees in Continuous (PO)MDPs 2025-03-15
7 POPGym Arcade: Parallel Pixelated POMDPs 2025-03-03
8 Map Space Belief Prediction for Manipulation-Enhanced Mapping 2025-02-28
9 Simplifying Complex Observation Models in Continuous POMDP Planning with Probabilistic Guarantees and Practice 2023-11-13

3. sparse reward reinforcement learning

序号 标题 日期
1 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
2 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
3 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
4 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
5 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
6 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08
7 Language Reward Modulation for Pretraining Reinforcement Learning 2023-08-23
8 Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning 2022-09-27

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 A Synthetic Business Cycle Approach to Counterfactual Analysis with Nonstationary Macroeconomic Data 2025-05-28
2 Causal Inference for Experiments with Latent Outcomes: Key Results and Their Implications for Design and Analysis 2025-05-28
3 MAMBO-NET: Multi-Causal Aware Modeling Backdoor-Intervention Optimization for Medical Image Segmentation Network 2025-05-28
4 A Bayesian approach to the survivor average causal effect in cluster-randomized crossover trials 2025-05-27
5 Generating Hypotheses of Dynamic Causal Graphs in Neuroscience: Leveraging Generative Factor Models of Observed Time Series 2025-05-27
6 Causality and "In-the-Wild" Video-Based Person Re-ID: A Survey 2025-05-26
7 Agentic AI Process Observability: Discovering Behavioral Variability 2025-05-26
8 Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation 2025-05-26
9 CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models 2025-05-22
10 Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds 2025-05-20
11 Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning 2025-05-19
12 On the Eligibility of LLMs for Counterfactual Reasoning: A Decompositional Study 2025-05-17
13 Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer 2025-05-14
14 An Identifiable Cost-Aware Causal Decision-Making Framework Using Counterfactual Reasoning 2025-05-13
15 Exogenous Isomorphism for Counterfactual Identifiability 2025-05-04
16 Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge 2025-04-08
17 Prediction-Powered E-Values 2025-02-06
18 PUATE: Efficient Average Treatment Effect Estimation from Treated (Positive) and Unlabeled Units 2025-01-31
19 Integer Programming for Generalized Causal Bootstrap Designs 2024-10-28
20 Bridging the Gap Between Data-Driven And Theory-Driven Modelling - Leveraging Causal Machine Learning for Integrative Modelling of Dynamical Systems 2024-10-12
21 Identifying perturbation targets through causal differential networks 2024-10-04
22 MotifDisco: Motif Causal Discovery For Time Series Motifs 2024-09-23
23 Improving Causal Inference with Measurement Errors in Exposures and Confounders: A New Method and Its Application to Air Pollution Exposure Assessment and Epidemiology 2024-05-13
24 Reconciling Overt Bias and Hidden Bias in Sensitivity Analysis for Matched Observational Studies 2023-11-19

6. video super resolution

序号 标题 日期
1 UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space 2025-05-26
2 DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution 2025-05-22
3 Hunyuan-Game: Industrial-grade Intelligent Game Creation Model 2025-05-20
4 Blind Restoration of High-Resolution Ultrasound Video 2025-05-20
5 Super-Resolution Generative Adversarial Networks based Video Enhancement 2025-05-14
6 GRNN:Recurrent Neural Network based on Ghost Features for Video Super-Resolution 2025-05-14
7 Rethinking Video Super-Resolution: Towards Diffusion-Based Methods without Motion Alignment 2025-03-05
8 DC-VSR: Spatially and Temporally Consistent Video Super-Resolution with Video Diffusion Prior 2025-02-05

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch 2025-05-29
2 Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better 2025-05-29
3 AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora 2025-05-29
4 Position Paper: Metadata Enrichment Model: Integrating Neural Networks and Semantic Knowledge Graphs for Cultural Heritage Applications 2025-05-29
5 Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking 2025-05-29
6 UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions 2025-05-29
7 Rethinking Regularization Methods for Knowledge Graph Completion 2025-05-29
8 From Parameters to Prompts: Understanding and Mitigating the Factuality Gap between Fine-Tuned LLMs 2025-05-29
9 Can Large Language Models Trigger a Paradigm Shift in Travel Behavior Modeling? Experiences with Modeling Travel Satisfaction 2025-05-29
10 Query Routing for Retrieval-Augmented Language Models 2025-05-29
11 Knowledge Distillation for Reservoir-based Classifier: Human Activity Recognition 2025-05-29
12 BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection 2025-05-28
13 RAD: Redundancy-Aware Distillation for Hybrid Models via Self-Speculative Decoding 2025-05-28
14 InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective 2025-05-28
15 CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation 2025-05-28
16 Revisiting Self-attention for Cross-domain Sequential Recommendation 2025-05-27
17 FCKT: Fine-Grained Cross-Task Knowledge Transfer with Semantic Contrastive Learning for Targeted Sentiment Analysis 2025-05-27
18 Spatiotemporal Causal Decoupling Model for Air Quality Forecasting 2025-05-26
19 Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language 2025-05-26
20 Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation 2025-05-26
21 Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks 2025-05-26
22 Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments 2025-05-25
23 Disentangling Knowledge Representations for Large Language Model Editing 2025-05-24
24 Automated Capability Evaluation of Foundation Models 2025-05-22
25 SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images 2025-05-22
26 Disentangled Multi-span Evolutionary Network against Temporal Knowledge Graph Reasoning 2025-05-20
27 Language-Specific Latent Process Hinders Cross-Lingual Performance 2025-05-19
28 LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners 2025-05-17
29 Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage 2025-05-13
30 SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement 2025-04-10
31 Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models 2025-04-07
32 Universal Item Tokenization for Transferable Generative Recommendation 2025-04-06
33 Semantic Web and Software Agents -- A Forgotten Wave of Artificial Intelligence? 2025-03-20
34 BYOS: Knowledge-driven Large Language Models Bring Your Own Operating System More Excellent 2025-03-12
35 Enhancing Semi-supervised Learning with Zero-shot Pseudolabels 2025-02-18
36 Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector 2025-02-08
37 A Reality Check on Context Utilisation for Retrieval-Augmented Generation 2024-12-22
38 Federated Continual Graph Learning 2024-11-28
39 Leveraging Large Language Models for Relevance Judgments in Legal Case Retrieval 2024-03-27

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Circular Game Coloring of Signed Graphs 2025-05-27
2 Computational and Algebraic Structure of Board Games 2025-02-18
3 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
4 Temperatures of Robin Hood 2025-01-13
5 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
6 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
7 Maker-Breaker on Galton-Watson trees 2024-12-11
8 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
9 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
10 A New 0(klog n) Algorithm for Josephus Problem 2024-11-10
11 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
12 An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning 2024-09-20
13 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
14 Shogi and Frieze group 2023-11-15
15 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
16 Niel's Chess -- Rules for Xiangqi 2023-06-27
17 On the complexity of Dark Chinese Chess 2021-12-06
18 A Note on Hardness Frameworks and Computational Complexity of Xiangqi and Janggi 2019-03-30
19 Comparison Training for Computer Chinese Chess 2018-01-23

9. code llm

序号 标题 日期
1 From Output to Evaluation: Does Raw Instruction-Tuned Code LLMs Output Suffice for Fill-in-the-Middle Code Generation? 2025-05-24
2 AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios 2025-05-22
3 Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals 2025-05-20
4 Is Compression Really Linear with Code Intelligence? 2025-05-16
5 ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation 2025-01-11
6 Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey 2024-12-29
7 GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding 2024-09-06
8 WizardCoder: Empowering Code Large Language Models with Evol-Instruct 2023-06-14

10. speech recognition

序号 标题 日期
1 Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection 2025-05-29
2 Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation 2025-05-29
3 AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition 2025-05-29
4 NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding 2025-05-28
5 Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition 2025-05-28
6 VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining 2025-05-23
7 An Effective Training Framework for Light-Weight Automatic Speech Recognition Models 2025-05-22
8 Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision 2025-02-26

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 Pose-free 3D Gaussian splatting via shape-ray estimation 2025-05-29
2 TwinTrack: Bridging Vision and Contact Physics for Real-Time Tracking of Unknown Dynamic Objects 2025-05-28
3 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians 2025-05-28
4 MultiFormer: A Multi-Person Pose Estimation System Based on CSI and Attention Mechanism 2025-05-28
5 Event-based Egocentric Human Pose Estimation in Dynamic Environment 2025-05-28
6 ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction 2025-05-27
7 HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval 2025-05-26
8 Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes 2025-05-21
9 SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity 2025-05-15
10 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field 2025-04-28
11 Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance 2025-02-17
12 Semantics-aware Test-time Adaptation for 3D Human Pose Estimation 2025-02-15
13 Towards Better Robustness: Pose-Free 3D Gaussian Splatting for Arbitrarily Long Videos 2025-01-25
14 ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Using Lightweight Polygon Maps 2024-09-23
15 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
16 Matching Anything by Segmenting Anything 2024-06-06
17 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
18 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
19 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
20 Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer 2023-03-16
21 Unifying Tracking and Image-Video Object Detection 2022-11-20
22 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
23 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object 2025-05-29
2 Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction 2025-05-27
3 ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction 2025-05-26
4 Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling 2025-05-26
5 Constructing a 3D Town from a Single Image 2025-05-21
6 PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting 2025-05-21
7 Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image 2025-05-20
8 Generating Digital Models Using Text-to-3D and Image-to-3D Prompts: Critical Case Study 2025-05-17
9 SOAP: Style-Omniscient Animatable Portraits 2025-05-08
10 DiMeR: Disentangled Mesh Reconstruction Model 2025-04-24
11 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
12 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models 2025-03-27
13 Mean-Shift Distillation for Diffusion Mode Seeking 2025-02-21
14 Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics 2025-02-05
15 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
16 MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation 2024-12-04
17 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
18 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
19 SeMv-3D: Towards Concurrency of Semantic and Multi-view Consistency in General Text-to-3D Generation 2024-10-10
20 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
21 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models 2024-09-30
22 GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation 2024-09-27
23 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20
24 Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints 2023-10-05

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning 2025-05-29
2 Autoformalization in the Era of Large Language Models: A Survey 2025-05-29
3 Towards LLM-based Generation of Human-Readable Proofs in Polynomial Formal Verification 2025-05-29
4 Structural Abstraction and Selective Refinement for Formal Verification 2025-05-29
5 Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving 2025-05-27
6 Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks 2025-05-26
7 Out of the Shadows: Exploring a Latent Space for Neural Network Verification 2025-05-23
8 HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement 2025-05-21
9 MIRB: Mathematical Information Retrieval Benchmark 2025-05-21
10 Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities 2025-05-19
11 LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation 2025-05-17
12 Artificial Intelligence in Number Theory: LLMs for Algorithm Generation and Ensemble Methods for Conjecture Verification 2025-04-28
13 On Coalgebraic Product Constructions for Markov Chains and Automata 2025-04-09
14 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
15 MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving 2025-03-05
16 Faithful Logic Embeddings in HOL -- Deep and Shallow 2025-02-26
17 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
18 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
19 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
20 Tableaux for Automated Reasoning in Dependently-Typed Higher-Order Logic (Extended Version) 2024-10-18
21 BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving 2024-03-06
22 Magnushammer: A Transformer-Based Approach to Premise Selection 2023-03-08

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions