Skip to content

每月论文更新 - 2025年08月02日 #25

@github-actions

Description

@github-actions

最后更新:2025-08-02 00:09

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(197篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 MindSpeed RL: Distributed Dataflow for Scalable and Efficient RL Training on Ascend NPU Cluster 2025-07-25
2 Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models 2025-07-23
3 Efficient RL for optimizing conversation level outcomes with an LLM-based tutor 2025-07-22
4 Statistical and Algorithmic Foundations of Reinforcement Learning 2025-07-19
5 Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning 2025-07-09
6 Improving Transformer World Models for Data-Efficient RL 2025-02-03
7 Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL 2024-10-22
8 RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$ 2023-06-28

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Partially Observable Monte-Carlo Graph Search 2025-07-28
2 Hybrid quantum-classical algorithm for near-optimal planning in POMDPs 2025-07-24
3 Joint Multi-Target Detection-Tracking in Cognitive Massive MIMO Radar via POMCP 2025-07-23
4 Partially Observable Reference Policy Programming: Solving POMDPs Sans Numerical Optimisation 2025-07-16
5 Coordinated Communication and Inventory Optimization in Multi-Retailer Supply Chains 2025-07-12
6 Age-Aware CSI Acquisition of a Finite-State Markovian Channel 2025-07-07
7 Reinforcement Learning under State and Outcome Uncertainty: A Foundational Distributional Perspective 2025-05-10
8 Goal-Oriented Remote Tracking Through Correlated Observations in Pull-based Communications 2025-03-17
9 Dynamic Information Manipulation Game 2023-12-13

3. sparse reward reinforcement learning

序号 标题 日期
1 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
2 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
3 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
4 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
5 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
6 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
7 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08
8 Language Reward Modulation for Pretraining Reinforcement Learning 2023-08-23

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 Relative Bias Under Imperfect Identification in Observational Causal Inference 2025-07-31
2 Incorporating structural uncertainty in causal decision making 2025-07-31
3 Causal Reasoning in Pieces: Modular In-Context Learning for Causal Discovery 2025-07-31
4 Risk-inclusive Contextual Bandits for Early Phase Clinical Trials 2025-07-30
5 Dimension Reduction for Conditional Density Estimation with Applications to High-Dimensional Causal Inference 2025-07-30
6 Hybrid Causal Identification and Causal Mechanism Clustering 2025-07-29
7 Causal Link Discovery with Unequal Edge Error Tolerance 2025-07-29
8 From Observations to Causations: A GNN-based Probabilistic Prediction Framework for Causal Discovery 2025-07-27
9 Causal Inference for Circular Data 2025-07-26
10 Probably Approximately Correct Causal Discovery 2025-07-25
11 SMARTAPS: Tool-augmented LLMs for Operations Management 2025-07-23
12 Causal Mechanism Estimation in Multi-Sensor Systems Across Multiple Domains 2025-07-23
13 Canonical Representations of Markovian Structural Causal Models: A Framework for Counterfactual Reasoning 2025-07-22
14 CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models 2025-07-21
15 LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning 2025-07-11
16 CRED: Counterfactual Reasoning and Environment Design for Active Preference Learning 2025-07-07
17 Boosting Temporal Sentence Grounding via Causal Inference 2025-07-07
18 Counterfactual Tuning for Temporal Sensitivity Enhancement in Large Language Model-based Recommendation 2025-07-03
19 Lower Bounds on the Size of Markov Equivalence Classes 2025-06-26
20 Adaptive sample splitting for randomization tests 2025-04-30
21 What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning 2025-03-27
22 A Framework for Covariate-Adjusted Bivariate Causal Discovery 2025-02-14

6. video super resolution

序号 标题 日期
1 RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution 2025-07-25
2 DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution 2025-07-01
3 TurboVSR: Fantastic Video Upscalers and Where to Find Them 2025-06-30
4 SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution 2025-06-24
5 ICME 2025 Grand Challenge on Video Super-Resolution for Video Conferencing 2025-06-13
6 FedVSR: Towards Model-Agnostic Federated Learning in Video Super-Resolution 2025-03-17
7 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results 2024-11-11
8 RefVSR++: Exploiting Reference Inputs for Reference-based Video Super-resolution 2023-07-06

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Rule2Text: Natural Language Explanation of Logical Rules in Knowledge Graphs 2025-07-31
2 GraphRAG-R1: Graph Retrieval-Augmented Generation with Process-Constrained Reinforcement Learning 2025-07-31
3 DICE: Dynamic In-Context Example Selection in LLM Agents via Efficient Knowledge Transfer 2025-07-31
4 Mitigating Resolution-Drift in Federated Learning: Case of Keypoint Detection 2025-07-31
5 Beyond Linear Bottlenecks: Spline-Based Knowledge Distillation for Culturally Diverse Art Style Classification 2025-07-31
6 FairReason: Balancing Reasoning and Social Bias in MLLMs 2025-07-30
7 GeoOutageKG: A Multimodal Geospatiotemporal Knowledge Graph for Multiresolution Power Outage Analysis 2025-07-30
8 DBLPLink 2.0 -- An Entity Linker for the DBLP Scholarly Knowledge Graph 2025-07-30
9 Bridging the Gap in Missing Modalities: Leveraging Knowledge Distillation and Style Matching for Brain Tumor Segmentation 2025-07-30
10 Towards Interpretable Renal Health Decline Forecasting via Multi-LMM Collaborative Reasoning Framework 2025-07-30
11 Is SHACL Suitable for Data Quality Assessment? 2025-07-30
12 Color as the Impetus: Transforming Few-Shot Learner 2025-07-29
13 Mitigating Spurious Correlations in Weakly Supervised Semantic Segmentation via Cross-architecture Consistency Regularization 2025-07-29
14 Cross-Architecture Distillation Made Simple with Redundancy Suppression 2025-07-29
15 Multi-Hypothesis Distillation of Multilingual Neural Translation Models for Low-Resource Languages 2025-07-29
16 On Explaining Visual Captioning with Hybrid Markov Logic Networks 2025-07-28
17 Finetuning Stellar Spectra Foundation Models with LoRA 2025-07-28
18 Ontology-Enhanced Knowledge Graph Completion using Large Language Models 2025-07-28
19 NIRS: An Ontology for Non-Invasive Respiratory Support in Acute Care 2025-07-26
20 HypKG: Hypergraph-based Knowledge Graph Contextualization for Precision Healthcare 2025-07-26
21 Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding 2025-07-24
22 BioGraphFusion: Graph Knowledge Embedding for Biological Completion and Reasoning 2025-07-19
23 An Ecosystem for Ontology Interoperability 2025-07-16
24 DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning 2025-07-16
25 FOUNDER: Grounding Foundation Models in World Models for Open-Ended Embodied Decision Making 2025-07-15
26 A Brain Tumor Segmentation Method Based on CLIP and 3D U-Net with Cross-Modal Semantic Guidance and Multi-Level Feature Fusion 2025-07-14
27 KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities 2025-07-10
28 Jelly: a Fast and Convenient RDF Serialization Format 2025-06-12
29 AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora 2025-05-29
30 VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models 2025-03-25
31 ($\boldsymbolθ_l, \boldsymbolθ_u$)-Parametric Multi-Task Optimization: Joint Search in Solution and Infinite Task Spaces 2025-03-11
32 VRM: Knowledge Distillation via Virtual Relation Matching 2025-02-28
33 Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving 2025-01-12
34 Fuse Before Transfer: Knowledge Fusion for Heterogeneous Distillation 2024-10-16
35 KIX: A Knowledge and Interaction-Centric Metacognitive Framework for Task Generalization 2024-02-08
36 Eywa: Automating Model Based Testing using LLMs 2023-12-11
37 One-stage Modality Distillation for Incomplete Multimodal Learning 2023-09-15

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning 2025-07-16
2 On 3-terminal positions in Hex 2025-07-11
3 A number game reconciliation 2025-07-07
4 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
5 Circular Game Coloring of Signed Graphs 2025-05-27
6 Computational and Algebraic Structure of Board Games 2025-02-18
7 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
8 Temperatures of Robin Hood 2025-01-13
9 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
10 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
11 Maker-Breaker on Galton-Watson trees 2024-12-11
12 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
13 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
14 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
15 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
16 Shogi and Frieze group 2023-11-15
17 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
18 Niel's Chess -- Rules for Xiangqi 2023-06-27
19 On the complexity of Dark Chinese Chess 2021-12-06

9. code llm

序号 标题 日期
1 AutoBridge: Automating Smart Device Integration with Centralized Platform 2025-07-31
2 IFEvalCode: Controlled Code Generation 2025-07-30
3 MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts? 2025-07-25
4 Improving Code LLM Robustness to Prompt Perturbations via Layer-Aware Model Editing 2025-07-22
5 Applying the Chinese Wall Reverse Engineering Technique to Large Language Model Code Editing 2025-07-21
6 CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection 2025-01-08
7 Selective Prompt Anchoring for Code Generation 2024-08-17
8 ShadowCode: Towards (Automatic) External Prompt Injection Attack against Code LLMs 2024-07-12

10. speech recognition

序号 标题 日期
1 Identifying Hearing Difficulty Moments in Conversational Audio 2025-07-31
2 Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR 2025-07-30
3 Tiny Noise-Robust Voice Activity Detector for Voice Assistants 2025-07-29
4 The Interspeech 2025 Speech Accessibility Project Challenge 2025-07-29
5 Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages 2025-05-20
6 BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition 2025-04-30
7 CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR 2025-02-27
8 Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization 2024-01-16

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction 2025-07-31
2 Mitigating Resolution-Drift in Federated Learning: Case of Keypoint Detection 2025-07-31
3 FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models 2025-07-31
4 From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras 2025-07-30
5 G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM 2025-07-24
6 Physics-based Human Pose Estimation from a Single Moving RGB Camera 2025-07-23
7 From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation 2025-07-17
8 UniLGL: Learning Uniform Place Recognition for FOV-limited/Panoramic LiDAR Global Localization 2025-07-16
9 Failure Forecasting Boosts Robustness of Sim2Real Rhythmic Insertion Policies 2025-07-09
10 DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation 2025-03-21
11 Humanoids in Hospitals: A Technical Study of Humanoid Robot Surrogates for Dexterous Medical Interventions 2025-03-17
12 FloPE: Flower Pose Estimation for Precision Pollination 2025-03-08
13 Tiny LiDARs for Manipulator Self-Awareness: Sensor Characterization and Initial Localization Experiments 2025-03-05
14 Category-level Meta-learned NeRF Priors for Efficient Object Mapping 2025-03-03
15 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
16 Matching Anything by Segmenting Anything 2024-06-06
17 Input-Output Extension of Underactuated Nonlinear Systems 2024-03-05
18 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
19 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
20 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
21 An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping 2023-01-02
22 Unifying Tracking and Image-Video Object Detection 2022-11-20
23 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
24 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction 2025-07-20
2 DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation 2025-07-18
3 PhysX-3D: Physical-Grounded 3D Asset Generation 2025-07-16
4 DreamArt: Generating Interactable Articulated Objects from a Single Image 2025-07-08
5 Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures 2025-07-07
6 Masks make discriminative models great again! 2025-07-01
7 Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space 2025-07-01
8 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
9 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
10 Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction 2025-05-27
11 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
12 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models 2025-03-27
13 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
14 Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation 2024-12-15
15 MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation 2024-12-04
16 A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision 2024-12-01
17 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
18 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
19 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
20 LLM2TEA: An Agentic AI Designer for Discovery with Generative Evolutionary Multitasking 2024-06-21
21 View Selection for 3D Captioning via Diffusion Ranking 2024-04-11
22 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving 2025-07-31
2 Concrete Security Bounds for Simulation-Based Proofs of Multi-Party Computation Protocols 2025-07-30
3 StepFun-Prover Preview: Let's Think and Verify Step by Step 2025-07-27
4 Cryptographic Data Exchange for Nuclear Warheads 2025-07-26
5 An ACL2s Interface to Z3 2025-07-25
6 On Automating Proofs of Multiplier Adder Trees using the RTL Books 2025-07-25
7 IsaMini: Redesigned Isabelle Proof Lanugage for Machine Learning 2025-07-25
8 The AlphaPhysics Term Rewriting System for Marking Algebraic Expressions in Physics Exams 2025-07-24
9 Formal Verification of the Safegcd Implementation 2025-07-23
10 LeanTree: Accelerating White-Box Proof Search with Factorized States in Lean 4 2025-07-19
11 Generalized Tree Edit Distance (GTED): A Faithful Evaluation Metric for Statement Autoformalization 2025-07-10
12 Quantifying Bounded Rationality: Formal Verification of Simon's Satisficing Through Flexible Stochastic Dominance 2025-07-02
13 Prover Agent: An Agent-based Framework for Formal Mathematical Proofs 2025-06-24
14 Automated Synthesis of Formally Verified Multi-Abstraction Function Summaries 2025-06-11
15 Statistical Runtime Verification for LLMs via Robustness Estimation 2025-04-24
16 Leanabell-Prover: Posttraining Scaling in Formal Reasoning 2025-04-08
17 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
18 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
19 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
20 Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs 2025-02-04
21 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
22 A Certified Proof Checker for Deep Neural Network Verification in Imandra 2024-05-17

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions