Skip to content

每月论文更新 - 2025年07月02日 #24

@github-actions

Description

@github-actions

最后更新:2025-07-02 00:08

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(198篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 When Can Model-Free Reinforcement Learning be Enough for Thinking? 2025-06-20
2 Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement 2025-06-18
3 LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment 2025-06-13
4 Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions 2025-06-08
5 Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals 2025-06-02
6 Token-Efficient RL for LLM Reasoning 2025-04-29
7 Improving Transformer World Models for Data-Efficient RL 2025-02-03

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Active Digital Twins via Active Inference 2025-06-17
2 Forward and Backward Simulations for Partially Observable Probability 2025-06-10
3 $τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment 2025-06-09
4 Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs 2025-05-14
5 Learning Attentive Neural Processes for Planning with Pushing Actions 2025-04-24
6 POPGym Arcade: Parallel Pixelated POMDPs 2025-03-03
7 Map Space Belief Prediction for Manipulation-Enhanced Mapping 2025-02-28
8 Limit-sure reachability for small memory policies in POMDPs is NP-complete 2024-12-01

3. sparse reward reinforcement learning

序号 标题 日期
1 SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning 2025-06-01
2 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning 2025-05-26
3 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs 2025-05-21
4 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
5 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
6 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
7 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08
8 Language Reward Modulation for Pretraining Reinforcement Learning 2023-08-23

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 When Additive Noise Meets Unobserved Mediators: Bivariate Denoising Diffusion for Causal Discovery 2025-06-29
2 Auto-Doubly Robust Estimation of Causal Effects on a Network 2025-06-29
3 P-CRE-DML: A Novel Approach for Causal Inference in Non-Linear Panel Data 2025-06-29
4 Token Activation Map to Visually Explain Multimodal LLMs 2025-06-29
5 Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications 2025-06-28
6 Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment 2025-06-27
7 Less Greedy Equivalence Search 2025-06-27
8 HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation 2025-06-26
9 Active Inference AI Systems for Scientific Discovery 2025-06-26
10 Lower Bounds on the Size of Markov Equivalence Classes 2025-06-26
11 Learning Causally Predictable Outcomes from Psychiatric Longitudinal Data 2025-06-19
12 An introduction to Causal Modelling 2025-06-19
13 Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues 2025-06-19
14 DeVisE: Behavioral Testing of Medical Large Language Models 2025-06-18
15 Resilient-native and Intelligent NextG Systems 2025-06-15
16 LLMs Struggle to Perform Counterfactual Reasoning with Parametric Knowledge 2025-06-15
17 What Makes Treatment Effects Identifiable? Characterizations and Estimators Beyond Unconfoundedness 2025-06-04
18 Local Markov Equivalence and Local Causal Discovery for Identifying Controlled Direct Effects 2025-05-05
19 Linear scaling causal discovery from high-dimensional time series by dynamical community detection 2025-01-18
20 Semiparametric Double Reinforcement Learning with Applications to Long-Term Causal Inference 2025-01-12
21 CauSkelNet: Causal Representation Learning for Human Behaviour Analysis 2024-09-23
22 Empirical evidence of Large Language Model's influence on human spoken communication 2024-09-03
23 A General Framework on Conditions for Constraint-based Causal Learning 2024-08-14
24 Quantification and cross-fitting inference of asymmetric relations under generative exposure mapping models 2023-11-08

6. video super resolution

序号 标题 日期
1 TurboVSR: Fantastic Video Upscalers and Where to Find Them 2025-06-30
2 VSRM: A Robust Mamba-Based Framework for Video Super-Resolution 2025-06-28
3 SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution 2025-06-24
4 One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution 2025-06-18
5 Super-Resolution Generative Adversarial Networks based Video Enhancement 2025-05-14
6 Spatial Degradation-Aware and Temporal Consistent Diffusion Model for Compressed Video Super-Resolution 2025-02-11
7 Low-Resource Video Super-Resolution using Memory, Wavelets, and Deformable Convolutions 2025-02-03
8 RefVSR++: Exploiting Reference Inputs for Reference-based Video Super-resolution 2023-07-06

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 The Trilemma of Truth in Large Language Models 2025-06-30
2 Efficient Interleaved Speech Modeling through Knowledge Distillation 2025-06-30
3 When Test-Time Adaptation Meets Self-Supervised Models 2025-06-30
4 Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification 2025-06-29
5 Context-Driven Knowledge Graph Completion with Semantic-Aware Relational Message Passing 2025-06-29
6 Flow-Modulated Scoring for Semantic-Aware Knowledge Graph Completion 2025-06-29
7 ReMem: Mutual Information-Aware Fine-tuning of Pretrained Vision Transformers for Effective Knowledge Distillation 2025-06-29
8 Beyond Code: The Multidimensional Impacts of Large Language Models in Software Development 2025-06-28
9 Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation 2025-06-27
10 Seismic resolution enhancement via deep Learning with Knowledge Distillation and Domain Adaptation 2025-06-27
11 Shifting Narratives: A Longitudinal Analysis of Media Trends and Public Attitudes on Homelessness 2025-06-26
12 Unveiling Causal Reasoning in Large Language Models: Reality or Mirage? 2025-06-26
13 Condensed Representation of RDF and its Application on Graph Versioning 2025-06-26
14 Distilling Normalizing Flows 2025-06-26
15 Towards Text-free Graph Foundation Models: Rethinking Multi-Domain Graph Contrastive Learning 2025-06-26
16 The role of preprints in open science: Accelerating knowledge transfer from science to technology 2025-06-25
17 KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs 2025-06-24
18 Generalizing vision-language models to novel domains: A comprehensive survey 2025-06-23
19 Dual-Forward Path Teacher Knowledge Distillation: Bridging the Capacity Gap Between Teacher and Student 2025-06-23
20 Action Language BC+ 2025-06-22
21 A Community-driven vision for a new Knowledge Resource for AI 2025-06-19
22 Approximation Fixpoint Theory with Refined Approximation Spaces 2025-06-19
23 KG-FGNN: Knowledge-guided GNN Foundation Model for Fertilisation-oriented Soil GHG Flux Prediction 2025-06-18
24 RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills 2025-06-17
25 Casper: Inferring Diverse Intents for Assistive Teleoperation with Vision Language Models 2025-06-17
26 Two-dimensional Taxonomy for N-ary Knowledge Representation Learning Methods 2025-06-05
27 Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking 2025-04-04
28 Mitigating Knowledge Discrepancies among Multiple Datasets for Task-agnostic Unified Face Alignment 2025-03-28
29 Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge 2025-02-18
30 Bridge: A Unified Framework to Knowledge Graph Completion via Language Models and Knowledge Representation 2024-11-11
31 Core Knowledge Deficits in Multi-Modal Language Models 2024-10-06
32 MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension 2024-09-20
33 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation 2024-09-13
34 HyperMono: A Monotonicity-aware Approach to Hyper-Relational Knowledge Representation 2024-04-15
35 ChatDBG: Augmenting Debugging with Large Language Models 2024-03-25
36 FedDTG:Federated Data-Free Knowledge Distillation via Three-Player Generative Adversarial Networks 2022-01-10

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Deep Reinforcement Learning Xiangqi Player with Monte Carlo Tree Search 2025-06-18
2 Circular Game Coloring of Signed Graphs 2025-05-27
3 Computational and Algebraic Structure of Board Games 2025-02-18
4 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
5 Temperatures of Robin Hood 2025-01-13
6 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
7 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
8 Maker-Breaker on Galton-Watson trees 2024-12-11
9 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
10 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
11 A New 0(klog n) Algorithm for Josephus Problem 2024-11-10
12 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
13 An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning 2024-09-20
14 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
15 Shogi and Frieze group 2023-11-15
16 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
17 Niel's Chess -- Rules for Xiangqi 2023-06-27
18 On the complexity of Dark Chinese Chess 2021-12-06
19 A Note on Hardness Frameworks and Computational Complexity of Xiangqi and Janggi 2019-03-30

9. code llm

序号 标题 日期
1 The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs 2025-06-23
2 CodeMorph: Mitigating Data Leakage in Large Language Model Assessment 2025-06-21
3 Re-Evaluating Code LLM Benchmarks Under Semantic Mutation 2025-06-20
4 From Output to Evaluation: Does Raw Instruction-Tuned Code LLMs Output Suffice for Fill-in-the-Middle Code Generation? 2025-05-24
5 LongCodeBench: Evaluating Coding LLMs at 1M Context Windows 2025-05-12
6 Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey 2024-12-29
7 Context-Augmented Code Generation Using Programming Knowledge Graphs 2024-10-09
8 Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models 2024-06-29

10. speech recognition

序号 标题 日期
1 Research on Comprehensive Classroom Evaluation System Based on Multiple AI Models 2025-06-29
2 Mind the Gap: Entity-Preserved Context-Aware ASR Structured Transcriptions 2025-06-28
3 Boosting CTC-Based ASR Using LLM-Based Intermediate Loss Regularization 2025-06-28
4 A Self-Training Approach for Whisper to Enhance Long Dysarthric Speech Recognition 2025-06-28
5 Speaker Targeting via Self-Speaker Adaptation for Multi-talker ASR 2025-06-27
6 Cross-lingual Data Selection Using Clip-level Acoustic Similarity for Enhancing Low-resource Automatic Speech Recognition 2025-06-27
7 AI-Generated Song Detection via Lyrics Transcripts 2025-06-23
8 State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition 2025-06-20

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 Validation of AI-Based 3D Human Pose Estimation in a Cyber-Physical Environment 2025-06-30
2 MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments 2025-06-30
3 TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints 2025-06-29
4 Deterministic Object Pose Confidence Region Estimation 2025-06-28
5 Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration 2025-06-27
6 What Makes a Dribble Successful? Insights From 3D Pose Tracking Data 2025-06-25
7 RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking 2025-06-20
8 Full-Pose Tracking via Robust Control for Over-Actuated Multirotors 2025-06-19
9 Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers 2025-06-11
10 Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes 2025-05-21
11 BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports 2025-02-28
12 PoI: A Filter to Extract Pixel of Interest from Novel View Synthesis for Scene Coordinate Regression 2025-02-07
13 G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation 2024-11-27
14 AsymDex: Asymmetry and Relative Coordinates for RL-based Bimanual Dexterity 2024-11-20
15 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
16 SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale 2024-06-11
17 Matching Anything by Segmenting Anything 2024-06-06
18 Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser 2024-03-07
19 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
20 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
21 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
22 Unifying Tracking and Image-Video Object Detection 2022-11-20
23 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
24 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation 2025-06-29
2 DreamAnywhere: Object-Centric Panoramic 3D Scene Generation 2025-06-25
3 3D Arena: An Open Platform for Generative 3D Evaluation 2025-06-23
4 DreamJourney: Perpetual View Generation with Video Diffusion Models 2025-06-21
5 Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching 2025-06-16
6 TexTailor: Customized Text-aligned Texturing via Effective Resampling 2025-06-12
7 EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence 2025-06-12
8 DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision 2025-06-11
9 R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation 2025-06-09
10 AI-powered Contextual 3D Environment Generation: A Systematic Review 2025-06-05
11 CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx 2025-06-05
12 Unify3D: An Augmented Holistic End-to-end Monocular 3D Human Reconstruction via Anatomy Shaping and Twins Negotiating 2025-04-25
13 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
14 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
15 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction 2024-11-21
16 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
17 SceneComplete: Open-World 3D Scene Completion in Cluttered Real World Environments for Robot Manipulation 2024-10-31
18 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
19 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
20 FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving 2024-08-13
21 SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrangements 2024-08-05
22 LLM2TEA: Agentic AI Designer Finds Innovative Objects with Generative Evolutionary Multitasking 2024-06-21
23 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 A Survey on Vision-Language-Action Models for Autonomous Driving 2025-06-30
2 What Challenges Do Developers Face When Using Verification-Aware Programming Languages? 2025-06-30
3 Universal Gluing and Contextual Choice: Categorical Logic and the Foundations of Analytic Approximation 2025-06-28
4 Can Large Language Models Help Students Prove Software Correctness? An Experimental Study with Dafny 2025-06-27
5 Diophantine Equations over $\mathbb Z$: Universal Bounds and Parallel Formalization 2025-06-26
6 The Composition of Digital Twins for Systems-of-Systems: a Systematic Literature Review 2025-06-25
7 Prover Agent: An Agent-based Framework for Formal Mathematical Proofs 2025-06-24
8 Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models 2025-06-13
9 StepProof: Step-by-step verification of natural language mathematical proofs 2025-06-12
10 The Alignment Trap: Complexity Barriers 2025-06-12
11 MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems? 2025-06-06
12 ProofNet++: A Neuro-Symbolic System for Formal Proof Verification with Self-Correction 2025-05-30
13 DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning 2025-05-29
14 Autoformalization in the Era of Large Language Models: A Survey 2025-05-29
15 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
16 Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving 2025-03-12
17 Faithful Logic Embeddings in HOL -- Deep and Shallow 2025-02-26
18 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
19 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
20 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
21 Tableaux for Automated Reasoning in Dependently-Typed Higher-Order Logic (Extended Version) 2024-10-18
22 A Certified Proof Checker for Deep Neural Network Verification in Imandra 2024-05-17
23 Magnushammer: A Transformer-Based Approach to Premise Selection 2023-03-08

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions