Skip to content

每月论文更新 - 2025年05月02日 #22

@github-actions

Description

@github-actions

最后更新:2025-05-02 00:10

本次更新执行命令

D:\a\MyAutoPapers\MyAutoPapers\target\release\my_auto_papers.exe --keywords=
             efficient RL,
             partial observable markov decision process/pomdp,sparse reward reinforcement learning,
             casual RL/counterfactual RL/casual reinforcement learning,
             causal inference/causal discovery/counterfactual reasoning,
             video super resolution,
             knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding,
             combinatorial game theory/xiangqi/chinese chess,
             code llm,
             speech recognition,
             zero shot tracking/few shot tracking/pose tracking/pose estimation,
             text to 3d/image to 3d/text to texture,
             automated theorem proving/interactive theorem proving/formal verification
              --exclude-keywords=multi-agent,multiagent --per-keyword-max-result=8

参数详解

  • 关键词:efficient RL, partial observable markov decision process/pomdp, sparse reward reinforcement learning, casual RL/counterfactual RL/casual reinforcement learning, causal inference/causal discovery/counterfactual reasoning, video super resolution, knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding, combinatorial game theory/xiangqi/chinese chess, code llm, speech recognition, zero shot tracking/few shot tracking/pose tracking/pose estimation, text to 3d/image to 3d/text to texture, automated theorem proving/interactive theorem proving/formal verification
  • 排除关键词:multi-agent, multiagent
  • 每关键词最大结果:8
  • 目标领域:cs, stat
  • 每关键词重试次数:3

论文汇总(204篇)

更好的阅读体验请访问 Github页面

1. efficient RL

序号 标题 日期
1 Toward Efficient Exploration by Large Language Model Agents 2025-04-29
2 Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator 2025-04-23
3 Tina: Tiny Reasoning Models via LoRA 2025-04-22
4 Handling Delay in Real-Time Reinforcement Learning 2025-03-30
5 Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces 2025-02-25
6 Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning 2024-10-29
7 Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL 2024-10-22
8 Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics 2024-06-17

2. partial observable markov decision process/pomdp

序号 标题 日期
1 Learning Attentive Neural Processes for Planning with Pushing Actions 2025-04-24
2 An Addendum to NeBula: Towards Extending TEAM CoSTAR's Solution to Larger Scale Environments 2025-04-18
3 Integrated Control and Active Perception in POMDPs for Temporal Logic Tasks and Information Acquisition 2025-04-17
4 An Efficient Reservation Protocol for Medium Access: When Tree Splitting Meets Reinforcement Learning 2025-04-03
5 Real-time Tracking System with Partially Coupled Sources 2025-03-27
6 Enhancing Robustness in Language-Driven Robotics: A Modular Approach to Failure Reduction 2024-11-08
7 When to Localize? A Risk-Constrained Reinforcement Learning Approach 2024-11-05
8 Belief-State Query Policies for User-Aligned POMDPs 2024-05-24
9 Online POMDP Planning with Anytime Deterministic Optimality Guarantees 2023-10-03
10 A Strong Duality Result for Constrained POMDPs with Multiple Cooperative Agents 2023-03-27
11 Decisiveness for countable MDPs and insights for NPLCSs and POMDPs 2020-08-24

3. sparse reward reinforcement learning

序号 标题 日期
1 Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model 2025-03-14
2 Hedging with Sparse Reward Reinforcement Learning 2025-03-06
3 Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations 2024-12-02
4 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning 2023-09-08
5 Language Reward Modulation for Pretraining Reinforcement Learning 2023-08-23
6 Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning 2022-09-27
7 Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning 2022-07-19

4. casual RL/counterfactual RL/casual reinforcement learning

序号 标题 日期
1 Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL 2025-02-18
2 Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation 2020-12-16

5. causal inference/causal discovery/counterfactual reasoning

序号 标题 日期
1 Conditional independence testing with a single realization of a multivariate nonstationary nonlinear time series 2025-04-30
2 Convergence rate for Nearest Neighbour matching: geometry of the domain and higher-order regularity 2025-04-30
3 Powerful randomization tests for subgroup analysis 2025-04-30
4 Multi-Domain Causal Discovery in Bijective Causal Models 2025-04-30
5 Artificial Intelligence for Personalized Prediction of Alzheimer's Disease Progression: A Survey of Methods, Data Challenges, and Future Directions 2025-04-29
6 A Hamiltonian Higher-Order Elasticity Framework for Dynamic Diagnostics(2HOED) 2025-04-29
7 Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models 2025-04-28
8 Probabilistic and Causal Satisfiability: Constraining the Model 2025-04-28
9 Inference with few treated units 2025-04-28
10 Causal-Copilot: An Autonomous Causal Analysis Agent 2025-04-17
11 A conceptual synthesis of causal assumptions for causal discovery and inference 2025-04-15
12 OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning 2025-04-06
13 When Counterfactual Reasoning Fails: Chaos and Real-World Complexity 2025-03-31
14 Counterfactual Situation Testing: From Single to Multidimensional Discrimination 2025-02-03
15 Automatic Double Reinforcement Learning in Semiparametric Markov Decision Processes with Applications to Long-Term Causal Inference 2025-01-12
16 Causal-discovery-based root-cause analysis and its application in time-series prediction error diagnosis 2024-11-11
17 A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery 2024-10-08
18 Higher order definition of causality by optimally conditioned transfer entropy 2024-08-30
19 Debiased Estimating Equation Method for Robust and Efficient Mendelian Randomization Using a Large Number of Correlated Weak and Invalid Instruments 2024-08-09
20 Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering 2024-06-02
21 A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty 2024-05-30
22 OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning 2024-05-02
23 Generating Pragmatic Examples to Train Neural Program Synthesizers 2023-11-09
24 Nonlinear Causal Discovery with Confounders 2023-02-07

6. video super resolution

序号 标题 日期
1 RepNet-VSR: Reparameterizable Architecture for High-Fidelity Video Super-Resolution 2025-04-22
2 Event-Enhanced Blurry Video Super-Resolution 2025-04-17
3 FedVSR: Towards Model-Agnostic Federated Learning in Video Super-Resolution 2025-03-17
4 Video Super-Resolution: All You Need is a Video Diffusion Model 2025-03-05
5 Low-Resource Video Super-Resolution using Memory, Wavelets, and Deformable Convolutions 2025-02-03
6 BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution 2025-01-19
7 Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution 2024-10-15
8 Local-Global Temporal Difference Learning for Satellite Video Super-Resolution 2023-04-10

7. knowledge graph/knowledge distillation/knowledge representation/knowledge transfer/knowledge embedding

序号 标题 日期
1 Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization 2025-04-30
2 Automatic Mapping of AutomationML Files to Ontologies for Graph Queries and Validation 2025-04-30
3 CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation 2025-04-30
4 Enhancing New-item Fairness in Dynamic Recommender Systems 2025-04-30
5 How to Backdoor the Knowledge Distillation 2025-04-30
6 Redundancy Analysis and Mitigation for Machine Learning-Based Process Monitoring of Additive Manufacturing 2025-04-30
7 Federated One-Shot Learning with Data Privacy and Objective-Hiding 2025-04-29
8 DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition 2025-04-29
9 Head-Tail-Aware KL Divergence in Knowledge Distillation for Spiking Neural Networks 2025-04-29
10 An Empirical Study on Common Defects in Modern Web Browsers Using Knowledge Embedding in GPT-4o 2025-04-29
11 Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective 2025-04-28
12 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task 2025-04-24
13 Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation 2025-04-24
14 Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts 2025-04-18
15 Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge 2025-04-17
16 Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification 2025-04-17
17 Coding-Prior Guided Diffusion Network for Video Deblurring 2025-04-16
18 Mutual Understanding between People and Systems via Neurosymbolic AI and Knowledge Graphs 2025-04-15
19 Language and Knowledge Representation: A Stratified Approach 2025-04-14
20 A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science 2025-04-14
21 HeteRAG: A Heterogeneous Retrieval-augmented Generation Framework with Decoupled Knowledge Representations 2025-04-12
22 Learning Optimal Prompt Ensemble for Multi-source Visual Prompt Transfer 2025-04-09
23 Knowledge Graph Completion with Relation-Aware Anchor Enhancement 2025-04-08
24 Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models 2025-04-07
25 Multimodal machine learning with large language embedding model for polymer property prediction 2025-03-29
26 Predicting clinical outcomes from patient care pathways represented with temporal knowledge graphs 2025-02-28
27 SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations 2025-02-24
28 Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification 2025-02-16
29 KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities 2024-12-31
30 GaGA: Towards Interactive Global Geolocation Assistant 2024-12-12
31 Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning 2024-11-23
32 Retrieval, Reasoning, Re-ranking: A Context-Enriched Framework for Knowledge Graph Completion 2024-11-12
33 Domain Consistency Representation Learning for Lifelong Person Re-Identification 2024-09-30
34 Cross-Lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models 2024-09-25
35 Historically Relevant Event Structuring for Temporal Knowledge Graph Reasoning 2024-05-17
36 ChatDBG: Augmenting Debugging with Large Language Models 2024-03-25
37 Embedding Ontologies via Incorporating Extensional and Intensional Knowledge 2024-01-20
38 Generative Meta-Learning for Zero-Shot Relation Triplet Extraction 2023-05-03

8. combinatorial game theory/xiangqi/chinese chess

序号 标题 日期
1 Computational and Algebraic Structure of Board Games 2025-02-18
2 RemoteChess: Enhancing Older Adults' Social Connectedness via Designing a Virtual Reality Chinese Chess (Xiangqi) Community 2025-02-17
3 Temperatures of Robin Hood 2025-01-13
4 On Conway's Numbers and Games, the Von Neumann Universe, and Pure Set Theory 2025-01-08
5 Complete Implementation of WXF Chinese Chess Rules 2024-12-23
6 Maker-Breaker on Galton-Watson trees 2024-12-11
7 Relationship between misère NIM and two-player GOISHI HIROI 2024-12-05
8 The Game Value of Sequential Compounds of Integers and Stars 2024-11-13
9 A New 0(klog n) Algorithm for Josephus Problem 2024-11-10
10 Mastering Chinese Chess AI (Xiangqi) Without Search 2024-10-07
11 An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning 2024-09-20
12 XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi 2024-07-05
13 Degrees are Useless in SNORT When Measuring Temperature 2024-06-04
14 Shogi and Frieze group 2023-11-15
15 JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games 2023-08-09
16 Niel's Chess -- Rules for Xiangqi 2023-06-27
17 On the complexity of Dark Chinese Chess 2021-12-06
18 A Note on Hardness Frameworks and Computational Complexity of Xiangqi and Janggi 2019-03-30
19 Comparison Training for Computer Chinese Chess 2018-01-23

9. code llm

序号 标题 日期
1 CrashFixer: A crash resolution agent for the Linux kernel 2025-04-29
2 EduBot -- Can LLMs Solve Personalized Learning and Programming Assignments? 2025-04-23
3 ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving 2025-04-23
4 Inducing Vulnerable Code Generation in LLM Coding Assistants 2025-04-22
5 LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs 2025-04-20
6 Risk Assessment Framework for Code LLMs via Leveraging Internal States 2025-04-20
7 On Benchmarking Code LLMs for Android Malware Analysis 2025-04-01
8 Less is More: Towards Green Code Large Language Models via Unified Structural Pruning 2024-12-20

10. speech recognition

序号 标题 日期
1 Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction 2025-04-30
2 A Comprehensive Part-of-Speech Tagging to Standardize Central-Kurdish Language: A Research Guide for Kurdish Natural Language Processing Tasks 2025-04-28
3 Kimi-Audio Technical Report 2025-04-25
4 Augmenting Captions with Emotional Cues: An AR Interface for Real-Time Accessible Communication 2025-04-24
5 Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation 2025-01-23
6 Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling 2024-09-25
7 Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction 2024-09-23
8 Mamba in Speech: Towards an Alternative to Self-Attention 2024-05-21

11. zero shot tracking/few shot tracking/pose tracking/pose estimation

序号 标题 日期
1 Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling 2025-04-30
2 Dance Style Recognition Using Laban Movement Analysis 2025-04-29
3 Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining 2025-04-29
4 A Survey on Event-based Optical Marker Systems 2025-04-29
5 PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking 2025-04-29
6 GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field 2025-04-28
7 Probabilistic Task Parameterization of Tool-Tissue Interaction via Sparse Landmarks Tracking in Robotic Surgery 2025-04-14
8 A Modular Edge Device Network for Surgery Digitalization 2025-03-18
9 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module 2025-01-15
10 Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos 2024-12-12
11 HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos 2024-11-28
12 Whole-body End-Effector Pose Tracking 2024-09-24
13 Faster Model Predictive Control via Self-Supervised Initialization Learning 2024-08-06
14 Matching Anything by Segmenting Anything 2024-06-06
15 Input-Output Extension of Underactuated Nonlinear Systems 2024-03-05
16 SignDiff: Diffusion Model for American Sign Language Production 2023-08-30
17 Multi-Scale Memory Comparison for Zero-/Few-Shot Anomaly Detection 2023-08-09
18 Zero-Shot Anomaly Detection with Pre-trained Segmentation Models 2023-06-15
19 APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD 2023-05-27
20 GarmentTracking: Category-Level Garment Pose Tracking 2023-03-24
21 Unifying Tracking and Image-Video Object Detection 2022-11-20
22 Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations 2022-06-21
23 The Multi-speaker Multi-style Voice Cloning Challenge 2021 2021-04-05
24 GroundSLAM: A Robust Visual SLAM System for Warehouse Robots Using Ground Textures 2017-10-16

12. text to 3d/image to 3d/text to texture

序号 标题 日期
1 CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback 2025-04-28
2 Making Physical Objects with Generative AI and Robotic Assembly: Considering Fabrication Constraints, Sustainability, Time, Functionality, and Accessibility 2025-04-27
3 SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models 2025-04-25
4 Unify3D: An Augmented Holistic End-to-end Monocular 3D Human Reconstruction via Anatomy Shaping and Twins Negotiating 2025-04-25
5 DiMeR: Disentangled Mesh Reconstruction Model 2025-04-24
6 Text-based Animatable 3D Avatars with Morphable Model Alignment 2025-04-22
7 TwoSquared: 4D Generation from 2D Image Pairs 2025-04-17
8 ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting 2025-04-14
9 Text To 3D Object Generation For Scalable Room Assembly 2025-04-12
10 CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading 2025-04-09
11 3D Gaussian Particle Approximation of VDB Datasets: A Study for Scientific Visualization 2025-04-07
12 AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction 2025-03-17
13 ProcTex: Consistent and Interactive Text-to-texture Synthesis for Procedural Models 2025-01-28
14 unPIC: A Geometric Multiview Prior for Image to 3D Synthesis 2024-12-13
15 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-04
16 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation 2024-10-24
17 Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-09
18 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models 2024-09-30
19 GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation 2024-09-27
20 Controlling Space and Time with Diffusion Models 2024-07-10
21 Garment3DGen: 3D Garment Stylization and Texture Generation 2024-03-27
22 FlashTex: Fast Relightable Mesh Texturing with LightControlNet 2024-02-20
23 DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation 2023-12-21

13. automated theorem proving/interactive theorem proving/formal verification

序号 标题 日期
1 Scenario-based Compositional Verification of Autonomous Systems with Neural Perception 2025-04-29
2 A Formal Framework for Naturally Specifying and Verifying Sequential Algorithms 2025-04-28
3 Automatic Goal Clone Detection in Rocq 2025-04-27
4 SynFuzz: Leveraging Fuzzing of Netlist to Detect Synthesis Bugs 2025-04-26
5 Towards Robust LLMs: an Adversarial Robustness Measurement Framework 2025-04-24
6 Canonical for Automated Theorem Proving in Lean 2025-04-08
7 Leanabell-Prover: Posttraining Scaling in Formal Reasoning 2025-04-08
8 A 2-Categorical Bridge Between Henkin Constructions and Lawvere's Fixed-Point Theorem: Unifying Completeness and Compactness 2025-04-04
9 Automated Discovery of Tactic Libraries for Interactive Theorem Proving 2025-03-31
10 A Natural Transformation between the Model Constructions of the Completeness and Compactness Theorems, Enhanced by Rigidity and 2-Categorical Strengthening 2025-03-19
11 Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving 2025-03-12
12 Faithful Logic Embeddings in HOL -- A recipe to have it all: deep and shallow, automated and interactive, heavy and light, proofs and counterexamples, meta and object level 2025-02-26
13 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction 2025-02-25
14 Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving 2025-02-11
15 Proving the Coding Interview: A Benchmark for Formally Verified Code Generation 2025-02-08
16 A Hybrid Deep Learning and Model-Checking Framework for Accurate Brain Tumor Detection and Validation 2024-12-31
17 HunyuanProver: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving 2024-12-30
18 Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification 2024-11-22
19 Learning Rules Explaining Interactive Theorem Proving Tactic Prediction 2024-11-02
20 A Unit Proofing Framework for Code-level Verification: A Research Agenda 2024-10-18
21 Tableaux for Automated Reasoning in Dependently-Typed Higher-Order Logic (Extended Version) 2024-10-18
22 BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving 2024-03-06
23 Trocq: Proof Transfer for Free, With or Without Univalence 2023-10-21
24 Magnushammer: A Transformer-Based Approach to Premise Selection 2023-03-08

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions